We now have a setup with 150 hosts and roughly 3k containers. Sometimes the scheduler service crashes, which puts Rancher under high load and causes healthcheck and disconnect problems for several minutes.
We want to increase the number of scheduler instances (to 2 or 3), but we are not really sure whether we can do that…
So the question is: can we have more than one scheduler instance running?
The scheduler crash is related to a timeout when talking to the metadata service. A similar bug was fixed in one of the last few versions, but there are still other problems related to this.