We are trying to upgrade a RabbitMQ cluster that is built as a single service in Rancher (1.0.1). We have an appropriate health check in place and are using a batch size of 1 and an “in place” upgrade.
The behavior we’d like/expect to see is that each container gets a healthy replacement before the upgrade continues to the next container.
The behavior we are seeing is that only a single healthy container is kept around, and the upgrade continues on even while the replacement containers are still in the “initializing” state. As a result, we lose quorum, because the cluster drops to one healthy node during the upgrade.
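
To make the expected behavior concrete, here is a rough Python sketch of the loop we have in mind. It is only a simulation, not Rancher's code or API: `rolling_upgrade`, `simulate`, the `rabbit-N` names, and the "healthy after two polls" rule are all made up for illustration.

```python
from typing import Callable, List


def rolling_upgrade(
    nodes: List[str],
    batch_size: int,
    replace: Callable[[str], str],
    is_healthy: Callable[[str], bool],
    max_polls: int = 30,
) -> List[str]:
    """Replace nodes one batch at a time, waiting for each batch to be healthy."""
    result = list(nodes)
    for start in range(0, len(result), batch_size):
        end = min(start + batch_size, len(result))
        # Swap out every container in the current batch.
        for i in range(start, end):
            result[i] = replace(result[i])
        # Do not move on until every replacement reports healthy.
        for i in range(start, end):
            polls = 0
            while not is_healthy(result[i]):
                polls += 1
                if polls >= max_polls:
                    raise RuntimeError(f"{result[i]} never became healthy; upgrade halted")
    return result


def simulate(node_count: int = 3, batch_size: int = 1) -> int:
    """Return the lowest number of healthy nodes seen during a simulated upgrade."""
    nodes = [f"rabbit-{i}" for i in range(node_count)]  # hypothetical names
    healthy = set(nodes)
    polls_left = {}  # assumption: a new container fails its first 2 health polls
    lowest = [len(healthy)]

    def replace(name: str) -> str:
        healthy.discard(name)  # the old container is stopped
        new_name = name + "-v2"
        polls_left[new_name] = 2
        lowest[0] = min(lowest[0], len(healthy))
        return new_name

    def is_healthy(name: str) -> bool:
        if polls_left[name] > 0:
            polls_left[name] -= 1  # still "initializing"
            return False
        healthy.add(name)
        return True

    rolling_upgrade(nodes, batch_size, replace, is_healthy)
    return lowest[0]


if __name__ == "__main__":
    print("lowest healthy count, batch size 1:", simulate(3, 1))
    print("lowest healthy count, batch size 3:", simulate(3, 3))
```

In this sketch a three-node cluster upgraded with a batch size of 1 never has fewer than two healthy nodes, which is what would keep quorum. What we observe instead looks more like the wait-for-healthy step being skipped.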