Bunch of containers looping "Updating-Active"

ybizeul · August 18, 2018, 4:18pm

Not sure what is going on, or how to debug this.

A bunch of containers are just cycling through “Updating-Active” every minute or so for no apparent reason. This causes the external DNS service to remove and recreate the entries, and I suspect Rancher is overutilizing the CPU as well.

Here is an example for one of the containers. Just to be clear, the containers aren’t restarted; it’s only the status that bounces.

ybizeul · August 21, 2018, 6:00pm

It turns out there were a couple of containers that were unhealthy.

These containers were not visible under the “Stacks” menu, even with Infrastructure services displayed and all frames open.

But when I went to the “Infrastructure” main menu, under “Containers” (the flat list of all the containers in Rancher), I saw that a newer version of dnsupdate-rfc2136 was Unhealthy, as was a second instance of scheduler-scheduler-1 (same version).

After deleting these two containers everything was back to normal.

ybizeul · August 22, 2018, 9:43am

I take that back: it started again, with nothing to be seen in any logs. As is very often the case in Rancher, some random issue appears out of nowhere with no way to troubleshoot it.

ybizeul · August 23, 2018, 7:56am

So it turns out the issue was very complex, and Rancher really doesn’t make it easy to troubleshoot.

It came down to the fact that I had my external DNS service updating a name that had already been defined manually in the DNS server. The name involved was that of the load balancer.
The broken DNS update probably caused a service metadata change on the load balancer, which in turn caused service updates on the containers it was in front of.

Short story: if you have something similar happening to you, and you use both DNS update and a load balancer, you might want to look there.
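
For anyone who wants to check for this kind of name collision up front, here is a minimal sketch in Python. It only compares two lists of names that you supply yourself; it does not query Rancher or the DNS server. The function names and the example.com records are hypothetical and made up for illustration, not taken from my setup.

```python
def normalize(name: str) -> str:
    """Lower-case a DNS name and drop the trailing dot so names compare equal."""
    return name.strip().rstrip(".").lower()


def find_conflicts(manual_records, managed_names):
    """Return the managed names that also exist as manually defined records."""
    manual = {normalize(n) for n in manual_records}
    managed = {normalize(n) for n in managed_names}
    return sorted(manual & managed)


# Hypothetical example data: records created by hand in the DNS server,
# and names the external DNS update service is configured to manage.
manual_records = ["lb.example.com.", "mail.example.com."]
managed_names = ["LB.example.com", "web.example.com"]

conflicts = find_conflicts(manual_records, managed_names)
print(conflicts)
```

Any name that shows up in the result is defined by hand and also managed by the DNS update service, which is the situation that bit me with the load balancer.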
