How to set interval stopping old container when doing in-service upgrade, (for handling zero downtime)

abudargo · May 1, 2017, 3:56pm

Hello, i wonder how to set interval stopping old contianer when doing in-service upgrade?

my goal purpose is for having zero downtime service upgrade using Rancher.

current implementation:

i use Rancher HAproxy as load balancer + kong (nginx-based) as proxy server + backend microservices (lets say it service A)
i use in-service upgrading, using start_first upgrade strategy

issue that i found:

i found my backend service (backend service is proxied by kong) still got 503 when doing upgrade with this strategy (start new container first, then stop old container)
old container instantly stopped after new container started
i found other service (kong) already could resolving backend service hostname to its new container ip address (i ssume it proves Rancher HAproxy update the network instantly)
but on kong (nginx) error log, its backend service upstream still resolving to old (stopped) container ip address, so it got unreacheable 503
i assume it is because dns-caching in nginx engine
so by reloading kong process, my backend service got 200 again (it clears the nginx dns-cache)

So based on this finding:

i wonder if i could set interval period for pending the old container stopped, after new container started, it would solve my problem (for let the kong dns-caching expired) ??
is there any strategy for my service upgrading?

Topic		Replies	Views
Wait time before killing containers during service upgrades Rancher 1.x	1	1186	February 20, 2016
Mechanics of rolling-update: is it robust? Rancher 1.x	1	952	July 26, 2016
Best way for zero downtime during Rancher version upgrades (with Cattle) Rancher 1.x	0	1353	March 3, 2017
Rancher upgrade service - download before restart Rancher 1.x	4	1101	October 21, 2016
Service Upgrade - stops too fast Rancher 1.x	4	1513	January 15, 2016