Hi, I’m new at both rancher and docker, but I noticed a high load in zabbix on my 3 nodes.
After debugging for two days I came across the network-services/metadata stack (new one in Infrastructure) and saw that the CPU load on all metadata services is around 200% or more.
This is the log:
16/02/2017 15:31:36time=“2017-02-16T14:31:36Z” level=info msg="Downloaded in 8.4477361s"16/02/2017 15:31:38time=“2017-02-16T14:31:38Z” level=info msg="Loading answers"16/02/2017 15:31:40time=“2017-02-16T14:31:40Z” level=info msg="Loaded answers"
And that happens every second.
The load is around 15-25 per server. More if something is upgraded or started in rancher.
Anyone know where I should look next? Googling didn’t yield any answers.
This seems really old, but I’m facing a similar issue using Rancher v1.5.10. When I restart or upgrade e Route53 DNS service, the metadata service CPU usage goes up (the network-services-metadata-dns container).
Short term no, when things change metadata is updated and the complete yaml file is sent to the host and parsed. Longer term (2.0) yes, diffs will be/are sent incrementally instead of the complete file.
It really seems some unnecessary work is being made. I imagine that the complete yaml file is not that large to incur a network usage of 150-200Mbits/s for a few seconds. Also looking forward for the news at 2.0.
We have 70 services right now, and we are growing every day. For the moment we need to change the architecture so this wont be happening.