We are running a setup with Rancher v1.1.4. Most of the times our setup is working as expected, however at times we see issues where containers across different hosts aren’t able to communicate with each other.
During such times ping from one container to another fails with 100% packet loss. The only way around is to restart network agent on troublesome host.
While trying to debug the issue I also found that there were lots of ICMP redirects happening even when containers were happily communicating with each other. Should this be of some concern as far as troubleshooting the issue is concerned?
I am completely clueless here, as network agent logs don’t reveal much and I am unable to make any sense from those ICMP redirects ?
Thanks much in advance for any help around this matter.
Cheers,
M