Pods on Worker Node not reachable by 10.41.*.* IPs - Flannel down?

kkobald · April 29, 2020, 12:48pm

A few days ago some websites on my cluster were not available. I found out that pinging any of the 10.41.* IPs of the worker node did not work. Since I did not know what else to do, I restarted all servers, which solved the problem.
I am using Flannel as the network layer.

Q1:
How can I set up an alert that would warn me about such basic failures? I have Prometheus running at the cluster level and get etcd warnings once in a while that resolve themselves.

Q2:
I could see an increase in ingress response times in the Kubernetes Components Metrics at that very time. How can I get alerts about that?

Q3:
Can I do this at the cluster level? I do not want to manage monitoring for every single project.

I found the alert/monitoring doc, but that's not very helpful.
