rgbj
#1
Hi,
I’d like to get kubernetes to detect failed nodes and recover faster by using the parameters outlined here: https://fatalfailure.wordpress.com/2016/06/10/improving-kubernetes-reliability-quicker-detection-of-a-node-down/. Namely, the article says:
kubelet: node-status-update-frequency=4s (from 10s)
controller-manager: node-monitor-period=2s (from 5s)
controller-manager: node-monitor-grace-period=16s (from 40s)
controller-manager: pod-eviction-timeout=30s (from 5m)
Is there a way with a rancher-based kubernetes installation to pass these parameters to the relevant components?
TIA for any help
1 Like
laurent
#2
Hi,
You can create your own template to do this or modify the existing.
Please have a look to the docker-compose.yaml file you will get it 
controller-manager:
command:
- kube-controller-manager
- --master=https://kubernetes.kubernetes.rancher.internal:6443
- --cloud-provider=${CLOUD_PROVIDER}
- --address=0.0.0.0
- --kubeconfig=/etc/kubernetes/ssl/kubeconfig
- --root-ca-file=/etc/kubernetes/ssl/ca.pem
- --service-account-private-key-file=/etc/kubernetes/ssl/key.pem
image: rancher/k8s:v1.5.1-rancher1-7
labels:
io.rancher.scheduler.affinity:host_label_soft: orchestration=true
io.rancher.container.create_agent: "true"
io.rancher.container.agent.role: environmentAdmin
links:
- kubernetes
Laurent
rgbj
#3
Thanks for your answer, I now see where I should look.