I am trying to update my Rancher cluster using “rke up”, as recommended on https://rancher.com/docs/rke/latest/en/upgrades/. Problem is, it complains about 1 of 3 nodes:
: INFO[0189] Waiting for [kubelet] container to exit on host [r03.example.com] INFO[0189] Renaming container [kubelet] to [old-kubelet] on host [r03.example.com], try #1 INFO[0189] Starting container [kubelet] on host [r03.example.com], try #1 INFO[0189] Waiting for [kubelet] container to exit on host [r02.example.com] INFO[0189] Renaming container [kubelet] to [old-kubelet] on host [r02.example.com], try #1 INFO[0189] Waiting for [kubelet] container to exit on host [r01.example.com] INFO[0189] Renaming container [kubelet] to [old-kubelet] on host [r01.example.com], try #1 INFO[0189] Starting container [kubelet] on host [r02.example.com], try #1 INFO[0189] [worker] Successfully updated [kubelet] container on host [r03.example.com] INFO[0189] Removing container [old-kubelet] on host [r03.example.com], try #1 INFO[0189] Starting container [kubelet] on host [r01.example.com], try #1 INFO[0189] [healthcheck] Start Healthcheck on service [kubelet] on host [r03.example.com] INFO[0190] [worker] Successfully updated [kubelet] container on host [r02.example.com] INFO[0190] Removing container [old-kubelet] on host [r02.example.com], try #1 INFO[0190] [worker] Successfully updated [kubelet] container on host [r01.example.com] INFO[0190] Removing container [old-kubelet] on host [r01.example.com], try #1 INFO[0190] [healthcheck] Start Healthcheck on service [kubelet] on host [r02.example.com] INFO[0190] [healthcheck] Start Healthcheck on service [kubelet] on host [r01.example.com] FATA[0241] [workerPlane] Failed to bring up Worker Plane: [Failed to verify healthcheck: Failed to check http://localhost:10248/healthz for service [kubelet] on host [r03.example.com]: Get http://localhost:10248/healthz: Unable to access the service on localhost:10248. The service might be still starting up. Error: ssh: rejected: connect failed (Connection refused), log: I0326 06:52:19.557779 15320 balancer_conn_wrappers.go:127] pickfirstBalancer: HandleSubConnStateChange: 0xc0003d4040, READY]
AFAICT r03 is not starting up, so I wonder what could be done to fix this problem?