What are the best practices when rebooting a Longhorn host?
History:
We have a 3-node Kubernetes cluster running Rancher 2.0 on top. We had to reboot one of the hosts for maintenance, so we drained the node in Kubernetes and powered it down. When we powered it back up, we noticed in Longhorn that, for our volumes with 3 replicas specified, two replicas were on one host and the third was on the second host. Longhorn did not migrate the replica back to the third host after it came back up.
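For reference, the maintenance steps looked roughly like this (a minimal sketch; the node name is hypothetical and the exact drain flags depend on your kubectl version and workloads):

```
# Cordon the node and evict its pods before maintenance
kubectl drain node-2 --ignore-daemonsets --delete-local-data

# ... power down, perform maintenance, power back up ...

# Allow pods to be scheduled onto the node again
kubectl uncordon node-2
```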
Is there a way to tell Longhorn to always keep the replicas on different hosts?
Longhorn tries to avoid disrupting the data path unless it’s necessary (e.g. a host has been lost). In your case, the replica on the powered-down host is considered lost because it’s not reachable while you reboot the host, so Longhorn rebuilt that replica on another available host to keep the volume healthy.
Longhorn won’t automatically detect that the host is available again and move the load back to it. Maybe we can add a feature later to auto-balance the load between nodes, but it would be pretty costly to migrate the data constantly.
For now, since you have 3 replicas, you can deliberately delete one of the two replicas that share a node. This triggers Longhorn’s rebuild process; the scheduler looks at the current state of the nodes and should decide that the third node is a better place for the new replica, due to the soft anti-affinity rule we’ve set.
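A minimal sketch of how to do that with kubectl, assuming the Longhorn Replica objects are exposed as the replicas.longhorn.io CRD in the longhorn-system namespace (the CRD group, namespace, and field names vary by Longhorn version, and the replica name below is hypothetical):

```
# List replicas with the volume and node each one is scheduled on
# (field paths are assumptions based on the Longhorn Replica CRD)
kubectl -n longhorn-system get replicas.longhorn.io \
  -o custom-columns=NAME:.metadata.name,VOLUME:.spec.volumeName,NODE:.spec.nodeID

# Delete one of the two replicas that share a node; Longhorn rebuilds it,
# and soft anti-affinity should schedule the rebuilt replica on the free node
kubectl -n longhorn-system delete replicas.longhorn.io pvc-xxxx-r-abcdef
```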
After we add support for updating the replica count of a volume (https://github.com/rancher/longhorn/issues/299), you should be able to increase the replica count temporarily to trigger the rebuild, then decrease it back to normal and remove the extra replica.
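Once that lands, the flow could look something like this (a sketch only; the volumes.longhorn.io CRD, the numberOfReplicas field, and the volume name are assumptions, not a confirmed API):

```
# Temporarily raise the replica count to trigger a rebuild on the free node
kubectl -n longhorn-system patch volumes.longhorn.io pvc-xxxx \
  --type merge -p '{"spec": {"numberOfReplicas": 4}}'

# Once the new replica is healthy, drop the count back to normal
kubectl -n longhorn-system patch volumes.longhorn.io pvc-xxxx \
  --type merge -p '{"spec": {"numberOfReplicas": 3}}'

# Then delete the leftover replica on the doubled-up node, as above
```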
For now, since you have 3 replicas, you can deliberately delete one of the two replicas that share a node. This triggers Longhorn’s rebuild process
That’s what we did to get it back to the third node. The problem is that if we have dozens or more volumes, it is going to be a lot of work to delete all of those replicas to get them moved back. And it’s unnecessary writing and deleting on the drives for what is only a temporary state change.
It would be nice if there were a hold-timer that prevents rebuilding a lost replica for a period of time, or something similar to the Ceph command ceph osd set noout, so that Longhorn won’t rebuild the replica while you are doing maintenance.
In Longhorn’s design, a replica is not supposed to be lost at any point; otherwise the volume becomes unhealthy and triggers Longhorn’s rebuild.
Temporarily stopping the rebuild is doable, but after the node reboots, Longhorn cannot reuse the same replica, because it’s already out of sync with the others.