HA for master nodes

kamlesh · October 4, 2019, 11:14am

hi, i am very new to rancher and kubernetes. I want to create HA of multiple master nodes using rancher GUI?
please, let me know the procedure.

thanks

superseb · October 4, 2019, 11:40am

HA install for Rancher is documented at https://rancher.com/docs/rancher/v2.x/en/installation/ha/, creating production ready clusters within Rancher is documented at https://rancher.com/docs/rancher/v2.x/en/cluster-provisioning/production/

kamlesh · October 7, 2019, 5:04am

just want to confirm that whether Layer 4 load balancer(tcp) works perfectly fine on VM machines? I read somewhere it won’t supports. only layer 7 loadbalancer supports this.

yeti · October 7, 2019, 4:21pm

Create your cluster using RKE, andin your cluster.yaml declare three nodes that are control plane. Its that easy.

Ex:

nodes:

address: “yourserver1.dev.yourcompany.com”
port: “22”
role: [etcd,controlplane]
user: rancher
address: “yourserver2.dev.yourcompany.com”
port: “22”
role: [etcd,controlplane]
user: rancher
address: “yourserver3.dev.yourcompany.com”
port: “22”
role: [etcd,controlplane]
user: rancher

##then declare al your worker nodes

Fraser_Goffin · October 8, 2019, 7:50am

An L4 load balancer definitely works and is recommended. We deploy to AWS and use an NLB.

kamlesh · October 9, 2019, 4:32am

hello yeti,
thanks for immediate reply.
As of now I have created 2 master nodes and 1 worker node.
now i have to test certain cases like :

if my first master node gets down, then whether the second master node is able to take entire load? moreover, i need to confirm is it ok to test this with 2 master node or it requires 3 master nodes ?

superseb · October 9, 2019, 9:16am

Please read the documentation linked, in https://rancher.com/docs/rancher/v2.x/en/cluster-provisioning/production/#count-of-etcd-nodes it clearly states 2 etcd nodes does not give you fault tolerance.

yeti · October 9, 2019, 5:24pm

@ [kamlesh] It is generally good practice to always use an odd number of masters, as the control-plane nodes perform leader elections.

Leader election is the mechanism that guarantees that only one instance of the kube-scheduler — or one instance of the kube-controller-manager — is actively making decisions, while all the other instances are inactive, but ready to take leadership if something happens to the active one.

JeepGuy · October 11, 2019, 9:58am

THANK YOU !!!
I thought so but all the K8s docs say you only need two Masters… Do you have any reference to validate that the Masters perform leader election?
Jim

javierriera97 · October 11, 2019, 1:57pm

There you go! https://rancher.com/docs/rancher/v2.x/en/troubleshooting/kubernetes-resources/#kubernetes-controller-manager-leader

yeti · October 11, 2019, 2:55pm

https://medium.com/michaelbi-22303/deep-dive-into-kubernetes-simple-leader-election-3712a8be3a99 & others. Just google it

vincent · October 12, 2019, 8:32pm

Words are getting conflated here. There is nothing we call a “master” in Rancher, nodes have the “control plane” or “etcd” role.

etcd has leader election and a "master " inside of itself. You should always have an odd number etcd nodes. There is no reason to ever have an even number except temporarily during a failure or on the way up (or down) to the next odd number; even is strictly worse than odd. And 2 is the absolute worst number to have, because you still have no fault tolerance (if either goes down you have no quorum) but have introduced twice as many hard drives, power supplies, NICs, DIMMs, CPUs etc that could fail.

Control plane nodes talk to etcd, provide the API, and tell worker nodes to do things. More than one provides redundancy in case one fails (and can sometimes horizontally scale load). You do not need an odd number of them. If you have more than one then you need a load balancer or DNS round-robin to distribute requests from users/nodes to the healthy control plane nodes.

kamlesh · October 14, 2019, 4:39am

hi,

i am little bit confuse regarding number of control plane and etcd required for HA of master. currently i have updated my cluster with 3 master node (each having 1 etcd role and 1 control plane role) and 1 worker node (which has 1 worker role only).
is it right to move forward ?
or some ground level changes still required before start with installation.

vincent · October 14, 2019, 5:03am

Again, there is nothing called a “master”. To survive the failure of any one node, you want:

3 or 5 nodes with the etcd roles
2 or more control plane
2 or more worker

A single node can have one or more of those roles (i.e. 3 nodes with all 3 roles satisfies the above). Combining etcd and control plane together is common.

kamlesh · October 17, 2019, 3:48am

can we put roles (etcd, control plane, and worker ) on the same node? will they work fine ?

Fraser_Goffin · October 22, 2019, 12:20pm

Yes that will work. You may want to think about the potential consequences though, ie you have less resilience and the possibility that problems with one component will adversely impact the the others. There is also clearly a difference in how you scale this set up if you were to find that any of the components have different resource usage profiles than others (hint, they do).

Anyway, your requirements are your own so that’s what should inform your choices. Technically speaking multi-role nodes are definitely supported.

kamlesh · October 29, 2019, 7:47am

hey, i m using this link:
(https://rancher.com/docs/rancher/v2.x/en/installation/ha/)
for rancher HA.
here it is mentioned that it is required to install tools namely : RKE, kubectl,helm. As per the doc we are installing kubernetes using RKE. so below are my queries regarding the same :

on which nodes (like i have 1 load balancer node, 3 ingress controller nodes, 1 worker node) these tools (RKE, Kubectl, helm) are to be installed?
if kubernetes is installed using RKE then is it necessary to install kubectl separately on each node?

Fraser_Goffin · November 5, 2019, 9:20pm

Those are client side tools so whilst you may choose to install them on your worker of management nodes, more typically you will use whatever your CI/CD platform of choice is to create deployment pipelines that could use helm, vanilla kubectl and rke. Helm 2 is slightly different in the sense that you can install tiller on your nodes. However that’s not a requirement and many people today regard tiller as a security vulnerability (although it is possible to mitigate that in a number of ways). Personally speaking, we have already moved over to Helm 3 which has recently moved to release candidate status.

jpeake · November 6, 2019, 5:33pm

Also it is important to understand if you are referring to HA for Rancher Server itself (the “local” cluster) or you already have Rnacher running and are creating a workload cluster.

For Rancher HA cluster, you will have all three roles on each node (and should have three nodes, or 5,7,9 if you wanna get crazy). But only Rancher Server runs on this cluster (plus the K8s components)

For a workload cluster managed by Rancher, a common config is 3 nodes with “etc” and “control plane” and then additional nodes with only “worker”.

kamlesh · November 14, 2019, 5:21am

hey, thanks guys for your support.
I have deployed the HA rancher successfully. let me tell you about the cluster that I formed

1 load balancer which is a separated node.
3 ingress nodes (having roles etcd, controlpane) I configured.
1 worker node.

now as per the docs
setup is done successfully, you can view the status of the pods.

[high@loadbalancer creating_cluster]$ kubectl -n cattle-system get pods
NAME READY STATUS RESTARTS AGE
rancher-85498c4d67-jncjx 1/1 Running 8 7d15h
rancher-85498c4d67-mtvb2 1/1 Running 8 7d15h
rancher-85498c4d67-trmtw 1/1 Running 9 7d15h

*note i have done below changes

disabled and stop firewalld service on all 5 nodes.
changed the web port for ngnix from 80 to other random port.

now I need to know how can I open rancher web portal?
i am trying using IP address of one of the ingress controller node.
but getting error : connection refused.

please help.

Topic		Replies	Views
Provision Multiple master node Rancher	3	2973	August 28, 2019
How are folks approaching HA with k8s clusters in production? Rancher	5	1700	June 19, 2019
Rancher HA setup Rancher	5	1006	May 17, 2019
Rancher 2.0 HA Architecture Question	0	657	June 19, 2018
Deploying HA kubernetes clusters Rancher	3	1212	October 25, 2018

HA for master nodes

Related topics