High Availability

High Availability

For production workloads, Linkerd’s control plane can run in high availability(HA) mode. This mode:

Runs three replicas of critical control plane components.
Sets production-ready CPU and memory resource requests on control planecomponents.
Sets production-ready CPU and memory resource requests on data plane proxies
Requires that the proxy auto-injector befunctional for any pods to be scheduled.
Sets anti-affinitypolicieson critical control plane components to achieve, if possible, that they arescheduled on separate nodes and in separate zones by default. Optionally,the scheduling on separate nodes can be made a hard requirement using a flag.

Enabling HA

You can enable HA mode at control plane installation time with the —ha flag:

linkerd install --ha | kubectl apply -f

You can override certain aspects of the HA behavior at installation time bypassing other flags to install. For example, you can override the number ofreplicas for critical components with the —controller-replicas flag:

linkerd install --ha --controller-replicas=2 | kubectl apply -f

To ensure that Linkerd does not get in the way of system critical workloadsstarting, the proxy injector should be disabled for the kube-system namespaceby running:

kubectl label namespace kube-system config.linkerd.io/admission-webhooks=disabled

See the full install CLI documentation forreference.

Critical components

Replication and anti-affinity rules are applied to all controlplane components except Prometheus, Grafana, and the web service, which areconsidered non-critical.

Caveats

HA mode assumes that there are always at least three nodes in the Kubernetescluster. If this assumption is violated (e.g. the cluster is scaled down totwo or fewer nodes), then the system will likely be left in a non-functionalstate.