Kubernetes

Apache Airflow aims to be a very Kubernetes-friendly project, and many users run Airflow from within a Kubernetes cluster in order to take advantage of the increased stability and autoscaling options that Kubernetes provides.

Helm Chart for Kubernetes

We maintain official Helm chart for Airflow that helps you define, install, and upgrade deployment. The Helm Chart uses official Docker image and Dockerfile that is also maintained and released by the community.

Kubernetes Executor

The Kubernetes Executor allows you to run all the Airflow tasks on Kubernetes as separate Pods.

KubernetesPodOperator

The KubernetesPodOperator allows you to create Pods on Kubernetes.

Pod Mutation Hook

The Airflow local settings file (airflow_local_settings.py) can define a pod_mutation_hook function that has the ability to mutate pod objects before sending them to the Kubernetes client for scheduling. It receives a single argument as a reference to pod objects, and is expected to alter its attributes.

This could be used, for instance, to add sidecar or init containers to every worker pod launched by KubernetesExecutor or KubernetesPodOperator.

  1. from kubernetes.client.models import V1Pod
  2. def pod_mutation_hook(pod: V1Pod):
  3. pod.metadata.annotations["airflow.apache.org/launched-by"] = "Tests"