Edit Autoscaling Settings

This section explains how to edit autoscaling settings for workloads.

The autoscaling feature allows the system to automatically adjust the number of pod replicas in a workload so that the CPU and memory usage of its replicas stays close to the configured target values.
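As a rough illustration of this behavior, the following sketch assumes that the feature follows the replica calculation documented for the Kubernetes HorizontalPodAutoscaler: the desired count is the current count scaled by the ratio of actual to target usage, rounded up, and then limited to the allowed range. The function name, its parameters, and the 10% tolerance are assumptions made for this example and are not part of the KubeSphere console or API.

```python
import math


def desired_replicas(current_replicas, current_usage, target_usage,
                     min_replicas=1, max_replicas=1, tolerance=0.1):
    """Estimate the replica count an autoscaler would aim for.

    Hypothetical helper: assumes the HorizontalPodAutoscaler rule
    desired = ceil(current_replicas * current_usage / target_usage).
    """
    if target_usage <= 0:
        raise ValueError("target_usage must be positive")
    if min_replicas > max_replicas:
        raise ValueError("min_replicas must not exceed max_replicas")

    ratio = current_usage / target_usage
    # Assumed tolerance: skip scaling when usage is already close to the target.
    if abs(ratio - 1.0) <= tolerance:
        desired = current_replicas
    else:
        desired = math.ceil(current_replicas * ratio)

    # Keep the result between the minimum and maximum replica counts.
    return max(min_replicas, min(max_replicas, desired))


if __name__ == "__main__":
    # 4 replicas at 90% CPU against a 60% target: scale out.
    print(desired_replicas(4, 90, 60, min_replicas=1, max_replicas=10))
    # 4 replicas at 30% CPU against a 60% target: scale in.
    print(desired_replicas(4, 30, 60, min_replicas=1, max_replicas=10))
```

Under these assumptions, usage above the target raises the replica count and usage below the target lowers it, always within the configured minimum and maximum.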

Prerequisites

You must be a member of a cluster and have the Application Workload Management permission in that cluster. For more information, refer to “Cluster Members” and “Cluster Roles”.

Steps

  1. Log in to the KubeSphere web console as a user who has the Application Workload Management permission, and access your cluster.

  2. Click Application Workloads > Workloads in the left navigation pane.

  3. On the Workloads page, click Deployments, StatefulSets, or DaemonSets, then click the name of a workload in the list to open its details page.

  4. In the upper-left corner of the workload details page, select More > Edit Autoscaling.

  5. In the Autoscaling dialog, set the autoscaling parameters for the workload, then click OK.

    Parameter    Description

    Target CPU Usage

    Target CPU usage for all pod replicas in the workload, expressed as a percentage. When the actual CPU usage is higher than the target value, the system automatically increases the replica count; when it is lower, the system automatically decreases the replica count.

    Target Memory Usage

    Target memory usage for all pod replicas in the workload, measured in MiB. When the actual memory usage is higher than the target value, the system automatically increases the replica count; when it is lower, the system automatically decreases the replica count.

    Minimum Replicas

    The minimum allowed pod replica count, with a default value of 1.

    Maximum Replicas

    The maximum allowed pod replica count, with a default value of 1.
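For readers who manage workloads outside the console, the parameters above map naturally onto a Kubernetes HorizontalPodAutoscaler object. The sketch below builds such a manifest as a Python dictionary. It assumes the `autoscaling/v2` API, a CPU target expressed as average utilization, and a memory target expressed as an average value in MiB. The function name and arguments are hypothetical, and the object that KubeSphere actually creates may differ.

```python
import json


def build_hpa_manifest(workload_name, namespace, kind="Deployment",
                       target_cpu_percent=None, target_memory_mib=None,
                       min_replicas=1, max_replicas=1):
    """Build a HorizontalPodAutoscaler manifest (hypothetical helper)."""
    if min_replicas > max_replicas:
        raise ValueError("min_replicas must not exceed max_replicas")

    metrics = []
    if target_cpu_percent is not None:
        # Target CPU Usage: percentage averaged over all pod replicas.
        metrics.append({
            "type": "Resource",
            "resource": {
                "name": "cpu",
                "target": {"type": "Utilization",
                           "averageUtilization": int(target_cpu_percent)},
            },
        })
    if target_memory_mib is not None:
        # Target Memory Usage: average value per pod replica, in MiB.
        metrics.append({
            "type": "Resource",
            "resource": {
                "name": "memory",
                "target": {"type": "AverageValue",
                           "averageValue": f"{int(target_memory_mib)}Mi"},
            },
        })

    return {
        "apiVersion": "autoscaling/v2",
        "kind": "HorizontalPodAutoscaler",
        "metadata": {"name": workload_name, "namespace": namespace},
        "spec": {
            "scaleTargetRef": {"apiVersion": "apps/v1", "kind": kind,
                               "name": workload_name},
            "minReplicas": min_replicas,
            "maxReplicas": max_replicas,
            "metrics": metrics,
        },
    }


if __name__ == "__main__":
    # Example values only; "web" and "demo-project" are placeholder names.
    manifest = build_hpa_manifest("web", "demo-project",
                                  target_cpu_percent=60, target_memory_mib=512,
                                  min_replicas=2, max_replicas=5)
    print(json.dumps(manifest, indent=2))
```

The printed JSON can be reviewed or adapted before it is applied to a cluster. Leaving the maximum at its default of 1 prevents any scale-out, so it is usually raised together with the targets.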