Configuring the requests per second (RPS) target
This setting specifies a target for requests-per-second per replica of an application. Your revision must also be configured to use the rps
metric annotation.
- Global key:
requests-per-second-target-default
- Per-revision annotation key:
autoscaling.knative.dev/target
- Possible values: An integer.
- Default:
"200"
Example:
Per RevisionGlobal (ConfigMap)Global (Operator)
apiVersion: serving.knative.dev/v1
kind: Service
metadata:
name: helloworld-go
namespace: default
spec:
template:
metadata:
annotations:
autoscaling.knative.dev/target: "150"
autoscaling.knative.dev/metric: "rps"
spec:
containers:
- image: ghcr.io/knative/helloworld-go:latest
apiVersion: v1
kind: ConfigMap
metadata:
name: config-autoscaler
namespace: knative-serving
data:
requests-per-second-target-default: "150"
apiVersion: operator.knative.dev/v1alpha1
kind: KnativeServing
metadata:
name: knative-serving
spec:
config:
autoscaler:
requests-per-second-target-default: "150"