Karmada Metrics Reference
Metrics
This section details the metrics that different Karmada components export. You can query the metrics endpoint for these components using an HTTP scrape, and fetch the current metrics data in Prometheus format.
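
As a rough illustration of the scrape described above, the following sketch fetches a metrics endpoint and parses the Prometheus text format using only the Python standard library. The URL in the usage note, the helper names `parse_metrics` and `scrape`, and the sample label value are assumptions made for this example. The metric names exposed by your deployment may also carry a component prefix, so check the raw endpoint output first.

```python
import re
import urllib.request

# One sample line: metric name, optional {labels}, then the value.
# A trailing timestamp, if present, is ignored.
SAMPLE_RE = re.compile(r'^([a-zA-Z_:][a-zA-Z0-9_:]*)(?:\{(.*)\})?\s+(\S+)')
# One label pair inside the braces: key="value" (escape sequences are kept as-is).
LABEL_RE = re.compile(r'([a-zA-Z_][a-zA-Z0-9_]*)="((?:[^"\\]|\\.)*)"')


def parse_metrics(text):
    """Parse Prometheus text-format samples into (name, labels, value) tuples."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        match = SAMPLE_RE.match(line)
        if match is None:
            continue
        name, raw_labels, value = match.groups()
        labels = dict(LABEL_RE.findall(raw_labels or ""))
        samples.append((name, labels, float(value)))
    return samples


def scrape(url, timeout=5.0):
    """Fetch a metrics endpoint over HTTP and parse the response body."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_metrics(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Illustrative input only; "member1" is a made-up cluster name.
    sample = (
        "# TYPE cluster_ready_state gauge\n"
        'cluster_ready_state{cluster_name="member1"} 1\n'
    )
    for name, labels, value in parse_metrics(sample):
        print(name, labels, value)
```

To use it against a running component, call `scrape` with that component's metrics URL, for example `scrape("http://127.0.0.1:8080/metrics")`. The host, port, and path here are placeholders; the real values, and whether the endpoint requires authentication, depend on how the component is deployed.
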
Name | Type | Help | Labels | Source Components |
---|---|---|---|---|
schedule_attempts_total | Counter | Number of attempts to schedule resourceBinding. | result schedule_type | karmada-scheduler |
e2e_scheduling_duration_seconds | Histogram | E2E scheduling latency in seconds. | result schedule_type | karmada-scheduler |
scheduling_algorithm_duration_seconds | Histogram | Scheduling algorithm latency in seconds (excluding scale scheduler). | schedule_step | karmada-scheduler |
queue_incoming_bindings_total | Counter | Number of bindings added to scheduling queues by event type. | event | karmada-scheduler |
framework_extension_point_duration_seconds | Histogram | Latency for running all plugins of a specific extension point. | extension_point result | karmada-scheduler |
plugin_execution_duration_seconds | Histogram | Duration for running a plugin at a specific extension point. | plugin extension_point result | karmada-scheduler |
estimating_request_total | Counter | Number of scheduler estimator requests. | result type | karmada-scheduler-estimator |
estimating_algorithm_duration_seconds | Histogram | Estimating algorithm latency in seconds for each step. | result type step | karmada-scheduler-estimator |
cluster_ready_state | Gauge | State of the cluster (1 if ready, 0 otherwise). | cluster_name | karmada-controller-manager karmada-agent |
cluster_node_number | Gauge | Number of nodes in the cluster. | cluster_name | karmada-controller-manager karmada-agent |
cluster_ready_node_number | Gauge | Number of ready nodes in the cluster. | cluster_name | karmada-controller-manager karmada-agent |
cluster_memory_allocatable_bytes | Gauge | Allocatable cluster memory resource in bytes. | cluster_name | karmada-controller-manager karmada-agent |
cluster_cpu_allocatable_number | Gauge | Number of allocatable CPU in the cluster. | cluster_name | karmada-controller-manager karmada-agent |
cluster_pod_allocatable_number | Gauge | Number of allocatable pods in the cluster. | cluster_name | karmada-controller-manager karmada-agent |
cluster_memory_allocated_bytes | Gauge | Allocated cluster memory resource in bytes. | cluster_name | karmada-controller-manager karmada-agent |
cluster_cpu_allocated_number | Gauge | Number of allocated CPU in the cluster. | cluster_name | karmada-controller-manager karmada-agent |
cluster_pod_allocated_number | Gauge | Number of allocated pods in the cluster. | cluster_name | karmada-controller-manager karmada-agent |
cluster_sync_status_duration_seconds | Histogram | Duration in seconds for syncing the status of the cluster once. | cluster_name | karmada-controller-manager karmada-agent |
resource_match_policy_duration_seconds | Histogram | Duration in seconds to find a matched propagation policy for the resource template. | / | karmada-controller-manager |
resource_apply_policy_duration_seconds | Histogram | Duration in seconds to apply a propagation policy for the resource template. The result is ‘error’ if the resource template failed to apply the policy, otherwise ‘success’. | result | karmada-controller-manager |
policy_apply_attempts_total | Counter | Number of attempts to apply a propagation policy. The result is ‘error’ if the resource template failed to apply the policy, otherwise ‘success’. | result | karmada-controller-manager |
binding_sync_work_duration_seconds | Histogram | Duration in seconds to sync works for a binding object. The result is ‘error’ if the binding failed to sync works, otherwise ‘success’. | result | karmada-controller-manager |
work_sync_workload_duration_seconds | Histogram | Duration in seconds to sync the workload to a target cluster. The result is ‘error’ if the work failed to sync workloads, otherwise ‘success’. | result | karmada-controller-manager karmada-agent |
policy_preemption_total | Counter | Number of preemptions for the resource template. The result is ‘error’ if the resource template failed to be preempted by other propagation policies, otherwise ‘success’. | result | karmada-controller-manager |
cronfederatedhpa_process_duration_seconds | Histogram | Duration in seconds to process a CronFederatedHPA. The result is ‘error’ if the CronFederatedHPA failed to be processed, otherwise ‘success’. | result | karmada-controller-manager |
cronfederatedhpa_rule_process_duration_seconds | Histogram | Duration in seconds to process a CronFederatedHPA rule. The result is ‘error’ if the CronFederatedHPA rule failed to be processed, otherwise ‘success’. | result | karmada-controller-manager |
federatedhpa_process_duration_seconds | Histogram | Duration in seconds to process a FederatedHPA. The result is ‘error’ if the FederatedHPA failed to be processed, otherwise ‘success’. | result | karmada-controller-manager |
federatedhpa_pull_metrics_duration_seconds | Histogram | Duration in seconds taken by the FederatedHPA to pull metrics. The result is ‘error’ if the FederatedHPA failed to pull the metrics, otherwise ‘success’. | result metricType | karmada-controller-manager |
pool_get_operation_total | Counter | Total number of times an object was taken from the pool. | name from | karmada-controller-manager karmada-agent |
pool_put_operation_total | Counter | Total number of times an object was put into the pool. | name to | karmada-controller-manager karmada-agent |
recreate_resource_to_cluster | Counter | Number of resource recreations to a target member cluster. The result is ‘error’ if the resource failed to be recreated, otherwise ‘success’. The cluster label is the target member cluster. | result apiversion kind cluster | karmada-controller-manager |
update_resource_to_cluster | Counter | Number of resource updates to a target member cluster. The result is ‘error’ if the resource failed to be updated, otherwise ‘success’. The cluster label is the target member cluster. | result apiversion kind cluster | karmada-controller-manager |
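
Many of the counters and histograms in the table carry a `result` label, which makes it straightforward to derive an error ratio or a mean duration from one scrape. The sketch below assumes the samples have already been parsed into `(name, labels, value)` tuples. The helper names `error_ratio` and `mean_duration` and all sample values are hypothetical. It relies on the standard Prometheus convention that a histogram exposes `<name>_sum` and `<name>_count` series alongside its buckets.

```python
from collections import defaultdict


def error_ratio(samples, metric_name):
    """Return errors / total for a counter that carries a `result` label.

    Values are summed over all other labels (for example `cluster` or `kind`).
    Returns None when the counter has no samples or has never been incremented.
    """
    totals = defaultdict(float)
    for name, labels, value in samples:
        if name == metric_name:
            totals[labels.get("result", "")] += value
    total = sum(totals.values())
    if total == 0:
        return None
    return totals.get("error", 0.0) / total


def mean_duration(samples, histogram_name):
    """Return the mean observed duration in seconds for a histogram.

    Uses the `<name>_sum` and `<name>_count` series, summed over all label sets.
    Returns None when nothing has been observed yet.
    """
    total_sum = 0.0
    total_count = 0.0
    for name, _labels, value in samples:
        if name == histogram_name + "_sum":
            total_sum += value
        elif name == histogram_name + "_count":
            total_count += value
    if total_count == 0:
        return None
    return total_sum / total_count


if __name__ == "__main__":
    # Made-up sample values, for illustration only.
    samples = [
        ("update_resource_to_cluster", {"result": "success", "cluster": "member1"}, 97.0),
        ("update_resource_to_cluster", {"result": "error", "cluster": "member1"}, 3.0),
        ("binding_sync_work_duration_seconds_sum", {"result": "success"}, 4.0),
        ("binding_sync_work_duration_seconds_count", {"result": "success"}, 8.0),
    ]
    print(error_ratio(samples, "update_resource_to_cluster"))
    print(mean_duration(samples, "binding_sync_work_duration_seconds"))
```

Because counters are cumulative since process start, a ratio computed from a single scrape covers the whole lifetime of the component. For a recent-window view, take the difference between two scrapes before dividing.
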