system.replicas

Contains information and status for replicated tables residing on the local server.
This table can be used for monitoring. The table contains a row for every Replicated* table.

Example:

  1. SELECT *
  2. FROM system.replicas
  3. WHERE table = 'visits'
  4. FORMAT Vertical
  1. Row 1:
  2. ──────
  3. database: merge
  4. table: visits
  5. engine: ReplicatedCollapsingMergeTree
  6. is_leader: 1
  7. can_become_leader: 1
  8. is_readonly: 0
  9. is_session_expired: 0
  10. future_parts: 1
  11. parts_to_check: 0
  12. zookeeper_path: /clickhouse/tables/01-06/visits
  13. replica_name: example01-06-1.yandex.ru
  14. replica_path: /clickhouse/tables/01-06/visits/replicas/example01-06-1.yandex.ru
  15. columns_version: 9
  16. queue_size: 1
  17. inserts_in_queue: 0
  18. merges_in_queue: 1
  19. part_mutations_in_queue: 0
  20. queue_oldest_time: 2020-02-20 08:34:30
  21. inserts_oldest_time: 1970-01-01 00:00:00
  22. merges_oldest_time: 2020-02-20 08:34:30
  23. part_mutations_oldest_time: 1970-01-01 00:00:00
  24. oldest_part_to_get:
  25. oldest_part_to_merge_to: 20200220_20284_20840_7
  26. oldest_part_to_mutate_to:
  27. log_max_index: 596273
  28. log_pointer: 596274
  29. last_queue_update: 2020-02-20 08:34:32
  30. absolute_delay: 0
  31. total_replicas: 2
  32. active_replicas: 2

Columns:

  • database (String) - Database name
  • table (String) - Table name
  • engine (String) - Table engine name
  • is_leader (UInt8) - Whether the replica is the leader.
    Multiple replicas can be leaders at the same time. A replica can be prevented from becoming a leader using the merge_tree setting replicated_can_become_leader. The leaders are responsible for scheduling background merges.
    Note that writes can be performed to any replica that is available and has a session in ZK, regardless of whether it is a leader.
  • can_become_leader (UInt8) - Whether the replica can be a leader.
  • is_readonly (UInt8) - Whether the replica is in read-only mode.
    This mode is turned on if the config doesn’t have sections with ZooKeeper, if an unknown error occurred when reinitializing sessions in ZooKeeper, and during session reinitialization in ZooKeeper.
  • is_session_expired (UInt8) - the session with ZooKeeper has expired. Basically the same as is_readonly.
  • future_parts (UInt32) - The number of data parts that will appear as the result of INSERTs or merges that haven’t been done yet.
  • parts_to_check (UInt32) - The number of data parts in the queue for verification. A part is put in the verification queue if there is suspicion that it might be damaged.
  • zookeeper_path (String) - Path to table data in ZooKeeper.
  • replica_name (String) - Replica name in ZooKeeper. Different replicas of the same table have different names.
  • replica_path (String) - Path to replica data in ZooKeeper. The same as concatenating ‘zookeeper_path/replicas/replica_path’.
  • columns_version (Int32) - Version number of the table structure. Indicates how many times ALTER was performed. If replicas have different versions, it means some replicas haven’t made all of the ALTERs yet.
  • queue_size (UInt32) - Size of the queue for operations waiting to be performed. Operations include inserting blocks of data, merges, and certain other actions. It usually coincides with future_parts.
  • inserts_in_queue (UInt32) - Number of inserts of blocks of data that need to be made. Insertions are usually replicated fairly quickly. If this number is large, it means something is wrong.
  • merges_in_queue (UInt32) - The number of merges waiting to be made. Sometimes merges are lengthy, so this value may be greater than zero for a long time.
  • part_mutations_in_queue (UInt32) - The number of mutations waiting to be made.
  • queue_oldest_time (DateTime) - If queue_size greater than 0, shows when the oldest operation was added to the queue.
  • inserts_oldest_time (DateTime) - See queue_oldest_time
  • merges_oldest_time (DateTime) - See queue_oldest_time
  • part_mutations_oldest_time (DateTime) - See queue_oldest_time

The next 4 columns have a non-zero value only where there is an active session with ZK.

  • log_max_index (UInt64) - Maximum entry number in the log of general activity.
  • log_pointer (UInt64) - Maximum entry number in the log of general activity that the replica copied to its execution queue, plus one. If log_pointer is much smaller than log_max_index, something is wrong.
  • last_queue_update (DateTime) - When the queue was updated last time.
  • absolute_delay (UInt64) - How big lag in seconds the current replica has.
  • total_replicas (UInt8) - The total number of known replicas of this table.
  • active_replicas (UInt8) - The number of replicas of this table that have a session in ZooKeeper (i.e., the number of functioning replicas).

If you request all the columns, the table may work a bit slowly, since several reads from ZooKeeper are made for each row.
If you don’t request the last 4 columns (log_max_index, log_pointer, total_replicas, active_replicas), the table works quickly.

For example, you can check that everything is working correctly like this:

  1. SELECT
  2. database,
  3. table,
  4. is_leader,
  5. is_readonly,
  6. is_session_expired,
  7. future_parts,
  8. parts_to_check,
  9. columns_version,
  10. queue_size,
  11. inserts_in_queue,
  12. merges_in_queue,
  13. log_max_index,
  14. log_pointer,
  15. total_replicas,
  16. active_replicas
  17. FROM system.replicas
  18. WHERE
  19. is_readonly
  20. OR is_session_expired
  21. OR future_parts > 20
  22. OR parts_to_check > 10
  23. OR queue_size > 20
  24. OR inserts_in_queue > 10
  25. OR log_max_index - log_pointer > 10
  26. OR total_replicas < 2
  27. OR active_replicas < total_replicas

If this query doesn’t return anything, it means that everything is fine.

Original article