- Configuration Parameter Categories
- Append-Only Table Parameters
- Client Connection Default Parameters
- Connection and Authentication Parameters
- Database and Tablespace/Filespace Parameters
- Error Reporting and Logging Parameters
- External Table Parameters
- GPORCA Parameters
- HAWQ Array Configuration Parameters
- HAWQ Extension Framework (PXF) Parameters
- HAWQ PL/Java Extension Parameters
- HAWQ Resource Management Parameters
- Lock Management Parameters
- Past PostgreSQL Version Compatibility Parameters
- Query Tuning Parameters
- Legacy Query Optimizer Operator Control Parameters
- Legacy Query Optimizer Costing Parameters
- Database Statistics Sampling Parameters
- Sort Operator Configuration Parameters
- Aggregate Operator Configuration Parameters
- Join Operator Configuration Parameters
- Other Legacy Query Optimizer Configuration Parameters
- Ranger Configuration Parameters
- Statistics Collection Parameters
- System Resource Consumption Parameters
Configuration Parameter Categories
Configuration parameters affect categories of server behaviors, such as resource consumption, query tuning, and authentication. The following sections describe HAWQ configuration parameter categories.
Note: If you install and manage HAWQ using Ambari, be aware that property changes made using hawq config
could be overwritten by Ambari. For Ambari-managed HAWQ clusters, always use the Ambari administration interface to set or change HAWQ configuration properties.
Append-Only Table Parameters
The following parameters configure the append-only tables feature of HAWQ.
Client Connection Default Parameters
These configuration parameters set defaults that are used for client connections.
Statement Behavior Parameters
- check_function_bodies
- default_tablespace
- default_transaction_isolation
- default_transaction_read_only
- search_path
- statement_timeout
- vacuum_freeze_min_age
Locale and Formatting Parameters
- client_encoding
- DateStyle
- extra_float_digits
- IntervalStyle
- lc_collate
- lc_ctype
- lc_messages
- lc_monetary
- lc_numeric
- lc_time
- TimeZone
Other Client Default Parameters
Connection and Authentication Parameters
These parameters control how clients connect and authenticate to HAWQ.
Connection Parameters
- listen_addresses
- max_connections
- seg_max_connections
- superuser_reserved_connections
- tcp_keepalives_count
- tcp_keepalives_idle
- tcp_keepalives_interval
- unix_socket_directory
- unix_socket_group
- unix_socket_permissions
Security and Authentication Parameters
- authentication_timeout
- db_user_namespace
- enable_secure_filesystem
- krb_caseins_users
- krb_server_keyfile
- krb_srvname
- password_encryption
- password_hash_algorithm
- ssl
- ssl_ciphers
Database and Tablespace/Filespace Parameters
The following parameters configure the maximum number of databases, tablespaces, and filespaces allowed in a system.
Error Reporting and Logging Parameters
These configuration parameters control HAWQ logging.
Log Rotation
When to Log
- client_min_messages
- log_error_verbosity
- log_min_duration_statement
- log_min_error_statement
- log_min_messages
- optimizer_minidump
What to Log
- debug_pretty_print
- debug_print_parse
- debug_print_parse
- debug_print_plan
- debug_print_prelim_plan
- debug_print_rewritten
- debug_print_slice_table
- log_autostats
- log_connections
- log_disconnections
- log_dispatch_stats
- log_duration
- log_executor_stats
- log_hostname
- log_parser_stats
- log_planner_stats
- log_statement
- log_statement_stats
- log_timezone
- gp_debug_linger
- gp_log_format
- gp_max_csv_line_length
- gp_reraise_signal
External Table Parameters
The following parameters configure the external tables feature of HAWQ.
GPORCA Parameters
These parameters control the usage of GPORCA by HAWQ. For information about GPORCA, see About GPORCA.
- optimizer
- optimizer_analyze_root_partition
- optimizer_minidump
- optimizer_parts_to_force_sort_on_insert
- optimizer_prefer_scalar_dqa_multistage_agg
HAWQ Array Configuration Parameters
The parameters in this topic control the configuration of the HAWQ array and its components: segments, master, distributed transaction manager, master mirror, and interconnect.
Interconnect Configuration Parameters
- gp_interconnect_cache_future_packets
- gp_interconnect_default_rtt
- gp_interconnect_fc_method
- gp_interconnect_hash_multiplier
- gp_interconnect_min_retries_before_timeout
- gp_interconnect_min_rto
- gp_interconnect_queue_depth
- gp_interconnect_setup_timeout
- gp_interconnect_snd_queue_depth
- gp_interconnect_timer_checking_period
- gp_interconnect_timer_period
- gp_interconnect_type
- gp_max_packet_size
Dispatch Configuration Parameters
- gp_cached_segworkers_threshold
- gp_connections_per_thread
- gp_enable_direct_dispatch
- gp_segment_connect_timeout
- gp_set_proc_affinity
- gp_vmem_idle_resource_timeout
Fault Operation Parameters
Filepace Parameters
Master Configuration Parameters
Read-Only Parameters
Segment Configuration Parameters
HAWQ Extension Framework (PXF) Parameters
The parameters in this topic control configuration, query analysis, and statistics collection in the HAWQ Extension Framework (PXF).
- pxf_enable_filter_pushdown
- pxf_enable_stat_collection
- pxf_remote_service_login
- pxf_remote_service_secret
- pxf_service_address
- pxf_service_port
- pxf_stat_max_fragments
HAWQ PL/Java Extension Parameters
The parameters in this topic control the configuration of HAWQ PL/Java extensions.
HAWQ Resource Management Parameters
The following configuration parameters configure the HAWQ resource management feature.
- hawq_global_rm_type
- hawq_re_memory_overcommit_max
- hawq_rm_cluster_report_period
- hawq_rm_force_alterqueue_cancel_queued_request
- hawq_rm_master_port
- hawq_rm_memory_limit_perseg
- hawq_rm_min_resource_perseg
- hawq_rm_nresqueue_limit
- hawq_rm_nslice_perseg_limit
- hawq_rm_nvcore_limit_perseg
- hawq_rm_nvseg_perquery_limit
- hawq_rm_nvseg_perquery_perseg_limit
- hawq_rm_nvseg_variance_amon_seg_limit
- hawq_rm_rejectrequest_nseg_limit
- hawq_rm_resource_idle_timeout
- hawq_rm_return_percent_on_overcommit
- hawq_rm_segment_heartbeat_interval
- hawq_rm_segment_port
- hawq_rm_stmt_nvseg
- hawq_rm_stmt_vseg_memory
- hawq_rm_tolerate_nseg_limit
- hawq_rm_yarn_address
- hawq_rm_yarn_app_name
- hawq_rm_yarn_queue_name
- hawq_rm_yarn_scheduler_address
Lock Management Parameters
These configuration parameters set limits for locks and deadlocks.
Past PostgreSQL Version Compatibility Parameters
The following parameters provide compatibility with older PostgreSQL versions. You do not need to change these parameters in HAWQ.
- add_missing_from
- array_nulls
- backslash_quote
- escape_string_warning
- regex_flavor
- standard_conforming_strings
- transform_null_equals
Query Tuning Parameters
These parameters control aspects of SQL query processing such as query operators and operator settings and statistics sampling.
Legacy Query Optimizer Operator Control Parameters
The following parameters control the types of plan operations the legacy query optimizer can use. Enable or disable plan operations to force the legacy optimizer to choose a different plan. This is useful for testing and comparing query performance using different plan types.
- enable_bitmapscan
- enable_groupagg
- enable_hashagg
- enable_hashjoin
- enable_indexscan
- enable_mergejoin
- enable_nestloop
- enable_seqscan
- enable_sort
- enable_tidscan
- gp_enable_agg_distinct
- gp_enable_agg_distinct_pruning
- gp_enable_direct_dispatch
- gp_enable_fallback_plan
- gp_enable_fast_sri
- gp_enable_groupext_distinct_gather
- gp_enable_groupext_distinct_pruning
- gp_enable_multiphase_agg
- gp_enable_predicate_propagation
- gp_enable_preunique
- gp_enable_sequential_window_plans
- gp_enable_sort_distinct
- gp_enable_sort_limit
Legacy Query Optimizer Costing Parameters
Warning: Do not adjust these query costing parameters. They are tuned to reflect HAWQ hardware configurations and typical workloads. All of these parameters are related. Changing one without changing the others can have adverse affects on performance.
- cpu_index_tuple_cost
- cpu_operator_cost
- cpu_tuple_cost
- cursor_tuple_fraction
- effective_cache_size
- gp_motion_cost_per_row
- gp_segments_for_planner
- random_page_cost
- seq_page_cost
Database Statistics Sampling Parameters
These parameters adjust the amount of data sampled by an ANALYZE
operation. Adjusting these parameters affects statistics collection system-wide. You can configure statistics collection on particular tables and columns by using the ALTER TABLE
SET STATISTICS
clause. See About Database Statistics.
Sort Operator Configuration Parameters
Aggregate Operator Configuration Parameters
- gp_enable_agg_distinct
- gp_enable_agg_distinct_pruning
- gp_enable_multiphase_agg
- gp_enable_preunique
- gp_enable_groupext_distinct_gather
- gp_enable_groupext_distinct_pruning
- gp_workfile_compress_algorithm
Join Operator Configuration Parameters
- join_collapse_limit
- gp_adjust_selectivity_for_outerjoins
- gp_hashjoin_tuples_per_bucket
- gp_statistics_use_fkeys
- gp_workfile_compress_algorithm
Other Legacy Query Optimizer Configuration Parameters
- from_collapse_limit
- gp_enable_predicate_propagation
- gp_max_plan_size
- gp_statistics_pullup_from_child_partition
Ranger Configuration Parameters
These parameters control certain aspects of Ranger configuration, including enabling Ranger authorization for HAWQ and the HAWQ Ranger Plug-in Service port setting.
Statistics Collection Parameters
Automatic Statistics Collection
When automatic statistics collection is enabled, you can run ANALYZE
automatically in the same transaction as an INSERT
, COPY
or CREATE TABLE...AS SELECT
statement when a certain threshold of rows is affected (on_change
), or when a newly generated table has no statistics (on_no_stats
). To enable this feature, set the following server configuration parameters in your HAWQ hawq-site.xml
file by using the hawq config
utility and restart HAWQ:
Runtime Statistics Collection Parameters
These parameters control the server statistics collection feature. When statistics collection is enabled, you can access the statistics data using the pg_stat and pg_statio family of system catalog views.
System Resource Consumption Parameters
These parameters set the limits for system resources consumed by HAWQ.
Memory Consumption Parameters
These parameters control system memory usage. You can adjust hawq_re_memory_overcommit_max
to avoid running out of memory at the segment hosts during query processing. See also HAWQ Resource Management.
- hawq_re_memory_overcommit_max
- gp_vmem_protect_segworker_cache_limit
- gp_workfile_limit_files_per_query
- gp_workfile_limit_per_query
- gp_workfile_limit_per_segment
- maintenance_work_mem
- max_stack_depth
- shared_buffers
- temp_buffers
Free Space Map Parameters
These parameters control the sizing of the free space map, which contains expired rows. Use VACUUM
to reclaim the free space map disk space.
OS Resource Parameters
Cost-Based Vacuum Delay Parameters
Warning: Avoid using cost-based vacuum delay because it runs asynchronously among the segment instances. The vacuum cost limit and delay is invoked at the segment level without taking into account the state of the entire HAWQ array
You can configure the execution cost of VACUUM
and ANALYZE
commands to reduce the I/O impact on concurrent database activity. When the accumulated cost of I/O operations reaches the limit, the process performing the operation sleeps for a while, Then resets the counter and continues execution