LIVE_SUMMARY Query Option (Impala 2.3 or higher only)
When the LIVE_SUMMARY
query option is set to TRUE
, Impala displays the same output as the SUMMARY
command for queries submitted through the impala-shell command, with the measurements updated in real time as the query progresses. When the query finishes, the final SUMMARY
output remains visible in the impala-shell console output.
Starting in Impala 3.1, the summary output also includes the queuing status consisting of whether the query was queued and what was the latest queuing reason.
Type: Boolean
Default: FALSE (0)
Command-line equivalent:
You can enable this query option within impala-shell by starting the shell with the --live_summary
command-line option. You can still turn this setting off and on again within the shell through the SET
command.
Usage notes:
The live summary output can be useful for evaluating long-running queries, to evaluate which phase of execution takes up the most time, or if some hosts take much longer than others for certain operations, dragging overall performance down. By making the information available in real time, this feature lets you decide what action to take even before you cancel a query that is taking much longer than normal.
For example, you might see the HDFS scan phase taking a long time, and therefore revisit performance-related aspects of your schema design such as constructing a partitioned table, switching to the Parquet file format, running the COMPUTE STATS
statement for the table, and so on. Or you might see a wide variation between the average and maximum times for all hosts to perform some phase of the query, and therefore investigate if one particular host needed more memory or was experiencing a network problem.
The output from this query option is printed to standard error. The output is displayed only in interactive mode, not when the -q
or -f
options are used.
For a simple and concise way of tracking the progress of an interactive query, see LIVE_PROGRESS Query Option (Impala 2.3 or higher only).
Restrictions:
The LIVE_PROGRESS
and LIVE_SUMMARY
query options currently do not produce any output during COMPUTE STATS
operations.
Because the LIVE_PROGRESS
and LIVE_SUMMARY
query options are available only within the impala-shell interpreter:
You cannot change these query options through the SQL
SET
statement using the JDBC or ODBC interfaces. TheSET
command in impala-shell recognizes these names as shell-only options.Be careful when using impala-shell on a pre-Impala 2.3 system to connect to a system running Impala 2.3 or higher. The older impala-shell does not recognize these query option names. Upgrade impala-shell on the systems where you intend to use these query options.
Likewise, the impala-shell command relies on some information only available in Impala 2.3 and higher to prepare live progress reports and query summaries. The
LIVE_PROGRESS
andLIVE_SUMMARY
query options have no effect when impala-shell connects to a cluster running an older version of Impala.
Added in: Impala 2.3.0
To see how the LIVE_PROGRESS
and LIVE_SUMMARY
query options work in real time, see this animated demo.
Parent topic: Query Options for the SET Statement