Release 0.183
General Changes
Fix planning failure for queries that use
GROUPING
and contain aggregation expressions that require implicit coercions.Fix planning failure for queries that contains a non-equi left join that is semantically equivalent to an inner join.
Fix issue where a query may have a reported memory that is higher than actual usage when an aggregation is followed by other non-trivial work in the same stage. This can lead to failures due to query memory limit, or lower cluster throughput due to perceived insufficient memory.
Fix query failure for
CHAR
functionstrim()
,rtrim()
, andsubstr()
when the return value would have trailing spaces underVARCHAR
semantics.Fix formatting in
EXPLAIN ANALYZE
output.Improve error message when a query contains an unsupported form of correlated subquery.
Improve performance of
CAST(json_parse(...) AS ...)
.Add
map_from_entries()
andmap_entries()
functions.Change spilling for aggregations to only occur when the cluster runs out of memory.
Remove the
experimental.operator-memory-limit-before-spill
config property and theoperator_memory_limit_before_spill
session property.Allow configuring the amount of memory that can be used for merging spilled aggregation data from disk using the
experimental.aggregation-operator-unspill-memory-limit
config property or theaggregation_operator_unspill_memory_limit
session property.
Web UI Changes
- Add output rows, output size, written rows and written size to query detail page.
Hive Changes
Work around ORC-222 which results in invalid summary statistics in ORC or DWRF files when the input data contains invalid string data. Previously, this would usually cause the query to fail, but in rare cases it could cause wrong results by incorrectly skipping data based on the invalid statistics.
Fix issue where reported memory is lower than actual usage for table columns containing string values read from ORC or DWRF files. This can lead to high GC overhead or out-of-memory crash.
Improve error message for small ORC files that are completely corrupt or not actually ORC.
Add predicate pushdown for the hidden column
"$path"
.
TPCH Changes
- Add column statistics for schemas
tiny
andsf1
.
TPCDS Changes
- Add column statistics for schemas
tiny
andsf1
.
SPI Changes
Map columns or values represented with
ArrayBlock
andInterleavedBlock
are no longer supported. They must be represented asMapBlock
orSingleMapBlock
.Extend column statistics with minimal and maximal value.
Replace
nullsCount
withnullsFraction
in column statistics.