11.171. Release 0.70
Warning
This release contained a packaging error that resulted in anunusable server tarball. Do not use this release.
Views
We have added support for creating views within Presto.Views are defined using Presto syntax but are stored (as blobs)by connectors. Currently, views are supported by theRaptor and Hive connectors. For the Hive connector, views arestored within the Hive metastore as Hive views, but they cannotbe queried by Hive, nor can Hive views be queried by Presto.
See CREATE VIEW and DROP VIEWfor details and examples.
DUAL Table
The synthetic DUAL
table is no longer supported. As an alternative, pleasewrite your queries without a FROM
clause or use the VALUES
syntax.
Presto Verifier
There is a new project, Presto Verifier, which can be used toverify a set of queries against two different clusters.
Connector Improvements
- Connectors can now add hidden columns to a table. Hidden columns are notdisplayed in
DESCRIBE
orinformation_schema
, and are notconsidered forSELECT *
. As an example, we have added a hiddenrow_number
column to thetpch
connector. - Presto contains an extensive test suite to verify the correctness. This testsuite has been extracted into the
presto-test
module for use duringconnector development. For an example, seeTestRaptorDistributedQueries
.
Machine Learning Functions
We have added two new machine learning functions, which can be usedby advanced users familiar with LIBSVM. The functions arelearn_libsvm_classifier
and learn_libsvm_regressor
. Both take aparameters string which has the form key=value,key=value
General Changes
- New comparison functions:
greatest()
andleast()
- New window functions:
first_value()
,last_value()
, andnth_value()
- We have added a config option to disable falling back to the interpreter whenexpressions fail to be compiled to bytecode. To set this option, add
compiler.interpreter-enabled=false
toetc/config.properties
.This will force certain queries to fail rather than running slowly. DATE
values are now implicitly coerced toTIMESTAMP
andTIMESTAMP WITH TIME ZONE
by setting the hour/minute/seconds to0
with respect to the session timezone.- Minor performance optimization when planning queries over tables with tens ofthousands of partitions or more.
- Fixed a bug when planning
ORDER BY … LIMIT
queries which could result induplicate and un-ordered results under rare conditions. - Reduce the size of stats collected from tasks, which dramatically reducesgarbage generation and improves coordinator stability.
- Fix compiler cache for expressions.
- Fix processing of empty or commented out statements in the CLI.
Hive Changes
- There are two new configuration options for the Hive connector,
hive.max-initial-split-size
, which configures the size of theinitial splits, andhive.max-initial-splits
, which configuresthe number of initial splits. This can be useful for speeding up smallqueries, which would otherwise have low parallelism. - The Hive connector will now consider all tables with a non-empty valuefor the table property
presto_offline
to be offline. The value of theproperty will be used in the error message. - We have added support for for
DROP TABLE
in the hive connector.By default, this feature is not enabled. To enable it, sethive.allow-drop-table=true
in your Hive catalog properties file. - Ignore subdirectories when generating splits(this now matches the non-recursive behavior of Hive).
- Fix handling of maps with null keys.