5.23. TPCH Connector
The TPCH connector provides a set of schemas to support the TPCBenchmark™ H (TPC-H). TPC-H is a database benchmark used to measure theperformance of highly-complex decision support databases.
This connector can also be used to test the capabilities and querysyntax of Presto without configuring access to an external datasource. When you query a TPCH schema, the connector generates thedata on the fly using a deterministic algorithm.
Configuration
To configure the TPCH connector, create a catalog properties fileetc/catalog/tpch.properties
with the following contents:
- connector.name=tpch
TPCH Schemas
The TPCH connector supplies several schemas:
- SHOW SCHEMAS FROM tpch;
- Schema
- Schema
information_schema sf1 sf100 sf1000 sf10000 sf100000 sf300 sf3000 sf30000 tiny(11 rows)
Ignore the standard schema information_schema
which exists in everycatalog and is not directly provided by the TPCH connector.
Every TPCH schema provides the same set of tables. Some tables areidentical in all schemas. Other tables vary based on the _scale factor_which is determined based on the schema name. For example, the schemasf1
corresponds to scale factor 1
and the schema sf300
corresponds to scale factor 300
. The TPCH connector provides aninfinite number of schemas for any scale factor, not just the few commonones listed by SHOW SCHEMAS
. The tiny
schema is an alias for scalefactor 0.01
, which is a very small data set useful for testing.