5.22. TPCH Connector
The TPCH connector provides a set of schemas to support the TPCBenchmark™ H (TPC-H). TPC-H is a database benchmark used to measure theperformance of highly-complex decision support databases.
This connector can also be used to test the capabilities and querysyntax of Presto without configuring access to an external datasource. When you query a TPCH schema, the connector generates thedata on the fly using a deterministic algorithm.
Configuration
To configure the TPCH connector, create a catalog properties fileetc/catalog/tpch.properties
with the following contents:
- connector.name=tpch
TPCH Schemas
The TPCH connector supplies several schemas:
- SHOW SCHEMAS FROM tpch;
- Schema
- Schema
information_schema
sf1
sf100
sf1000
sf10000
sf100000
sf300
sf3000
sf30000
tiny
(11 rows)
Ignore the standard schema information_schema
which exists in everycatalog and is not directly provided by the TPCH connector.
Every TPCH schema provides the same set of tables. Some tables areidentical in all schemas. Other tables vary based on the _scale factor_which is determined based on the schema name. For example, the schemasf1
corresponds to scale factor 1
and the schema sf300
corresponds to scale factor 300
. The TPCH connector provides aninfinite number of schemas for any scale factor, not just the few commonones listed by SHOW SCHEMAS
. The tiny
schema is an alias for scalefactor 0.01
, which is a very small data set useful for testing.