TPCH Connector

The TPCH connector provides a set of schemas to support the TPC Benchmark™ H (TPC-H). TPC-H is a database benchmark used to measure the performance of highly complex decision support databases.

This connector can also be used to test the capabilities and query syntax of openLooKeng without configuring access to an external data source. When you query a TPCH schema, the connector generates the data on the fly using a deterministic algorithm.
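As an illustration of using the connector to try out query syntax, the sketch below joins two of the standard TPC-H tables. It assumes a catalog named tpch has been set up as described under Configuration, that the tiny schema is used, and that the tables expose the unprefixed column names (name, regionkey); if your deployment exposes the TPC-H prefixed names instead, adjust the column references accordingly.

```sql
-- Assumption: the catalog is named "tpch" and columns use unprefixed names.
-- Lists a few nations together with the region each one belongs to.
SELECT n.name AS nation,
       r.name AS region
FROM tpch.tiny.nation AS n
JOIN tpch.tiny.region AS r
  ON n.regionkey = r.regionkey
ORDER BY n.name
LIMIT 5;
```

Because the data is generated deterministically, running the same query again should return the same rows.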

Configuration

To configure the TPCH connector, create a catalog properties file etc/catalog/tpch.properties with the following contents:

    connector.name=tpch

TPCH Schemas

The TPCH connector supplies several schemas:

    SHOW SCHEMAS FROM tpch;

           Schema
    --------------------
     information_schema
     sf1
     sf100
     sf1000
     sf10000
     sf100000
     sf300
     sf3000
     sf30000
     tiny
    (10 rows)

Ignore the standard schema information_schema, which exists in every catalog and is not provided directly by the TPCH connector.

Every TPCH schema provides the same set of tables. Some tables are identical in all schemas; others vary in size with the scale factor, which is derived from the schema name. For example, the schema sf1 corresponds to scale factor 1 and the schema sf300 corresponds to scale factor 300. The TPCH connector accepts a schema for any scale factor, not just the few common ones listed by SHOW SCHEMAS. The tiny schema is an alias for scale factor 0.01, a very small data set useful for testing.
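The relationship between schema name and table size can be checked with a few counts. This is a sketch that assumes the catalog is named tpch and relies on the standard TPC-H table names nation and orders; sf2 is an illustrative schema name that follows the sf-plus-number pattern and does not appear in the SHOW SCHEMAS output.

```sql
-- Assumption: the catalog is named "tpch".

-- nation is one of the tables that is identical in every schema,
-- so both counts should match.
SELECT count(*) AS nation_rows FROM tpch.tiny.nation;
SELECT count(*) AS nation_rows FROM tpch.sf1.nation;

-- orders grows with the scale factor, so these counts should differ.
SELECT count(*) AS order_rows FROM tpch.tiny.orders;
SELECT count(*) AS order_rows FROM tpch.sf1.orders;

-- sf2 is not listed by SHOW SCHEMAS; it is used here only to illustrate
-- querying an arbitrary scale factor.
SELECT count(*) AS order_rows FROM tpch.sf2.orders;
```

Larger scale factors generate proportionally more rows at query time, so it is usually sensible to start with tiny or sf1 when experimenting.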