Iceberg Connector
Overview
The Iceberg connector allows querying data stored in Iceberg tables.
Note
It is recommended to use Iceberg 0.9.0 or later.
Configuration
To configure the Iceberg connector, create a catalog properties file etc/catalog/iceberg.properties
with the following contents, replacing the properties as appropriate:
connector.name=iceberg
hive.metastore.uri=hostname:port
iceberg.catalog.type=hive
Configuration Properties
The following configuration properties are available:
Property Name | Description |
---|---|
| The URI(s) of the Hive metastore. |
| The storage file format for Iceberg tables. |
| The compression codec to use when writing files. |
| The catalog type for Iceberg tables. |
| The catalog warehouse root path for Iceberg tables. |
| The number of Iceberg catalogs to cache. |
| The path(s) for Hadoop configuration resources. |
hive.metastore.uri
The URI(s) of the Hive metastore to connect to using the Thrift protocol. If multiple URIs are provided, the first URI is used by default and the rest of the URIs are fallback metastores. This property is required. Example: thrift://192.0.2.3:9083
or thrift://192.0.2.3:9083,thrift://192.0.2.4:9083
iceberg.file-format
The storage file format for Iceberg tables. The available values are PARQUET
and ORC
.
The default is PARQUET
.
iceberg.compression-codec
The compression codec to use when writing files. The available values are NONE
, SNAPPY
, GZIP
, LZ4
, and ZSTD
.
The default is GZIP
.
iceberg.catalog.type
The catalog type for Iceberg tables. The available values are hive
and hadoop
, corresponding to the catalogs in the Iceberg.
The default is hive
.
iceberg.catalog.warehouse
The catalog warehouse root path for Iceberg tables. Example: hdfs://nn:8020/warehouse/path
.
This property is required if the iceberg.catalog.type
is hadoop
. Otherwise, it will be ignored.
iceberg.catalog.cached-catalog-num
The number of Iceberg catalogs to cache.
The default is 10
. This property is required if the iceberg.catalog.type
is hadoop
. Otherwise, it will be ignored.
iceberg.hadoop.config.resources
The path(s) for Hadoop configuration resources. Example: /etc/hadoop/conf/core-site.xml
.
This property is required if the iceberg.catalog.type
is hadoop
. Otherwise, it will be ignored.