Apache Parquet Extension

Download 来源:druid 浏览 166 扫码分享 2023-05-28 21:01:57

Apache Parquet Extension

Apache Parquet Extension

This Apache Druid module extends Druid Hadoop based indexing to ingest data directly from offline Apache Parquet files.

Note: If using the parquet-avro parser for Apache Hadoop based indexing, druid-parquet-extensions depends on the druid-avro-extensions module, so be sure to include both.

The druid-parquet-extensions provides the Parquet input format, the Parquet Hadoop parser, and the Parquet Avro Hadoop Parser with druid-avro-extensions. The Parquet input format is available for native batch ingestion and the other 2 parsers are for Hadoop batch ingestion. Please see corresponding docs for details.

当前内容版权归 druid 或其关联方所有，如需对内容或内容相关联开源项目进行关注与资助，请访问 druid .

本文档使用 BookStack 构建