Kylin 配置
Kylin 会自动从环境中检测 Hadoop/Hive/HBase 配置,如 “core-site.xml”, “hbase-site.xml” 和其他。除此之外,Kylin 有自己的配置,在 “conf” 文件夹下。
-bash-4.1# ls -l $KYLIN_HOME/conf
kylin_hive_conf.xml
kylin_job_conf_inmem.xml
kylin_job_conf.xml
kylin-kafka-consumer.xml
kylin.properties
kylin-server-log4j.properties
kylin-tools-log4j.properties
setenv.sh
kylin_hive_conf.xml
Kylin 从 Hive 中取数据时应用的 Hive 配置。
kylin_job_conf.xml and kylin_job_conf_inmem.xml
Kylin 运行 MapReduce jobs 时的 Hadoop MR 配置。在 Kylin 的 “In-mem cubing” job 的时候,”kylin_job_conf_inmem.xml” 需要更多的 memory 给 mapper。
kylin-kafka-consumer.xml
Kylin 从 Kafka brokers 中取数据时应用的 Kafka 配置。
kylin-server-log4j.properties
Kylin 服务器的日志配置。
kylin-tools-log4j.properties
Kylin 命令行的日志配置。
setenv.sh
设置环境变量的 shell 脚本。它将在 “kylin.sh” 和 “bin” 文件夹中的其它脚本中被调用。通常,您可以在这里调整 Kylin JVM 栈的大小,且可以设置 “KAFKA_HOME” 和其他环境变量。
kylin.properties
Kylin 的主要配置文件。
Key | Default value | Description | Overwritten at Cube |
---|---|---|---|
kylin.env | Dev | Whether this env is a Dev, QA, or Prod environment | No |
kylin.env.hdfs-working-dir | /kylin | Working directory on HDFS | No |
kylin.env.zookeeper-base-path | /kylin | Path on ZK | No |
kylin.env.zookeeper-connect-string | ZK connection string; If blank, use HBase’s ZK | No | |
kylin.env.zookeeper-acl-enabled | false | No | |
kylin.env.zookeeper.zk-auth | digest:ADMIN:KYLIN | No | |
kylin.env.zookeeper.zk-acl | world:anyone:rwcda | No | |
kylin.metadata.url | kylinmetadata@hbase | Kylin metadata storage | No |
kylin.metadata.sync-retries | 3 | No | |
kylin.metadata.sync-error-handler | No | ||
kylin.metadata.check-copy-on-write | false | No | |
kylin.metadata.hbase-client-scanner-timeout-period | 10000 | No | |
kylin.metadata.hbase-rpc-timeout | 5000 | No | |
kylin.metadata.hbase-client-retries-number | 1 | No | |
kylin.dictionary.use-forest-trie | true | No | |
kylin.dictionary.forest-trie-max-mb | 500 | No | |
kylin.dictionary.max-cache-entry | 3000 | No | |
kylin.dictionary.growing-enabled | false | No | |
kylin.dictionary.append-entry-size | 10000000 | No | |
kylin.dictionary.append-max-versions | 3 | No | |
kylin.dictionary.append-version-ttl | 259200000 | No | |
kylin.snapshot.max-cache-entry | 500 | No | |
kylin.snapshot.max-mb | 300 | No | |
kylin.snapshot.ext.shard-mb | 500 | No | |
kylin.snapshot.ext.local.cache.path | lookup_cache | No | |
kylin.snapshot.ext.local.cache.max-size-gb | 200 | No | |
kylin.cube.size-estimate-ratio | 0.25 | Yes | |
kylin.cube.size-estimate-memhungry-ratio | 0.05 | Deprecated | Yes |
kylin.cube.size-estimate-countdistinct-ratio | 0.05 | Yes | |
kylin.cube.algorithm | auto | Cubing algorithm for MR engine, other options: layer, inmem | Yes |
kylin.cube.algorithm.layer-or-inmem-threshold | 7 | Yes | |
kylin.cube.algorithm.inmem-split-limit | 500 | Yes | |
kylin.cube.algorithm.inmem-concurrent-threads | 1 | Yes | |
kylin.cube.ignore-signature-inconsistency | false | ||
kylin.cube.aggrgroup.max-combination | 4096 | Max cuboid numbers in a Cube | Yes |
kylin.cube.aggrgroup.is-mandatory-only-valid | false | Whether allow a Cube only has the base cuboid. | Yes |
kylin.cube.rowkey.max-size | 63 | Max columns in Rowkey, and it can not be more than 63 | No |
kylin.metadata.dimension-encoding-max-length | 256 | Max length for one dimension’s encoding | Yes |
kylin.cube.max-building-segments | 10 | Max building segments in one Cube | Yes |
kylin.cube.allow-appear-in-multiple-projects | false | Whether allow a Cueb appeared in multiple projects | No |
kylin.cube.gtscanrequest-serialization-level | 1 | ||
kylin.cube.is-automerge-enabled | true | Whether enable auto merge. | Yes |
kylin.job.log-dir | /tmp/kylin/logs | ||
kylin.job.allow-empty-segment | true | Whether tolerant data source is emtpy. | Yes |
kylin.job.max-concurrent-jobs | 10 | Max concurrent running jobs | No |
kylin.job.sampling-percentage | 100 | Data sampling percentage, to calculate Cube statistics; Default be all. | Yes |
kylin.job.notification-enabled | false | Whether send email notification on job error/succeed. | No |
kylin.job.notification-mail-enable-starttls | false | No | |
kylin.job.notification-mail-port | 25 | No | |
kylin.job.notification-mail-host | No | ||
kylin.job.notification-mail-username | No | ||
kylin.job.notification-mail-password | No | ||
kylin.job.notification-mail-sender | No | ||
kylin.job.notification-admin-emails | No | ||
kylin.job.retry | 0 | No | |
kylin.job.scheduler.priority-considered | false | No | |
kylin.job.scheduler.priority-bar-fetch-from-queue | 20 | No | |
kylin.job.scheduler.poll-interval-second | 30 | No | |
kylin.job.error-record-threshold | 0 | No | |
kylin.source.hive.keep-flat-table | false | Whether keep the intermediate Hive table after job finished. | No |
kylin.source.hive.database-for-flat-table | default | Hive database to create the intermediate table. | No |
kylin.source.hive.flat-table-storage-format | SEQUENCEFILE | No | |
kylin.source.hive.flat-table-field-delimiter | \u001F | No | |
kylin.source.hive.redistribute-flat-table | true | Whether or not to redistribute the flat table. | Yes |
kylin.source.hive.client | cli | No | |
kylin.source.hive.beeline-shell | beeline | No | |
kylin.source.hive.beeline-params | No | ||
kylin.source.hive.enable-sparksql-for-table-ops | false | No | |
kylin.source.hive.sparksql-beeline-shell | No | ||
kylin.source.hive.sparksql-beeline-params | No | ||
kylin.source.hive.table-dir-create-first | false | No | |
kylin.source.hive.flat-table-cluster-by-dict-column | |||
kylin.source.hive.default-varchar-precision | 256 | No | |
kylin.source.hive.default-char-precision | 255 | No | |
kylin.source.hive.default-decimal-precision | 19 | No | |
kylin.source.hive.default-decimal-scale | 4 | No | |
kylin.source.jdbc.connection-url | |||
kylin.source.jdbc.driver | |||
kylin.source.jdbc.dialect | default | ||
kylin.source.jdbc.user | |||
kylin.source.jdbc.pass | |||
kylin.source.jdbc.sqoop-home | |||
kylin.source.jdbc.sqoop-mapper-num | 4 | ||
kylin.source.jdbc.field-delimiter | | | ||
kylin.storage.default | 2 | No | |
kylin.storage.hbase.table-name-prefix | KYLIN | No | |
kylin.storage.hbase.namespace | default | No | |
kylin.storage.hbase.cluster-fs | |||
kylin.storage.hbase.cluster-hdfs-config-file | |||
kylin.storage.hbase.coprocessor-local-jar | |||
kylin.storage.hbase.min-region-count | 1 | ||
kylin.storage.hbase.max-region-count | 500 | ||
kylin.storage.hbase.hfile-size-gb | 2.0 | ||
kylin.storage.hbase.run-local-coprocessor | false | ||
kylin.storage.hbase.coprocessor-mem-gb | 3.0 | ||
kylin.storage.partition.aggr-spill-enabled | true | ||
kylin.storage.partition.max-scan-bytes | 3221225472 | ||
kylin.storage.hbase.coprocessor-timeout-seconds | 0 | ||
kylin.storage.hbase.max-fuzzykey-scan | 200 | ||
kylin.storage.hbase.max-fuzzykey-scan-split | 1 | ||
kylin.storage.hbase.max-visit-scanrange | 1000000 | ||
kylin.storage.hbase.scan-cache-rows | 1024 | ||
kylin.storage.hbase.region-cut-gb | 5.0 | ||
kylin.storage.hbase.max-scan-result-bytes | 5242880 | ||
kylin.storage.hbase.compression-codec | none | ||
kylin.storage.hbase.rowkey-encoding | FAST_DIFF | ||
kylin.storage.hbase.block-size-bytes | 1048576 | ||
kylin.storage.hbase.small-family-block-size-bytes | 65536 | ||
kylin.storage.hbase.owner-tag | |||
kylin.storage.hbase.endpoint-compress-result | true | ||
kylin.storage.hbase.max-hconnection-threads | 2048 | ||
kylin.storage.hbase.core-hconnection-threads | 2048 | ||
kylin.storage.hbase.hconnection-threads-alive-seconds | 60 | ||
kylin.engine.mr.lib-dir | |||
kylin.engine.mr.reduce-input-mb | 500 | ||
kylin.engine.mr.reduce-count-ratio | 1.0 | ||
kylin.engine.mr.min-reducer-number | 1 | ||
kylin.engine.mr.max-reducer-number | 500 | ||
kylin.engine.mr.mapper-input-rows | 1000000 | ||
kylin.engine.mr.max-cuboid-stats-calculator-number | 1 | ||
kylin.engine.mr.uhc-reducer-count | 1 | ||
kylin.engine.mr.build-uhc-dict-in-additional-step | false | ||
kylin.engine.mr.build-dict-in-reducer | true | ||
kylin.engine.mr.yarn-check-interval-seconds | 10 | ||
kylin.env.hadoop-conf-dir | Hadoop conf directory; If not specified, parse from environment. | No | |
kylin.engine.spark.rdd-partition-cut-mb | 10.0 | Spark Cubing RDD partition split size. | Yes |
kylin.engine.spark.min-partition | 1 | Spark Cubing RDD min partition number | Yes |
kylin.engine.spark.max-partition | 5000 | RDD max partition number | Yes |
kylin.engine.spark.storage-level | MEMORY_AND_DISK_SER | RDD persistent level. | Yes |
kylin.query.skip-empty-segments | true | Whether directly skip empty segment (metadata shows size be 0) when run SQL query. | Yes |
kylin.query.force-limit | -1 | ||
kylin.query.max-scan-bytes | 0 | ||
kylin.query.max-return-rows | 5000000 | ||
kylin.query.large-query-threshold | 1000000 | ||
kylin.query.cache-threshold-duration | 2000 | ||
kylin.query.cache-threshold-scan-count | 10240 | ||
kylin.query.cache-threshold-scan-bytes | 1048576 | ||
kylin.query.security-enabled | true | ||
kylin.query.cache-enabled | true | ||
kylin.query.timeout-seconds | 0 | ||
kylin.query.pushdown.runner-class-name | |||
kylin.query.pushdown.update-enabled | false | ||
kylin.query.pushdown.cache-enabled | false | ||
kylin.query.pushdown.jdbc.url | |||
kylin.query.pushdown.jdbc.driver | |||
kylin.query.pushdown.jdbc.username | |||
kylin.query.pushdown.jdbc.password | |||
kylin.query.pushdown.jdbc.pool-max-total | 8 | ||
kylin.query.pushdown.jdbc.pool-max-idle | 8 | ||
kylin.query.pushdown.jdbc.pool-min-idle | 0 | ||
kylin.query.security.table-acl-enabled | true | No | |
kylin.server.mode | all | Kylin node mode: all|job|query. | No |
kylin.server.cluster-servers | localhost:7070 | No | |
kylin.server.cluster-name | No | ||
kylin.server.query-metrics-enabled | false | No | |
kylin.server.query-metrics2-enabled | false | No | |
kylin.server.auth-user-cache.expire-seconds | 300 | No | |
kylin.server.auth-user-cache.max-entries | 100 | No | |
kylin.server.external-acl-provider | No | ||
kylin.security.ldap.user-search-base | No | ||
kylin.security.ldap.user-group-search-base | No | ||
kylin.security.acl.admin-role | No | ||
kylin.web.timezone | PST | No | |
kylin.web.cross-domain-enabled | true | No | |
kylin.web.export-allow-admin | true | No | |
kylin.web.export-allow-other | true | No | |
kylin.web.dashboard-enabled | false | No |