Kylin 配置

Kylin 会自动从环境中检测 Hadoop/Hive/HBase 配置,如 “core-site.xml”, “hbase-site.xml” 和其他。除此之外,Kylin 有自己的配置,在 “conf” 文件夹下。

  1. -bash-4.1# ls -l $KYLIN_HOME/conf
  2. kylin_hive_conf.xml
  3. kylin_job_conf_inmem.xml
  4. kylin_job_conf.xml
  5. kylin-kafka-consumer.xml
  6. kylin.properties
  7. kylin-server-log4j.properties
  8. kylin-tools-log4j.properties
  9. setenv.sh

kylin_hive_conf.xml

Kylin 从 Hive 中取数据时应用的 Hive 配置。

kylin_job_conf.xml and kylin_job_conf_inmem.xml

Kylin 运行 MapReduce jobs 时的 Hadoop MR 配置。在 Kylin 的 “In-mem cubing” job 的时候,”kylin_job_conf_inmem.xml” 需要更多的 memory 给 mapper。

kylin-kafka-consumer.xml

Kylin 从 Kafka brokers 中取数据时应用的 Kafka 配置。

kylin-server-log4j.properties

Kylin 服务器的日志配置。

kylin-tools-log4j.properties

Kylin 命令行的日志配置。

setenv.sh

设置环境变量的 shell 脚本。它将在 “kylin.sh” 和 “bin” 文件夹中的其它脚本中被调用。通常,您可以在这里调整 Kylin JVM 栈的大小,且可以设置 “KAFKA_HOME” 和其他环境变量。

kylin.properties

Kylin 的主要配置文件。

KeyDefault valueDescriptionOverwritten at Cube
kylin.envDevWhether this env is a Dev, QA, or Prod environmentNo
kylin.env.hdfs-working-dir/kylinWorking directory on HDFSNo
kylin.env.zookeeper-base-path/kylinPath on ZKNo
kylin.env.zookeeper-connect-string ZK connection string; If blank, use HBase’s ZKNo
kylin.env.zookeeper-acl-enabledfalse No
kylin.env.zookeeper.zk-authdigest:ADMIN:KYLIN No
kylin.env.zookeeper.zk-aclworld:anyone:rwcda No
kylin.metadata.urlkylinmetadata@hbaseKylin metadata storageNo
kylin.metadata.sync-retries3 No
kylin.metadata.sync-error-handler  No
kylin.metadata.check-copy-on-writefalse No
kylin.metadata.hbase-client-scanner-timeout-period10000 No
kylin.metadata.hbase-rpc-timeout5000 No
kylin.metadata.hbase-client-retries-number1 No
kylin.dictionary.use-forest-trietrue No
kylin.dictionary.forest-trie-max-mb500 No
kylin.dictionary.max-cache-entry3000 No
kylin.dictionary.growing-enabledfalse No
kylin.dictionary.append-entry-size10000000 No
kylin.dictionary.append-max-versions3 No
kylin.dictionary.append-version-ttl259200000 No
kylin.snapshot.max-cache-entry500 No
kylin.snapshot.max-mb300 No
kylin.snapshot.ext.shard-mb500 No
kylin.snapshot.ext.local.cache.pathlookup_cache No
kylin.snapshot.ext.local.cache.max-size-gb200 No
kylin.cube.size-estimate-ratio0.25 Yes
kylin.cube.size-estimate-memhungry-ratio0.05DeprecatedYes
kylin.cube.size-estimate-countdistinct-ratio0.05 Yes
kylin.cube.algorithmautoCubing algorithm for MR engine, other options: layer, inmemYes
kylin.cube.algorithm.layer-or-inmem-threshold7 Yes
kylin.cube.algorithm.inmem-split-limit500 Yes
kylin.cube.algorithm.inmem-concurrent-threads1 Yes
kylin.cube.ignore-signature-inconsistencyfalse  
kylin.cube.aggrgroup.max-combination4096Max cuboid numbers in a CubeYes
kylin.cube.aggrgroup.is-mandatory-only-validfalseWhether allow a Cube only has the base cuboid.Yes
kylin.cube.rowkey.max-size63Max columns in Rowkey, and it can not be more than 63No
kylin.metadata.dimension-encoding-max-length256Max length for one dimension’s encodingYes
kylin.cube.max-building-segments10Max building segments in one CubeYes
kylin.cube.allow-appear-in-multiple-projectsfalseWhether allow a Cueb appeared in multiple projectsNo
kylin.cube.gtscanrequest-serialization-level1  
kylin.cube.is-automerge-enabledtrueWhether enable auto merge.Yes
kylin.job.log-dir/tmp/kylin/logs  
kylin.job.allow-empty-segmenttrueWhether tolerant data source is emtpy.Yes
kylin.job.max-concurrent-jobs10Max concurrent running jobsNo
kylin.job.sampling-percentage100Data sampling percentage, to calculate Cube statistics; Default be all.Yes
kylin.job.notification-enabledfalseWhether send email notification on job error/succeed.No
kylin.job.notification-mail-enable-starttlsfalse No
kylin.job.notification-mail-port25 No
kylin.job.notification-mail-host  No
kylin.job.notification-mail-username  No
kylin.job.notification-mail-password  No
kylin.job.notification-mail-sender  No
kylin.job.notification-admin-emails  No
kylin.job.retry0 No
    
kylin.job.scheduler.priority-consideredfalse No
kylin.job.scheduler.priority-bar-fetch-from-queue20 No
kylin.job.scheduler.poll-interval-second30 No
kylin.job.error-record-threshold0 No
kylin.source.hive.keep-flat-tablefalseWhether keep the intermediate Hive table after job finished.No
kylin.source.hive.database-for-flat-tabledefaultHive database to create the intermediate table.No
kylin.source.hive.flat-table-storage-formatSEQUENCEFILE No
kylin.source.hive.flat-table-field-delimiter\u001F No
kylin.source.hive.redistribute-flat-tabletrueWhether or not to redistribute the flat table.Yes
kylin.source.hive.clientcli No
kylin.source.hive.beeline-shellbeeline No
kylin.source.hive.beeline-params  No
kylin.source.hive.enable-sparksql-for-table-opsfalse No
kylin.source.hive.sparksql-beeline-shell  No
kylin.source.hive.sparksql-beeline-params  No
kylin.source.hive.table-dir-create-firstfalse No
kylin.source.hive.flat-table-cluster-by-dict-column   
kylin.source.hive.default-varchar-precision256 No
kylin.source.hive.default-char-precision255 No
kylin.source.hive.default-decimal-precision19 No
kylin.source.hive.default-decimal-scale4 No
kylin.source.jdbc.connection-url   
kylin.source.jdbc.driver   
kylin.source.jdbc.dialectdefault  
kylin.source.jdbc.user   
kylin.source.jdbc.pass   
kylin.source.jdbc.sqoop-home   
kylin.source.jdbc.sqoop-mapper-num4  
kylin.source.jdbc.field-delimiter|  
kylin.storage.default2 No
kylin.storage.hbase.table-name-prefixKYLIN No
kylin.storage.hbase.namespacedefault No
kylin.storage.hbase.cluster-fs   
kylin.storage.hbase.cluster-hdfs-config-file   
kylin.storage.hbase.coprocessor-local-jar   
kylin.storage.hbase.min-region-count1  
kylin.storage.hbase.max-region-count500  
kylin.storage.hbase.hfile-size-gb2.0  
kylin.storage.hbase.run-local-coprocessorfalse  
kylin.storage.hbase.coprocessor-mem-gb3.0  
kylin.storage.partition.aggr-spill-enabledtrue  
kylin.storage.partition.max-scan-bytes3221225472  
kylin.storage.hbase.coprocessor-timeout-seconds0  
kylin.storage.hbase.max-fuzzykey-scan200  
kylin.storage.hbase.max-fuzzykey-scan-split1  
kylin.storage.hbase.max-visit-scanrange1000000  
kylin.storage.hbase.scan-cache-rows1024  
kylin.storage.hbase.region-cut-gb5.0  
kylin.storage.hbase.max-scan-result-bytes5242880  
kylin.storage.hbase.compression-codecnone  
kylin.storage.hbase.rowkey-encodingFAST_DIFF  
kylin.storage.hbase.block-size-bytes1048576  
kylin.storage.hbase.small-family-block-size-bytes65536  
kylin.storage.hbase.owner-tag   
kylin.storage.hbase.endpoint-compress-resulttrue  
kylin.storage.hbase.max-hconnection-threads2048  
kylin.storage.hbase.core-hconnection-threads2048  
kylin.storage.hbase.hconnection-threads-alive-seconds60  
kylin.engine.mr.lib-dir   
kylin.engine.mr.reduce-input-mb500  
kylin.engine.mr.reduce-count-ratio1.0  
kylin.engine.mr.min-reducer-number1  
kylin.engine.mr.max-reducer-number500  
kylin.engine.mr.mapper-input-rows1000000  
kylin.engine.mr.max-cuboid-stats-calculator-number1  
kylin.engine.mr.uhc-reducer-count1  
kylin.engine.mr.build-uhc-dict-in-additional-stepfalse  
kylin.engine.mr.build-dict-in-reducertrue  
kylin.engine.mr.yarn-check-interval-seconds10  
kylin.env.hadoop-conf-dir Hadoop conf directory; If not specified, parse from environment.No
kylin.engine.spark.rdd-partition-cut-mb10.0Spark Cubing RDD partition split size.Yes
kylin.engine.spark.min-partition1Spark Cubing RDD min partition numberYes
kylin.engine.spark.max-partition5000RDD max partition numberYes
kylin.engine.spark.storage-levelMEMORY_AND_DISK_SERRDD persistent level.Yes
kylin.query.skip-empty-segmentstrueWhether directly skip empty segment (metadata shows size be 0) when run SQL query.Yes
kylin.query.force-limit-1  
kylin.query.max-scan-bytes0  
kylin.query.max-return-rows5000000  
kylin.query.large-query-threshold1000000  
kylin.query.cache-threshold-duration2000  
kylin.query.cache-threshold-scan-count10240  
kylin.query.cache-threshold-scan-bytes1048576  
kylin.query.security-enabledtrue  
kylin.query.cache-enabledtrue  
kylin.query.timeout-seconds0  
kylin.query.pushdown.runner-class-name   
kylin.query.pushdown.update-enabledfalse  
kylin.query.pushdown.cache-enabledfalse  
kylin.query.pushdown.jdbc.url   
kylin.query.pushdown.jdbc.driver   
kylin.query.pushdown.jdbc.username   
kylin.query.pushdown.jdbc.password   
kylin.query.pushdown.jdbc.pool-max-total8  
kylin.query.pushdown.jdbc.pool-max-idle8  
kylin.query.pushdown.jdbc.pool-min-idle0  
kylin.query.security.table-acl-enabledtrue No
kylin.server.modeallKylin node mode: all|job|query.No
kylin.server.cluster-serverslocalhost:7070 No
kylin.server.cluster-name  No
kylin.server.query-metrics-enabledfalse No
kylin.server.query-metrics2-enabledfalse No
kylin.server.auth-user-cache.expire-seconds300 No
kylin.server.auth-user-cache.max-entries100 No
kylin.server.external-acl-provider  No
kylin.security.ldap.user-search-base  No
kylin.security.ldap.user-group-search-base  No
kylin.security.acl.admin-role  No
kylin.web.timezonePST No
kylin.web.cross-domain-enabledtrue No
kylin.web.export-allow-admintrue No
kylin.web.export-allow-othertrue No
kylin.web.dashboard-enabledfalse No