书栈网 · BookStack 本次搜索耗时 0.028 秒,为您找到 621 个相关结果.
  • CLI

    CLI Local set up Hudi CLI Bundle setup Base path Using Hudi-cli in S3 Note: These AWS jar versions below are specific to Spark 3.2.0 Using hudi-cli on Google Dataproc Connec...
  • 对比

    对比 Kudu Hive事务 HBase 流式处理 对比 Apache Hudi填补了在DFS上处理数据的巨大空白,并可以和这些技术很好地共存。然而, 通过将Hudi与一些相关系统进行对比,来了解Hudi如何适应当前的大数据生态系统,并知晓这些系统在设计中做的不同权衡仍将非常有用。 Kudu Apache Kudu 是一个与Hudi具有...
  • AWS S3

    S3 Filesystem AWS configs AWS Credentials AWS Libs S3 Filesystem In this page, we explain how to get your Hudi spark job to store into AWS S3. AWS configs There are two c...
  • AWS S3

    S3 Filesystem AWS configs AWS Credentials AWS Libs S3 Filesystem In this page, we explain how to get your Hudi spark job to store into AWS S3. AWS configs There are two c...
  • Bootstrapping

    Bootstrapping Approaches Use Hudi for new partitions alone Convert existing table to Hudi Using Hudi Streamer Using Spark Datasource Writer Using Spark SQL CALL Procedure Using...
  • Table Services

    Table Services FAQ What does the Hudi cleaner do? How do I run compaction for a MOR table? What options do I have for asynchronous/offline compactions on MOR table? How to disab...
  • Concurrency Control

    Concurrency Control Distributed Locking Zookeeper based HiveMetastore based Amazon DynamoDB based FileSystem based (not for production use) Simple Single writer + table servi...
  • Microsoft Azure

    Azure Filesystem Disclaimer Supported Storage System Verified Combination of Spark and storage system HDInsight Spark2.4 on Azure Data Lake Storage Gen 2 Databricks Spark2.4 on ...
  • Microsoft Azure

    Azure Filesystem Disclaimer Supported Storage System Verified Combination of Spark and storage system HDInsight Spark2.4 on Azure Data Lake Storage Gen 2 Databricks Spark2.4 on ...
  • Microsoft Azure

    Azure Filesystem Disclaimer Supported Storage System Verified Combination of Spark and storage system HDInsight Spark2.4 on Azure Data Lake Storage Gen 2 Databricks Spark2.4 on ...