How to Build

Environment requirements: JDK 8 or later is required. Both Oracle/Sun JDK and OpenJDK are supported.
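As a quick pre-flight check, the JDK requirement can be verified from the shell. The sketch below is only an illustration: `java_major_version` is a hypothetical helper name, not part of Linkis, and it assumes the usual version string formats (`1.8.0_292` for JDK 8, `11.0.2` for later releases).

```shell
# Hypothetical helper: print the major Java version for a version string.
# Handles the legacy "1.x" scheme (1.8.0_292 -> 8) and the newer one (11.0.2 -> 11).
java_major_version() (
  v="$1"
  case "$v" in
    1.*) v="${v#1.}" ;;            # drop the legacy "1." prefix
  esac
  printf '%s\n' "${v%%[!0-9]*}"    # keep only the leading digits
)

# Usage sketch (assumes `java` is on the PATH):
#   raw="$(java -version 2>&1 | awk -F'"' '/version/ {print $2; exit}')"
#   [ "$(java_major_version "$raw")" -ge 8 ] || echo "JDK 8 or later is required" >&2
```

The usage lines are left as comments so that the helper can be sourced on a machine without Java installed.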

After obtaining the project code from the GitHub repository https://github.com/apache/linkis, use Maven to compile the project installation package.

Note: The officially recommended versions for compiling Linkis are hadoop-2.7.2, hive-1.2.1, spark-2.4.3, and Scala-2.11.12.

To compile Linkis with other versions of Hadoop, Hive, or Spark, refer to: How to Modify Linkis dependency of Hadoop, Hive, Spark

Note

Because the mysql-connector-java driver is licensed under GPL 2.0, which does not comply with the license policy of the Apache open source license, the scope of the mysql-connector-java dependency has been test by default since version 1.0.3. If you compile Linkis yourself, you can change this scope in the top-level pom.xml by commenting out the scope element:

    <dependency>
        <groupId>mysql</groupId>
        <artifactId>mysql-connector-java</artifactId>
        <version>${mysql.connector.version}</version>
        <!--<scope>test</scope>-->
    </dependency>

If you are compiling Linkis locally for the first time, you must first run the following commands, which install the parent POM from the root directory of the Linkis source package:

    cd linkis-x.x.x
    mvn -N install

Then run the full build from the root directory of the Linkis source package:

    cd linkis-x.x.x
    mvn clean install
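The two steps can be wrapped in a small shell function. This is a minimal sketch, not an official script: `build_linkis` is a hypothetical name, and it assumes `mvn` is on the PATH. The first-time step only installs the parent POM, so repeating it on later builds should be harmless.

```shell
# Hypothetical wrapper around the first-time step and the full build.
# The body runs in a subshell, so the caller's working directory is unchanged.
build_linkis() (
  src_dir="$1"
  if [ ! -f "$src_dir/pom.xml" ]; then
    echo "No pom.xml found in '$src_dir'" >&2
    exit 1
  fi
  cd "$src_dir" || exit 1
  # Install the parent POM only (-N: do not recurse into modules), then build everything.
  mvn -N install && mvn clean install
)

# Usage: build_linkis linkis-x.x.x
```

The guard on pom.xml makes the function fail early when it is pointed at the wrong directory instead of starting Maven there.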

The complete compiled installation package is in the linkis-dist/target directory of the project:

    # Detailed path is as follows
    linkis-x.x.x/linkis-dist/target/apache-linkis-x.x.x-incubating-bin.tar.gz

If you are compiling Linkis locally for the first time, you must first run the following commands, which install the parent POM from the root directory of the Linkis source package:

    cd linkis-x.x.x
    mvn -N install

Enter the directory of the module you want to compile. For example, to recompile Entrance, run:

    cd linkis-x.x.x/linkis-computation-governance/linkis-entrance
    mvn clean install

The compiled package is in the target directory of the corresponding module:

    linkis-x.x.x/linkis-computation-governance/linkis-entrance/target/linkis-entrance-x.x.x.jar

The following example builds the Spark engine of Linkis.

If you are building locally for the first time, you must first run the following commands, which install the parent POM from the root directory of the Linkis source package:

    cd linkis-x.x.x
    mvn -N install

Enter the directory where the Spark engine is located, then compile and package it:

    cd linkis-x.x.x/linkis-engineconn-plugins/spark
    mvn clean install

The compiled package is in the target directory of the corresponding module:

    linkis-x.x.x/linkis-engineconn-plugins/spark/target/linkis-engineplugin-spark-x.x.x.jar

To install the Spark engine separately, refer to the Linkis Engine Plugin Installation Document.

Please note: Hadoop is a basic big data service, and Linkis must depend on Hadoop to compile. If you do not want to use a particular engine, you do not need to set its version or compile its engine plug-in.

The Hadoop version is modified differently from the versions of Spark, Hive, and other computing engines, as described below:

Enter the root directory of the Linkis source package and manually modify the Hadoop version in the pom.xml file, as follows:

    cd linkis-x.x.x
    vim pom.xml

    <properties>
        <hadoop.version>2.7.2</hadoop.version> <!-- Modify the Hadoop version number here -->
        <scala.version>2.11.12</scala.version>
        <jdk.compile.version>1.8</jdk.compile.version>
    </properties>
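For scripted builds, the same edit can be made without opening an editor. The following is a sketch under stated assumptions: `set_pom_property` is a hypothetical helper, the property is assumed to sit on a single line as shown above, and `3.3.1` in the usage line is only a placeholder version.

```shell
# Hypothetical helper: set <name>value</name> in a pom.xml.
# Assumes the element is on one line and that the value contains no
# characters that are special to sed (such as | or &).
# Every occurrence of the element in the file is updated.
set_pom_property() (
  pom="$1"; name="$2"; value="$3"
  tmp="$(mktemp)" || exit 1
  sed "s|<${name}>[^<]*</${name}>|<${name}>${value}</${name}>|" "$pom" > "$tmp" \
    && cat "$tmp" > "$pom"         # write back in place, keeping file permissions
  status=$?
  rm -f "$tmp"
  exit "$status"
)

# Usage, with 3.3.1 as a placeholder version:
#   set_pom_property linkis-x.x.x/pom.xml hadoop.version 3.3.1
```

Writing through a temporary file avoids relying on `sed -i`, whose syntax differs between GNU and BSD sed.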

Please note: If you use Hadoop 3, you need to modify the pom file of linkis-hadoop-common. In Hadoop versions below 2.8, the HDFS-related classes are in the hadoop-hdfs module, but in Hadoop 3.x the corresponding classes have been moved to the hadoop-hdfs-client module. Modify this file: Linkis/linkis-commons/linkis-hadoop-common/pom.xml, changing the dependency hadoop-hdfs to hadoop-hdfs-client:

    <dependency>
        <groupId>org.apache.hadoop</groupId>
        <artifactId>hadoop-hdfs</artifactId> <!-- Replace this line with <artifactId>hadoop-hdfs-client</artifactId> -->
        <version>${hadoop.version}</version>
    </dependency>

After the change, the dependency reads:

    <dependency>
        <groupId>org.apache.hadoop</groupId>
        <artifactId>hadoop-hdfs-client</artifactId>
        <version>${hadoop.version}</version>
    </dependency>
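The artifactId swap can also be applied from the shell. This is an illustrative sketch only: `use_hdfs_client` is a hypothetical helper name, and it assumes the artifactId element is written on one line, as in the snippet above.

```shell
# Hypothetical helper: switch the hadoop-hdfs dependency to hadoop-hdfs-client.
# The pattern matches the whole element, so an existing
# <artifactId>hadoop-hdfs-client</artifactId> entry is left untouched.
use_hdfs_client() (
  pom="$1"
  tmp="$(mktemp)" || exit 1
  sed 's|<artifactId>hadoop-hdfs</artifactId>|<artifactId>hadoop-hdfs-client</artifactId>|' \
    "$pom" > "$tmp" && cat "$tmp" > "$pom"
  status=$?
  rm -f "$tmp"
  exit "$status"
)

# Usage: use_hdfs_client Linkis/linkis-commons/linkis-hadoop-common/pom.xml
```

Running the helper a second time changes nothing, because the original element no longer appears in the file.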

The following example changes the version of Spark. Go to the directory where the Spark engine is located and manually modify the Spark version in the pom.xml file, as follows:

    cd linkis-x.x.x/linkis-engineconn-plugins/spark
    vim pom.xml

    <properties>
        <spark.version>2.4.3</spark.version> <!-- Modify the Spark version number here -->
    </properties>
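Before compiling, it can be useful to confirm which version the pom.xml currently declares. The sketch below is an assumption-laden convenience, not a Linkis tool: `get_pom_property` is a hypothetical helper that does a simple text match and expects the property on a single line.

```shell
# Hypothetical helper: print the first value of <name>...</name> found in a pom.xml.
# This is a plain text match, not an XML parser.
get_pom_property() (
  pom="$1"; name="$2"
  sed -n "s|.*<${name}>\([^<]*\)</${name}>.*|\1|p" "$pom" | head -n 1
)

# Usage, from the Spark engine directory:
#   get_pom_property pom.xml spark.version
```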

Modifying the version of other engines is similar to modifying the Spark version. First, enter the directory where the relevant engine is located and manually modify the engine version in the pom.xml file.

Then compile the engine as described in 4. Compile an engine.

You can use the -pl option of the mvn command to limit which modules are built. Its help text reads:

    -pl,--projects <arg>    Comma-delimited list of specified
                            reactor projects to build instead
                            of all projects. A project can be
                            specified by [groupId]:artifactId
                            or by its relative path.

Use ! to invert the selection and exclude the given engines, which shortens the time needed for a full compilation. The following example excludes flink, sqoop, and hive from a full compilation:

    mvn clean install -Dmaven.test.skip=true \
        -pl '!linkis-engineconn-plugins/flink,!linkis-engineconn-plugins/sqoop,!linkis-engineconn-plugins/hive'
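When the list of excluded engines changes often, the -pl argument can be generated instead of typed by hand. This is a minimal sketch: `exclude_engines_arg` is a hypothetical helper, and it assumes every engine lives directly under linkis-engineconn-plugins/ as in the command above.

```shell
# Hypothetical helper: build the value for -pl that excludes the given engines.
exclude_engines_arg() (
  # Single quotes keep "!" literal, even in an interactive shell with history expansion.
  prefix='!linkis-engineconn-plugins/'
  arg=""
  for engine in "$@"; do
    arg="${arg:+$arg,}${prefix}${engine}"   # append with a comma after the first entry
  done
  printf '%s\n' "$arg"
)

# Usage:
#   mvn clean install -Dmaven.test.skip=true -pl "$(exclude_engines_arg flink sqoop hive)"
```

Called with `flink sqoop hive`, the helper prints the same comma-separated value that is passed to -pl in the command above.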