Stand-alone deployment

Hardware Requirements
Install nearly 6 linkis microservices, at least 3G memory. The default jvm -Xmx memory size of each microservice is 512M (if the memory is not enough, you can try to reduce it to 256/128M, and you can also increase it if the memory is enough).

Deployment user: The starting user of the linkis core process, and this user will be the administrator by default. The corresponding administrator login password will be generated during the deployment process, located in conf/linkis-mg-gateway .propertiesfile Linkis supports specifying users for submission and execution. The linkis main process service will switch to the corresponding user through sudo -u ${linkis-user}, and then execute the corresponding engine start command, so the user of the engine linkis-engine process is the executor of the task (so the deployment The user needs to have sudo authority, and it is password-free).

Take hadoop users as an example (Many configuration users in linkis use hadoop users by default. It is recommended that first-time installers use hadoop users, otherwise many unexpected errors may be encountered during the installation process):

First check whether there is already a hadoop user in the system, if it already exists, just authorize it directly, if not, create a user first, and then authorize.

Check if hadoop user already exists

  1. $ id hadoop
  2. uid=2001(hadoop) gid=2001(hadoop) groups=2001(hadoop)

If it does not exist, you need to create a hadoop user and join the hadoop user group

  1. $ sudo useradd hadoop -g hadoop
  2. $ vi /etc/sudoers
  3. #Secret-free configuration
  4. hadoop ALL=(ALL) NOPASSWD: NOPASSWD: ALL

The following operations are performed under the hadoop user

Linkis depends on such engines as bellow,all the mandatory engines will be check in installation scripts${LINKIS_HOME}/bin/checkEnv.sh

EngineTypeNecessaryInstallation Guide
JDK(1.8.0 141)mandatoryInstall JDK and setting JAVA_HOME
mysql(5.5+)mandatoryMySQL installation
Python(3.6.8)mandatoryPython installation and user guide
Nginx(1.14.1)mandatoryNginx installation
Hadoop((2.7.2)mandatoryHadoop quickstart
Spark(2.4.3)mandatorySpark download and installtion
Hive(3.1.3)mandatoryHive installation

After uploading the installation package apache-linkis-xxx-bin.tar.gz, decompress the installation package

  1. $ tar -xvf apache-linkis-xxx-bin.tar.gz

The directory structure after decompression is as follows

  1. -rw-r--r-- 1 hadoop hadoop 518192043 Jun 20 09:50 apache-linkis-xxx-bin.tar.gz
  2. drwxrwxr-x 2 hadoop hadoop 4096 Jun 20 09:56 bin //execute environment check and install script
  3. drwxrwxr-x 2 hadoop hadoop 4096 Jun 20 09:56 deploy-config // Deployment dependent DB and other environment configuration information
  4. drwxrwxr-x 4 hadoop hadoop 4096 Jun 20 09:56 docker
  5. drwxrwxr-x 4 hadoop hadoop 4096 Jun 20 09:56 helm
  6. -rwxrwxr-x 1 hadoop hadoop 84732 Jan 22 2020 LICENSE
  7. drwxr-xr-x 2 hadoop hadoop 20480 Jun 20 09:56 licenses
  8. drwxrwxr-x 7 hadoop hadoop 4096 Jun 20 09:56 linkis-package // actual software package, including lib/service startup script tool/db initialization script/microservice configuration file, etc.
  9. -rwxrwxr-x 1 hadoop hadoop 119503 Jan 22 2020 NOTICE
  10. -rw-r--r-- 1 hadoop hadoop 11959 Jan 22 2020 README_CN.md
  11. -rw-r--r-- 1 hadoop hadoop 12587 Jan 22 2020 README.md

vim deploy-config/linkis-env.sh

  1. # Select linkis business database type, default mysql
  2. # If using postgresql, please change to postgresql
  3. # Note: The current configuration only applies to linkis>=1.4.0
  4. dbType=mysql

vim deploy-config/db.sh

  1. # Linkis's own business database information - mysql
  2. MYSQL_HOST=xx.xx.xx.xx
  3. MYSQL_PORT=3306
  4. MYSQL_DB=linkis_test
  5. MYSQL_USER=test
  6. MYSQL_PASSWORD=xxxxx
  7. # Linkis's own business database information - postgresql
  8. # Note: The following configuration is only applicable to linkis>=1.4.0
  9. PG_HOST=xx.xx.xx.xx
  10. PG_PORT=5432
  11. PG_DB=linkis_test
  12. PG_SCHEMA=linkis_test
  13. PG_USER=test
  14. PG_PASSWORD=123456
  15. # Provide the DB information of the Hive metadata database. If the hive engine is not involved (or just a simple trial), it is not necessary to configure
  16. #Mainly used together with scriptis, if not configured, it will try to get it through the configuration file in $HIVE_CONF_DIR by default
  17. HIVE_META_URL="jdbc:mysql://10.10.10.10:3306/hive_meta_demo?useUnicode=true&characterEncoding=UTF-8"
  18. HIVE_META_USER=demo # User of the HiveMeta metabase
  19. HIVE_META_PASSWORD=demo123 # Password of the HiveMeta metabase

The file is located at deploy-config/linkis-env.sh.

  1. deployUser=hadoop #The user who executes the deployment is the user created in step 1.2

Stand-alone deployment - 图1Caution

Determine whether it needs to be adjusted according to the actual situation, and you can choose to use the default value

  1. # Specify the directory path used by the user, which is generally used to store the user's script files and log files, etc., and is the user's workspace. The corresponding configuration file configuration item is wds.linkis.filesystem.root.path(linkis.properties)
  2. WORKSPACE_USER_ROOT_PATH=file:///tmp/linkis
  3. # The result set log and other file paths are used to store the result set file of the Job wds.linkis.resultSet.store.path(linkis-cg-entrance.properties) //If the configuration of HDFS_USER_ROOT_PATH is not configured
  4. RESULT_SET_ROOT_PATH=file:///tmp/linkis
  5. # Result set log and other file paths, used to store the result set file of Job wds.linkis.filesystem.hdfs.root.path(linkis.properties)
  6. HDFS_USER_ROOT_PATH=hdfs:///tmp/linkis
  7. # To store the working path of the execution engine, a local directory wds.linkis.engineconn.root.dir(linkis-cg-engineconnmanager.properties) where the deployment user has write permissions is required
  8. ENGINECONN_ROOT_PATH=/appcom/tmp

Stand-alone deployment - 图2Caution

If you need to use the Spark engine, you need to configure

  1. #You can check whether it can be accessed normally by visiting http://xx.xx.xx.xx:8088/ws/v1/cluster/scheduler interface
  2. YARN_RESTFUL_URL=http://xx.xx.xx.xx:8088

When executing the spark task, you need to use the ResourceManager of yarn. Linkis defaults that permission verification is not enabled. If the ResourceManager has enabled password permission verification, please install and deploy. Modify the database table linkis_cg_rm_external_resource_provider to insert yarn data information, for details, please refer to [Check whether the yarn address is configured correctly] (#811-Check whether the yarn address is configured correctly)

Stand-alone deployment - 图3Caution

It can be configured through the user’s system environment variables. If it is configured through the system environment variables, it can be commented out directly without configuration in the deploy-config/linkis-env.sh configuration file.

  1. ##If you do not use Hive, Spark and other engines and do not rely on Hadoop, you do not need to configure the following environment variables
  2. #HADOOP
  3. HADOOP_HOME=/appcom/Install/hadoop
  4. HADOOP_CONF_DIR=/appcom/config/hadoop-config
  5. #Hive
  6. HIVE_HOME=/appcom/Install/hive
  7. HIVE_CONF_DIR=/appcom/config/hive-config
  8. #Spark
  9. SPARK_HOME=/appcom/Install/spark
  10. SPARK_CONF_DIR=/appcom/config/spark-config

Stand-alone deployment - 图4Caution

The default is to use a static user and password. The static user is the deployment user. The static password will randomly generate a password string during deployment and store it in ${LINKIS_HOME}/conf/linkis-mg-gateway.properties(>=1.0. 3 version).

  1. #LDAP configuration, by default Linkis only supports deployment user login, if you need to support multi-user login, you can use LDAP, you need to configure the following parameters:
  2. #LDAP_URL=ldap://localhost:1389/
  3. #LDAP_BASEDN=dc=webank,dc=com

Microservice starts jvm memory configuration, which can be adjusted according to the actual situation of the machine. If the machine has less memory resources, you can try to reduce it to 256/128M

  1. ## java application default jvm memory
  2. export SERVER_HEAP_SIZE="512M"

Linkis will eventually be installed in this directory, if not configured, it will be in the same directory as the current installation package by default

  1. ##The decompression directory and the installation directory need to be inconsistent
  2. LINKIS_HOME=/appcom/Install/LinkisInstall

Deploy the Linkis service in an environment without HDFS to facilitate lighter learning, use and debugging. Deploying in HDFS mode does not support tasks such as hive/spark/flink engines

Modify linkis-env.sh file, modify the following content

  1. #Use [file://] path pattern instead of [hdfs://] pattern
  2. WORKSPACE_USER_ROOT_PATH=file:///tmp/linkis/
  3. HDFS_USER_ROOT_PATH=file:///tmp/linkis
  4. RESULT_SET_ROOT_PATH=file:///tmp/linkis
  5. export ENABLE_HDFS=false
  6. export ENABLE_HIVE=false
  7. export ENABLE_SPARK=false

Linkis does not enable kerberos authentication by default. If the hive cluster used enables kerberos authentication, the following parameters need to be configured.

Modify the linkis-env.sh file, the modified content is as follows

  1. #HADOOP
  2. HADOOP_KERBEROS_ENABLE=true
  3. HADOOP_KEYTAB_PATH=/appcom/keytab/

The file is located in bin/install.sh

Linkis 1.3.2 version has changed the Token value to 32-bit random generation to ensure system security. For details, please refer to [Token Change Description](https://linkis.apache.org/zh-CN/docs/1.3.2/ feature/update-token/).

Using randomly generated Token, you will encounter a lot of Token verification failure problems when connecting with WDS other components for the first time. It is recommended to install it for the first time When not using random generated Token, modify the following configuration to true.

  1. DEBUG_MODE=true

Full installation

For the full installation of the new version of Linkis, the install.sh script will automatically process the configuration file and keep the database Token consistent. Therefore, the Token of the Linkis service itself does not need to be modified. Each application can query and use the new token through the management console.

version upgrade

When the version is upgraded, the database Token is not modified, so there is no need to modify the configuration file and application Token.

Token expiration issue

When the Token token is invalid or has expired, you can check whether the Token is configured correctly. You can query the Token through the management console ==> Basic Data Management ==> Token Management.

Python version issue After Linkis is upgraded to 1.4.0, the default Spark version is upgraded to 3.x, which is not compatible with python2. Therefore, if you need to use the pyspark function, you need to make the following modifications.

  1. Map python2 commands to python3
  1. sudo ln -snf /usr/bin/python3 /usr/bin/python2
  1. Spark engine connector configuration $LINKIS_HOME/lib/linkis-engineconn-plugins/spark/dist/3.2.1/conf/linkis-engineconn.properties Add the following configuration to specify the python installation path
  1. pyspark.python3.path=/usr/bin/python3
  1. sh bin/install.sh

The install.sh script will ask you if you want to initialize the database and import metadata. If you choose to initialize, the table data in the database will be cleared and reinitialized.

You must choose to clear the database for the first installation

Stand-alone deployment - 图5note

  • If an error occurs, and it is not clear what command to execute to report the error, you can add the -x parameter sh -x bin/install.sh to print out the log of the shell script execution process, which is convenient for locating the problem.
  • Permission problem: mkdir: cannot create directory 'xxxx': Permission denied, please confirm whether the deployment user has read and write permissions for this path.

The prompt for successful execution is as follows:

  1. `Congratulations! You have installed Linkis xxx successfully, please use sh /data/Install/linkis/sbin/linkis-start-all.sh to start it!
  2. Your default account password is [hadoop/5e8e312b4]`

Stand-alone deployment - 图6Caution

Because the mysql-connector-java driver is under the GPL2.0 agreement, it does not meet the license policy of the Apache open source agreement. Therefore, starting from version 1.0.3, the official deployment package of the Apache version provided does not have mysql-connector-java-xxxjar by default. Dependency package (If you install it through the integrated family bucket material package, you don’t need to add it manually), you need to add dependencies to the corresponding lib package yourself when installing and deploying. You can check whether it exists in the corresponding directory. If it does not exist, you need to add it.

Download mysql driver Take version 8.0.28 as an example: [Download link](https://repo1.maven.org/maven2/mysql/mysql-connector-java/8.0.28/mysql-connector-java-8.0.28. jar)

Copy the mysql driver package to the lib package

  1. cp mysql-connector-java-8.0.28.jar ${LINKIS_HOME}/lib/linkis-spring-cloud-services/linkis-mg-gateway/
  2. cp mysql-connector-java-8.0.28.jar ${LINKIS_HOME}/lib/linkis-commons/public-module/

If you choose to use postgresql as the business database, you need to manually add the postgresql driver Download postgresql driver Take version 42.5.4 as an example: Download link Copy the postgresql driver package to the lib package

  1. cp postgresql-42.5.4.jar ${LINKIS_HOME}/lib/linkis-spring-cloud-services/linkis-mg-gateway/
  2. cp postgresql-42.5.4.jar ${LINKIS_HOME}/lib/linkis-commons/public-module/

The following operations are related to the dependent environment. According to the actual situation, determine whether the operation is required

When executing spark tasks, you need to use the ResourceManager of yarn, which is controlled by the configuration item YARN_RESTFUL_URL=http://xx.xx.xx.xx:8088. When performing installation and deployment, the YARN_RESTFUL_URL=http://xx.xx.xx.xx:8088 information will be updated to linkis_cg_rm_external_resource_provider in the database table. By default, access to yarn resources does not require authorization verification. If the resourcemanager of yarn has enabled the password authentication, please modify the yarn data information generated in the database table linkis_cg_rm_external_resource_provider after installation and deployment, For details, please refer to [Check whether the yarn address is configured correctly] (#811-Check whether the yarn address is configured correctly).

If you are an upgrade to Linkis. Deploy DSS or other projects at the same time, but the version of linkis introduced in other software is <1.1.1 (mainly in the lib package, the linkis-module-xxxjar package of Linkis that depends on it <1.1.1), you need to modify the $ {LINKIS_HOME}/conf/linkis.properties file.

  1. echo "wds.linkis.session.ticket.key=bdp-user-ticket-id" >> linkis.properties

Currently supports storing engine execution logs and results to the S3 file system

Note: linkis does not adapt permissions to S3, so it cannot perform authorization operations on it

vim $LINKIS_HOME/conf/linkis.properties

  1. # s3 file system
  2. linkis.storage.s3.access.key=xxx
  3. linkis.storage.s3.secret.key=xxx
  4. linkis.storage.s3.endpoint=http://xxx.xxx.xxx.xxx:xxx
  5. linkis.storage.s3.region=xxx
  6. linkis.storage.s3.bucket=xxx

vim $LINKIS_HOME/conf/linkis-cg-entrance.properties

  1. wds.linkis.entrance.config.log.path=s3:///linkis/logs
  2. wds.linkis.resultSet.store.path=s3:///linkis/results
  1. sh sbin/linkis-start-all.sh

After the installation is complete, if you need to modify the configuration (the configuration needs to be adjusted due to port conflicts or some configuration problems), you can re-execute the installation, or modify the configuration ${LINKIS_HOME}/conf/*properties file of the corresponding service, Restart the corresponding service, such as: sh sbin/linkis-daemon.sh start ps-publicservice.

Visit the eureka service page (http://eurekaip:20303), By default, 6 Linkis microservices will be started, and the linkis-cg-engineconn service in the figure below will only be started for running tasks. Linkis1.0_Eureka

  1. LINKIS-CG-ENGINECONNMANAGER Engine Management Service
  2. LINKIS-CG-ENTRANCE computing governance entry service
  3. LINKIS-CG-LINKISMANAGER Computing Governance Management Service
  4. LINKIS-MG-EUREKA Microservice Registry Service
  5. LINKIS-MG-GATEWAY Gateway Service
  6. LINKIS-PS-PUBLICSERVICE Public Service

Note: In Linkis 1.3.1, LINKIS-PS-CS, LINKIS-PS-DATA-SOURCE-MANAGER, LINKIS-PS-METADATAMANAGER services have been merged into LINKIS-PS-PUBLICSERVICE, and LINKIS-CG-ENGINEPLUGIN services have been merged into LINKIS -CG-LINKISMANAGER.

If any service is not started, you can check the detailed exception log in the corresponding log/${service name}.log file.

Linkis’s original default Token is fixed and the length is too short, posing security risks. Therefore, Linkis 1.3.2 changes the original fixed Token to random generation, and increases the length of the Token.

New Token format: application abbreviation - 32-bit random number, such as BML-928a721518014ba4a28735ec2a0da799.

Token may be used in the Linkis service itself, such as executing tasks through Shell, uploading BML, etc., or it may be used in other applications, such as DSS, Qualitis and other applications to access Linkis.

View via SQL statement

  1. select * from linkis_mg_gateway_auth_token;

View via Admin Console

Log in to the management console -> basic data management -> token management Stand-alone deployment - 图8

When the Linkis service itself uses Token, the Token in the configuration file must be consistent with the Token in the database. Match by applying the short name prefix.

$LINKIS_HOME/conf/linkis.properties file Token configuration

  1. linkis.configuration.linkisclient.auth.token.value=BML-928a721518014ba4a28735ec2a0da799
  2. wds.linkis.client.common.tokenValue=BML-928a721518014ba4a28735ec2a0da799
  3. wds.linkis.bml.auth.token.value=BML-928a721518014ba4a28735ec2a0da799
  4. wds.linkis.context.client.auth.value=BML-928a721518014ba4a28735ec2a0da799
  5. wds.linkis.errorcode.auth.token=BML-928a721518014ba4a28735ec2a0da799
  6. wds.linkis.client.test.common.tokenValue=LINKIS_CLI-215af9e265ae437ca1f070b17d6a540d
  7. wds.linkis.filesystem.token.value=WS-52bce72ed51741c7a2a9544812b45725
  8. wds.linkis.gateway.access.token=WS-52bce72ed51741c7a2a9544812b45725
  9. wds.linkis.server.dsm.auth.token.value=DSM-65169e8e1b564c0d8a04ee861ca7df6e

$LINKIS_HOME/conf/linkis-cli/linkis-cli.properties file Token configuration

  1. wds.linkis.client.common.tokenValue=BML-928a721518014ba4a28735ec2a0da799

When other applications use Token, they need to modify their Token configuration to be consistent with the Token in the database.

The web side uses nginx as the static resource server, and the access request process is: Linkis management console request->nginx ip:port->linkis-gateway ip:port->other services

  1. tar -xvf apache-linkis-xxx-web-bin.tar.gz
  1. #Access the port of the management console
  2. linkis_port="8188"
  3. #linkis-mg-gateway service address
  4. linkis_url="http://localhost:9020"
  1. # nginx needs sudo permission to install
  2. sudo sh install.sh

After installation, the nginx configuration file of linkis is in /etc/nginx/conf.d/linkis.conf by default The log files of nginx are in /var/log/nginx/access.log and /var/log/nginx/error.log An example of the generated nginx configuration file of the linkis management console is as follows:

  1. server {
  2. listen 8188;# If the access port is occupied, it needs to be modified
  3. server_name localhost;
  4. #charset koi8-r;
  5. #access_log /var/log/nginx/host.access.log main;
  6. location / {
  7. root /appcom/Install/linkis-web/dist; # static file directory
  8. index index.html index.html;
  9. }
  10. location /ws {
  11. proxy_pass http://localhost:9020;#The address of the backend Linkis
  12. proxy_http_version 1.1;
  13. proxy_set_header Upgrade $http_upgrade;
  14. proxy_set_header Connection upgrade;
  15. }
  16. location /api {
  17. proxy_pass http://localhost:9020; #The address of the backend Linkis
  18. proxy_set_header Host $host;
  19. proxy_set_header X-Real-IP $remote_addr;
  20. proxy_set_header x_real_ipP $remote_addr;
  21. proxy_set_header remote_addr $remote_addr;
  22. proxy_set_header X-Forwarded-For $proxy_add_x_forwarded_for;
  23. proxy_http_version 1.1;
  24. proxy_connect_timeout 4s;
  25. proxy_read_timeout 600s;
  26. proxy_send_timeout 12s;
  27. proxy_set_header Upgrade $http_upgrade;
  28. proxy_set_header Connection upgrade;
  29. }
  30. #error_page 404 /404.html;
  31. # redirect server error pages to the static page /50x.html
  32. #
  33. error_page 500 502 503 504 /50x.html;
  34. location = /50x.html {
  35. root /usr/share/nginx/html;
  36. }
  37. }

If you need to modify the port or static resource directory, etc., please modify the /etc/nginx/conf.d/linkis.conf file and execute the sudo nginx -s reload command

Stand-alone deployment - 图9Caution

  • At present, the visualis function is not integrated. During the installation process, if you are prompted to fail to install linkis/visualis, you can ignore it.
  • Check whether nginx starts normally: check whether the nginx process exists ps -ef |grep nginx.
  • Check whether the configuration of nginx is correct sudo nginx -T.
  • If the port is occupied, you can modify the service port /etc/nginx/conf.d/linkis.conflisten port value started by nginx, save and restart.
  • If there is an interface 502 when accessing the management console, or Unexpected token < in JSON at position 0 is abnormal, please confirm whether the linkis-mg-gateway is started normally. If it is started normally, check the linkis-mg-gateway configured in the nginx configuration file Whether the service address is correct.

Browser login http://xx.xx.xx.xx:8188/#/login Username/password can be checked in {LINKIS_HOME}/conf/linkis-mg-gateway.properties.

  1. wds.linkis.admin.user= #user
  2. wds.linkis.admin.password= #password

Verify the corresponding engine tasks according to actual needs

  1. #The version number of the engineType stitching of the engine must match the actual one. The following example is the default version number
  2. #shell engine tasks
  3. sh bin/linkis-cli -submitUser hadoop -engineType shell-1 -codeType shell -code "whoami"
  4. #hive engine tasks
  5. sh bin/linkis-cli -submitUser hadoop -engineType hive-3.1.3 -codeType hql -code "show tables"
  6. #spark engine tasks
  7. sh bin/linkis-cli -submitUser hadoop -engineType spark-3.2.1 -codeType sql -code "show tables"
  8. #python engine tasks
  9. sh bin/linkis-cli -submitUser hadoop -engineType python-python2 -codeType python -code 'print("hello, world!")'

If the verification fails, please refer to [Step 8] for troubleshooting.

After installing the Scripti tool, you can write SQL, Pyspark, HiveQL and other scripts online on the web page. For detailed instructions, see [Tool Scriptis Installation and Deployment] (integrated/install-scriptis).

Please note: the separate installation package of Linkis only includes Python, Shell, Hive, and Spark by default. If there are other engine usage scenarios (such as jdbc/flink/sqoop, etc.), you can install them manually. For details, please refer to EngineConnPlugin Engine Plugin installation documentation.

The list of supported engines adapted to this version is as follows:

Engine typeAdaptation situationWhether the official installation package contains
Python>=1.0.0 AdaptedContains
Shell>=1.0.0 adaptedcontains
Hive>=1.0.0 adaptedcontains
Spark>=1.0.0 adaptedcontains
Pipeline>=1.0.0 AdaptedNot Included
JDBC>=1.0.0 AdaptedNot Included
Flink>=1.0.0 AdaptedExcludes
openLooKeng>=1.1.1 AdaptedNot Included
Sqoop>=1.1.2 AdaptedNot Included
Trino>=1.3.2 AdaptedExcluded
Presto>=1.3.2 AdaptedExcluded
Elasticsearch>=1.3.2 AdaptedExcludes
Seatunnel>=1.3.2 AdaptedNot Included
Impala>=1.4.0 AdaptedExcludes
  1. $ tree linkis-package/lib/linkis-engineconn-plugins/ -L 3
  2. linkis-package/lib/linkis-engineconn-plugins/
  3. ├──hive
  4. ├── dist
  5. └── 3.1.3 #version is 3.1.3 engineType is hive-3.1.3
  6. └── plugin
  7. └── 3.1.3
  8. ├── python
  9. ├── dist
  10. └── python2
  11. └── plugin
  12. └── python2 #version is python2 engineType is python-python2
  13. ├── shell
  14. ├── dist
  15. └── 1
  16. └── plugin
  17. └── 1
  18. └── spark
  19. ├── dist
  20. └── 3.2.1
  21. └── plugin
  22. └── 3.2.1
  1. select * from linkis_cg_engine_conn_plugin_bml_resources

The additonal engines check are done manually by executing the scriptsh $LINKIS_HOME/bin/checkAdd.sh ${engineType}. Please refer to the directory ($LINKIS_HOME/bin/checkAdd.sh) . The specific checking method is as follows:

  1. function print_usage(){
  2. echo "Usage: checkAdd [EngineName]"
  3. echo " EngineName : The Engine name that you want to check"
  4. echo " Engine list as bellow: JDBC Flink openLooKeng Pipeline Presto Sqoop Elasticsearch "
  5. }

The parameters used in the addtional engines checking process are divided into two categories: one for the data engine connection information, defined in $LINKIS_HOME/deploy-config/db.sh; the other is the reference parameters, including check switches, version definitions, Java paths, etc., defined in $LINKIS_HOME/deploy-config/db.sh. engines and parameters descriptions are as follows: | EngineType | Parameters | parameter description | |———————-|——————————|———————————| | JDBC | ${MYSQL_HOST}, ${MYSQL_PORT}, ${MYSQL_DB}, ${MYSQL_USER}, ${MYSQL_PASSWORD} | MySQL engine connection information, including host IP, port, database name, username, password| | JDBC | ${MYSQL_CONNECT_JAVA_PATH} | MySQL JDBC Driver directory| | Flink | ${FLINK_HOME} | he installation directory of Flink, including Flink execution scripts and samples | | openLooKeng | ${OLK_HOST}, ${OLK_PORT}, ${OLK_CATALOG}, ${OLK_SCHEMA}, {OLK_USER}, ${OLK_PASSWORD}| openLooKeng engine connection information, including host IP, port, catalog, schema, username, password| | openLooKeng | ${OLK_JDBC_PATH} | openLooKeng connector directory| | Presto | ${PRESTO_HOST}, ${PRESTO_PORT}, ${PRESTO_CATALOG}, ${PRESTO_SCHEMA}| Presto engine connection information, including host IP, port, catalog, schema| | Sqoop | ${HIVE_META_URL}, ${HIVE_META_USER}, ${HIVE_META_PASSWORD}| sqoop connection information for connecting to Hive, including service address, username, password| | Elasticsearch | ${ES_RESTFUL_URL} | Elasticsearch RestfulAPI URL | | Impala | ${IMPALA_HOST}, ${IMPALA_PORT}| Impala connection information, including host IP and port| | Trino | ${TRINO_COORDINATOR_HOST}, ${TRINO_COORDINATOR_PORT}, ${TRINO_COORDINATOR_CATALOG}, ${TRINO_COORDINATOR_SCHEMA}| Trino connection information, including host IP, port, catalog, and schema| | Seatunnel | ${SEATUNNEL_HOST}, ${SEATUNNEL_PORT} | Seatunnel connection information, including host IP and port|

Database table `linkis_cg_rm_external_resource_provider`` Insert yarn data information

  1. INSERT INTO `linkis_cg_rm_external_resource_provider`
  2. (`resource_type`, `name`, `labels`, `config`) VALUES
  3. ('Yarn', 'default', NULL,
  4. '{\r\n"rmWebAddress": "http://xx.xx.xx.xx:8088",\r\n"hadoopVersion": "3.3.4",\r\n"authorEnable":false, \r\n"user":"hadoop",\r\n"pwd":"123456"\r\n}'
  5. );
  6. config field attribute
  7. "rmWebAddress": "http://xx.xx.xx.xx:8088", #Need to bring http and port
  8. "hadoopVersion": "3.3.4",
  9. "authorEnable":true, //Whether authentication is required You can verify the username and password by visiting http://xx.xx.xx.xx:8088 in the browser
  10. "user": "user", //username
  11. "pwd": "pwd"//password

After the update, because the cache is used in the program, if you want to take effect immediately, you need to restart the linkis-cg-linkismanager service.

  1. sh sbin/linkis-daemon.sh restart cg-linkismanager

Exception information: desc: queue ide is not exists in YARN. indicates that the configured yarn queue does not exist and needs to be adjusted.

Modification method: linkis management console/parameter configuration>global settings>yarn queue name [wds.linkis.rm.yarnqueue], modify a yarn queue that can be used, and the yarn queue to be used can be found at rmWebAddress:http:// xx.xx.xx.xx:8088/cluster/scheduler.

View available yarn queues

  1. #Log in to the linkis database
  2. select * from linkis_cg_engine_conn_plugin_bml_resources

Normally as follows: bml

Check whether the material record of the engine exists (if there is an update, check whether the update time is correct)

  • If it does not exist or is not updated, first try to manually refresh the material resource (see [Engine Material Resource Refresh](install-engineconn#23-engine refresh) for details).
  • Use log/linkis-cg-linkismanager.log to check the specific reason for the failure of the material. In many cases, it may be caused by the lack of permission in the hdfs directory.
  • Check whether the gateway address configuration is correct. The configuration item wds.linkis.gateway.url in conf/linkis.properties.

The material resources of the engine are uploaded to the hdfs directory by default as /apps-data/${deployUser}/bml.

  1. hdfs dfs -ls /apps-data/hadoop/bml
  2. #If there is no such directory, please manually create the directory and grant ${deployUser} read and write permissions
  3. hdfs dfs -mkdir /apps-data
  4. hdfs dfs -chown hadoop:hadoop /apps-data

Linkis uses static users and passwords by default. Static users are deployment users. Static passwords will randomly generate a password string during deployment and store it in

${LINKIS_HOME}/conf/linkis-mg-gateway.properties (>=version 1.0.3).

The engine supported by linkis by default, and the compatibility relationship with dss can be viewed in this document.

Linkis has many microservices. If you are not familiar with the system, sometimes you cannot locate the specific module that has an exception. You can search through the global log.

  1. tail -f log/* |grep -5n exception (or tail -f log/* |grep -5n ERROR)
  2. less log/* |grep -5n exception (or less log/* |grep -5n ERROR)

step1: Find the startup deployment directory of the engine

  • Method 1: If it is displayed in the execution log, you can view it on the management console as shown below:
    engine-log
  • Method 2: If not found in method 1, you can find the wds.linkis.engineconn.root.dir parameter configured in conf/linkis-cg-engineconnmanager.properties, and this value is the directory where the engine starts and deploys. Subdirectories are segregated by user of the execution engine
  1. # If you don't know the taskid, you can sort by time and choose ll -rt /appcom/tmp/${executed user}/${date}/${engine}/
  2. cd /appcom/tmp/${user executed}/${date}/${engine}/${taskId}

The directory is roughly as follows

  1. conf -> /appcom/tmp/engineConnPublicDir/6a09d5fb-81dd-41af-a58b-9cb5d5d81b5a/v000002/conf #engine configuration file
  2. engineConnExec.sh #generated engine startup script
  3. lib -> /appcom/tmp/engineConnPublicDir/45bf0e6b-0fa5-47da-9532-c2a9f3ec764d/v000003/lib #engine-dependent packages
  4. logs #Related logs of engine startup execution

step2: Check the log of the engine

  1. less logs/stdout

step3: Try to execute the script manually (if needed)
You can debug by trying to execute the script manually

  1. sh -x engineConnExec.sh

Sometimes when the eureka port is occupied by other services and the default eureka port cannot be used, it is necessary to modify the eureka port. Here, the modification of the eureka port is divided into two cases: before the installation and after the installation.

  1. Modify the eureka port of the registration center before performing the installation
  1. 1. Enter the decompression directory of apache-linkis-xxx-bin.tar.gz
  2. 2. Execute vi deploy-config/linkis-env.sh
  3. 3. Modify EUREKA_PORT=20303 to EUREKA_PORT=port number
  1. Modify the registry eureka port after installation
  1. 1. Enter the ${LINKIS_HOME}/conf directory
  2. 2. Execute grep -r 20303 ./* , the query results are as follows:
  3. ./application-eureka.yml: port: 20303
  4. ./application-eureka.yml: defaultZone: http://ip:20303/eureka/
  5. ./application-linkis.yml: defaultZone: http://ip:20303/eureka/
  6. ./linkis-env.sh:EUREKA_PORT=20303
  7. ./linkis.properties:wds.linkis.eureka.defaultZone=http://ip:20303/eureka/
  8. 3. Change the port at the corresponding location to a new port, and restart all services sh restart sbin/linkis-start-all.sh

CDH itself is not an official standard hive/spark package. When adapting, it is best to modify the hive/spark version dependencies in the linkis source code and recompile and deploy.
For details, please refer to the CDH adaptation blog post
[Linkis1.0——Installation and stepping in the CDH5 environment]
[DSS1.0.0+Linkis1.0.2——Trial record in CDH5 environment]
[DSS1.0.0 and Linkis1.0.2 - Summary of JDBC engine-related issues]
[DSS1.0.0 and Linkis1.0.2——Summary of issues related to Flink engine]

  • Method 1 can enable [Guide to Free Login Mode](/docs/latest/api/login-api/#2 Login-free configuration)
  • In method 2 postman, the cookie value of successful login on the request header The cookie value can be obtained after successful login on the browser side bml
  1. Cookie: bdp-user-ticket-id=xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
  • Method 3 http request header to add a static Token token
    Token is configured in conf/linkis.properties Such as: TEST-AUTH=hadoop,root,user01
  1. Token-Code: TEST-AUTH
  2. Token-User: hadoop

First, check whether the service/environment is started normally according to the above steps, and then check the basic problems according to some scenarios listed above.

QA document Find out if there is a solution, link: https://docs.qq.com/doc/DSGZhdnpMV3lTUUxq
See if you can find a solution by searching the contents of the issue.
issues
Through the official website document search, for some questions, you can search keywords on the official website, such as searching for “deployment”. (If 404 appears, please refresh the browser)
search

Linkis official website documents are constantly being improved, and you can view related documents on this official website.

Related blog posts are linked below.