Glossary

Linkis is built on a microservice architecture. Its services fall into three service groups: the Computation Governance service group, the Public Enhancement service group, and the Microservice Governance service group.

  • Computation Governance Services: The core services for processing tasks, supporting the 4 main stages of the computing task/request processing flow (submit -> prepare -> execute -> result);
  • Public Enhancement Services: Provide basic support services, including context services, engine/UDF material management services, job history and other public services, and data source management services;
  • Microservice Governance Services: Customized Spring Cloud Gateway and Eureka, which provide the foundation for the microservices.
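
As a rough illustration of the four-stage flow named in the first bullet, the sketch below models the stages as an ordered Python enum. The names `Stage`, `next_stage`, and `walk` are hypothetical and exist only for this example; they are not part of Linkis, and the real services track far more state per stage than is shown here.

```python
from enum import Enum
from typing import List, Optional


class Stage(Enum):
    """The four stages named in the text, declared in processing order."""
    SUBMIT = "submit"
    PREPARE = "prepare"
    EXECUTE = "execute"
    RESULT = "result"


def next_stage(stage: Stage) -> Optional[Stage]:
    """Return the stage that follows `stage`, or None after the last one."""
    order = list(Stage)  # Enum iteration preserves declaration order
    index = order.index(stage)
    return order[index + 1] if index + 1 < len(order) else None


def walk(start: Stage = Stage.SUBMIT) -> List[str]:
    """List every stage name from `start` to the end of the flow."""
    path: List[str] = []
    current: Optional[Stage] = start
    while current is not None:
        path.append(current.value)
        current = next_stage(current)
    return path
```

Calling `walk()` lists the stages in order, and `next_stage(Stage.RESULT)` returns `None` because nothing follows the result stage.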

The following introduces the key terms and services of these three service groups:

| Abbreviation | Name | Main Functions |
| --- | --- | --- |
| MG/mg | Microservice Governance | Microservice Governance |
| CG/cg | Computation Governance | Computation Governance |
| EC/ec | EngineConn | Engine Connector |
| - | Engine | The underlying computing/storage engine, such as Spark, Hive, Shell |
| ECM/ecm | EngineConnManager | Management of Engine Connectors |
| ECP/ecp | EngineConnPlugin | Engine Connector Plugin |
| RM/rm | ResourceManager | Resource manager for managing and controlling task and user resource usage |
| AM/am | AppManager | Application Manager, which manages EngineConn and ECM services |
| LM/lm | LinkisManager | Linkis manager service, including RM, AM, LabelManager and other modules |
| PES/pes | Public Enhancement Services | |
| - | Orchestrator | Orchestrator, used for Linkis task orchestration and for policy support such as task multi-active, mixed computation, and AB |
| UJES | Unified Job Execute Service | Unified Job Execute Service |
| DDL/ddl | Data Definition Language | Data Definition Language |
| DML/dml | Data Manipulation Language | Data Manipulation Language |

  • JobRequest: Job request, corresponding to the job that the Client submits to Linkis; includes the job's execution content, user, labels, and other information
  • RuntimeMap: Task runtime parameters, which take effect at the task level; for example, it holds the data source information when multiple data sources are used
  • StartupMap: Engine connector startup parameters, used to start the EngineConn; they take effect for the EngineConn process, such as setting spark.executor.memory=4G
  • UserCreator: Task creator information, containing the user (User) and the application the Client submitted from (Creator); used for tenant isolation of tasks and resources
  • submitUser: The user who submits the task
  • executeUser: The user who actually executes the task
  • JobSource: Job source information, recording the IP or script address of the job
  • errorCode: Error code information for the task
  • JobHistory: Task history persistence module, providing queries of historical task information
  • ResultSet: The result set corresponding to the task, saved with the .dolphin file suffix by default
  • JobInfo: Job runtime information, such as logs, progress, resource information, etc.
  • Resource: Resource information; each task consumes resources
  • RequestTask: The smallest execution unit of EngineConn; the task unit transmitted to EngineConn for execution
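
To show how several of these terms relate, here is a minimal Python sketch of a job request. It is an illustration under stated assumptions, not the Linkis implementation: the class `JobRequestSketch`, its field names, the payload key names, and the `<user>-<creator>` form of the UserCreator value are all hypothetical choices made for this example.

```python
from dataclasses import dataclass, field
from typing import Any, Dict


@dataclass
class JobRequestSketch:
    """Hypothetical stand-in for a JobRequest; not the actual Linkis class."""
    code: str                 # execution content of the job
    run_type: str             # kind of code being run, e.g. "sql"
    submit_user: str          # submitUser: who submitted the task
    execute_user: str         # executeUser: who actually runs the task
    creator: str              # Creator: application the Client submitted from
    engine_type: str          # label selecting the engine
    startup_map: Dict[str, Any] = field(default_factory=dict)  # StartupMap
    runtime_map: Dict[str, Any] = field(default_factory=dict)  # RuntimeMap
    script_path: str = ""     # JobSource: script address of the job

    def user_creator(self) -> str:
        # Assumption: UserCreator is rendered as "<user>-<creator>".
        return f"{self.execute_user}-{self.creator}"

    def to_payload(self) -> Dict[str, Any]:
        """Assemble a nested dict; the key names are illustrative only."""
        return {
            "executionContent": {"code": self.code, "runType": self.run_type},
            "params": {
                "configuration": {
                    "startup": dict(self.startup_map),  # EngineConn process level
                    "runtime": dict(self.runtime_map),  # task level
                }
            },
            "source": {"scriptPath": self.script_path},
            "labels": {
                "engineType": self.engine_type,
                "userCreator": self.user_creator(),
            },
            "submitUser": self.submit_user,
            "executeUser": self.execute_user,
        }
```

For example, a request built with `startup_map={"spark.executor.memory": "4G"}` carries that setting under the startup parameters, so it would apply to the EngineConn process as a whole, while anything placed in `runtime_map` would apply only to the single task.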

This section introduces the Linkis services: which services are available after Linkis starts and what each of them does.

After Linkis is started, the microservices included in each service group are as follows:

| Microservice group | Service name | Main functions |
| --- | --- | --- |
| MGS | linkis-mg-eureka | Responsible for service registration and discovery; other upstream components, such as DSS, also reuse the Linkis registry |
| MGS | linkis-mg-gateway | The gateway entrance of Linkis, mainly responsible for request forwarding and user access authentication |
| CGS | linkis-cg-entrance | The task submission entry: a service responsible for receiving, scheduling, and forwarding execution requests and for life cycle management of computing tasks; it can return calculation results, logs, and progress to the caller |
| CGS | linkis-cg-linkismanager | Provides AppManager (application management), ResourceManager (resource management), LabelManager (label management), and engine connector plug-in manager capabilities |
| CGS | linkis-cg-engineconnmanager | Manager for EngineConn, providing lifecycle management of engines |
| CGS | linkis-cg-engineconn | The engine connector service is the actual connection service to the underlying computing/storage engine (Hive/Spark) and holds the session information with that engine. Toward the underlying engine it acts as a client; it is triggered and started by tasks |
| PES | linkis-ps-publicservice | Public Enhancement Service group module service, which provides functions such as unified configuration management, context service, BML material library, data source management, microservice management, and historical task query for other microservice modules |
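
The service names listed above follow a visible `linkis-<prefix>-<component>` pattern, with the prefixes `mg`, `cg`, and `ps` matching the groups MGS, CGS, and PES. The sketch below derives the group from a service name on that basis. The helper `classify` and the `GROUP_BY_PREFIX` table are hypothetical names for this example, and the mapping is inferred only from the names in this section, so treat it as an assumption rather than a documented naming rule.

```python
from typing import Dict, Tuple

# Name prefix -> group abbreviation, inferred from the service list above.
GROUP_BY_PREFIX: Dict[str, str] = {
    "mg": "MGS",  # microservice governance
    "cg": "CGS",  # computation governance
    "ps": "PES",  # public enhancement
}


def classify(service_name: str) -> Tuple[str, str]:
    """Split 'linkis-<prefix>-<component>' into (group, component)."""
    parts = service_name.split("-", 2)
    if len(parts) != 3 or parts[0] != "linkis":
        raise ValueError(f"unexpected service name: {service_name!r}")
    _, prefix, component = parts
    if prefix not in GROUP_BY_PREFIX:
        raise ValueError(f"unknown group prefix: {prefix!r}")
    return GROUP_BY_PREFIX[prefix], component
```

With this helper, `classify("linkis-cg-entrance")` yields the CGS group and the component name `entrance`; a name that does not follow the pattern raises `ValueError` instead of being guessed at.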

All services of the open-source version that are visible after startup are as follows:

[Figure: Linkis_Eureka]

After version 1.3.1, the Public Enhancement Service group (PES) merges its module services into a single service, linkis-ps-publicservice, by default. Separate deployment is still supported: you only need to package and deploy the services of the corresponding modules. The merged public enhancement service mainly includes the following functions:

| Abbreviation | Service Name | Main Functions |
| --- | --- | --- |
| CS/cs | Context Service | Context Service, used to transfer result sets, variables, files, etc. between tasks |
| UDF/udf | UDF | UDF management module, providing management of UDFs and functions, with support for sharing and version control |
| variable | Variable | Global custom variable module, providing management of global custom variables |
| script | Script-dev | Script file operation service, providing script editing and saving and script directory management |
| jobHistory | JobHistory | Task history persistence module, providing queries of historical task information |
| BML/bml | BigData Material library | |
| - | Configuration | Configuration management, providing management and viewing of configuration parameters |
| - | instance-label | Microservice management service, providing mapping management between microservices and routing labels |
| - | error-code | Error code management, providing management of error codes |
| DMS/dms | Data Source Manager Service | Data Source Management Service |
| MDS/mds | MetaData Manager Service | Metadata Management Service |
| - | linkis-metadata | Provides viewing of Hive metadata information; will be merged into MDS later |
| - | basedata-manager | Basic data management, used to manage Linkis' own basic metadata information |

This section introduces the major modules of Linkis and their functions.

  • linkis-commons: The common modules of Linkis, including common utility modules, the RPC module, the microservice foundation, and other modules
  • linkis-computation-governance: Computation governance module, containing the modules of the computation governance services: Entrance, LinkisManager, EngineConnManager, EngineConn, etc.
  • linkis-engineconn-plugins: Engine connector plugin module, containing all engine connector plugin implementations
  • linkis-extensions: The extension module of Linkis; not a required module. It currently consists mainly of the IO module for file proxy operations
  • linkis-orchestrator: Orchestration module, used for Linkis task orchestration and for advanced strategies such as task multi-active, mixed computation, and AB
  • linkis-public-enhancements: Public enhancement module, containing all public services called by Linkis internals and by upper-layer application components
  • linkis-spring-cloud-services: Spring Cloud related service modules, including the gateway, the registry, etc.
  • linkis-web: Front-end module