Jobs
A Job creates one or more Pods and ensures that a specified number of them successfully terminates. As Pods successfully complete, the Job tracks the successful completions. When a specified number of successful completions is reached, the task (namely, Job) is complete. Deleting a Job will clean up the Pods it created.
A simple case is to create one Job object in order to reliably run one Pod to completion. The Job object will start a new Pod if the first Pod fails or is deleted (for example, due to a node hardware failure or a node reboot). You can also use a Job to run multiple Pods in parallel.
The following example demonstrates specific steps of creating a Job (computing π to 2000 decimal places) on KubeSphere.
Prerequisites
You need to create a workspace, a project and a user (project-regular
). The user must be invited to the project with the role of operator
. For more information, see Create Workspaces, Projects, Users and Roles.
Create a Job
Step 1: Open the dashboard
Log in to the console as project-regular
. Go to Jobs under Application Workloads and click Create.
Step 2: Enter basic information
Enter the basic information. Refer to the image below as an example.
- Name: The name of the Job, which is also the unique identifier.
- Alias: The alias name of the Job, making resources easier to identify.
- Description: The description of the Job, which gives a brief introduction of the Job.
Step 3: Strategy settings (optional)
You can set the values in this step or click Next to use the default values. Refer to the table below for detailed explanations of each field.
Name | Definition | Description |
---|---|---|
Maximum Retries | spec.backoffLimit | It specifies the maximum number of retries before this Job is marked as failed. It defaults to 6. |
Complete Pods | spec.completions | It specifies the desired number of successfully finished Pods the Job should be run with. Setting it to nil means that the success of any Pod signals the success of all Pods, and allows parallelism to have any positive value. Setting it to 1 means that parallelism is limited to 1 and the success of that Pod signals the success of the Job. For more information, see Jobs. |
Parallel Pods | spec.parallelism | It specifies the maximum desired number of Pods the Job should run at any given time. The actual number of Pods running in a steady state will be less than this number when the work left to do is less than max parallelism ((.spec.completions - .status.successful ) < .spec.parallelism ). For more information, see Jobs. |
Maximum Duration (s) | spec.activeDeadlineSeconds | It specifies the duration in seconds relative to the startTime that the Job may be active before the system tries to terminate it; the value must be a positive integer. |
Step 4: Set a Pod
Select Re-create Pod for Restart Policy. You can only specify Re-create Pod or Restart container for Restart Policy when the Job is not completed:
If Restart Policy is set to Re-create Pod, the Job creates a new Pod when the Pod fails, and the failed Pod does not disappear.
If Restart Policy is set to Restart container, the Job will internally restart the container when the Pod fails, instead of creating a new Pod.
Click Add Container which directs you to the Add Container page. Enter
perl
in the image search box and press Enter.On the same page, scroll down to Start Command. Enter the following commands in the box which computes pi to 2000 places then prints it. Click √ in the lower-right corner and select Next to continue.
perl,-Mbignum=bpi,-wle,print bpi(2000)
Note
For more information about setting images, see Pod Settings.
Step 5: Inspect the Job manifest (optional)
Enable Edit YAML in the upper-right corner which displays the manifest file of the Job. You can see all the values are set based on what you have specified in the previous steps.
apiVersion: batch/v1
kind: Job
metadata:
namespace: demo-project
labels:
app: job-test-1
name: job-test-1
annotations:
kubesphere.io/alias-name: Test
kubesphere.io/description: A job test
spec:
template:
metadata:
labels:
app: job-test-1
spec:
containers:
- name: container-4rwiyb
imagePullPolicy: IfNotPresent
image: perl
command:
- perl
- '-Mbignum=bpi'
- '-wle'
- print bpi(2000)
restartPolicy: Never
serviceAccount: default
initContainers: []
volumes: []
imagePullSecrets: null
backoffLimit: 5
completions: 4
parallelism: 2
activeDeadlineSeconds: 300
You can make adjustments in the manifest directly and click Create or disable the Edit YAML and get back to the Create page.
Note
You can skip Volume Settings and Advanced Settings for this tutorial. For more information, see Mount volumes and Configure advanced settings.
Step 6: Check the result
In the final step of Advanced Settings, click Create to finish. A new item will be added to the Job list if the creation is successful.
Click this Job and go to Job Records where you can see the information of each execution record. There are four completed Pods since Completions was set to
4
in Step 3.Tip
You can rerun the Job if it fails and the reason for failure is displayed under Message.
In Resource Status, you can inspect the Pod status. Two Pods were created each time as Parallel Pods was set to 2. Click on the right and click to check the container log, which displays the expected calculation result.
Tip
- In Resource Status, the Pod list provides the Pod’s detailed information (for example, creation time, node, Pod IP and monitoring data).
- You can view the container information by clicking the Pod.
- Click the container log icon to view the output logs of the container.
- You can view the Pod details page by clicking the Pod name.
Check Job Details
Operations
On the Job details page, you can manage the Job after it is created.
- Edit Information: Edit the basic information except
Name
of the Job. - Rerun: Rerun the Job, the Pod will restart, and a new execution record will be generated.
- View YAML: View the Job’s specification in YAML format.
- Delete: Delete the Job and return to the Job list page.
Execution records
Click the Job Records tab to view the execution records of the Job.
Click to refresh the execution records.
Resource status
Click the Resource Status tab to view the Pods of the Job.
Click to refresh the Pod information, and click / to display/hide the containers in each Pod.
Metadata
Click the Metadata tab to view the labels and annotations of the Job.
Environment variables
Click the Environment Variables tab to view the environment variables of the Job.
Events
Click the Events tab to view the events of the Job.