Starting a Job Run

Use Data Science job runs to apply various use cases to created jobs.

Job creation sets the infrastructure and the actual use case artifact, but the job run runs the job with the specified parameters. Job runs provision the specified infrastructure, run the job artifact, and then deprovision and destroy the used resources when the job run ends.

- If you're starting a single node job run, follow the steps in Using the Console to Start Single Node Job Runs.
- If you're starting a multi node job run, follow the steps in Using the Console to Start Multi Node Job Runs.
Using the Console to Start Single Node Job Runs
Using the Console to Start Single Node Job Runs

From the jobs list page, select the name of the job to work with. If you need help finding the list of jobs, see Listing Jobs.
Select Start a job run.
(Optional) Select a different compartment for the job run.
(Optional) Enter a name for the job run (limit of 255 characters). If you don't provide a name, a name is automatically generated.

For example, jobrun20210808222435.

(Optional) Override the default job configuration that was defined when the job was created by using these options:
Enter or select any of the following values:

Custom environment variable key

The environment variables that control the job.

Value

The value for the custom environment variable key.

You can select Additional custom environment key to specify more variables.

Command line arguments

The command line arguments that you want to use for running the job.

Maximum runtime (in minutes)

The maximum number of minutes that the job can run. The service cancels the job run if its runtime exceeds the specified value. The maximum runtime is 30 days (43,200 minutes). We recommend that you configure a maximum runtime on all job runs to prevent runaway job runs.

(Optional) Enter a value for the maximum runtime override (in minutes).
Override the networking resource configuration, that was defined when the job was created.
The network type can't be changed.

If Default Networking is configured, nothing can be overridden.

If Custom Networking is configured, you have the option to change the selected VCN and subnet.

(Optional) Change the Compute shape by selecting Change shape. Then, follow these steps in the Select compute panel.

Select an instance type.

Select a shape series.

Select one of the supported Compute shapes in the series.

Select the shape that best suits how you want to use the resource.

For each OCPU, select up to 64 GB of memory and a maximum total of 512 GB. The minimum amount of memory allowed is either 1 GB or a value matching the number of OCPUs, whichever is greater.

If using burstable VMs, toggle Burstable.
In Baseline utilization per OCPU select the percentage of OCPUs that you usually want to use. The supported values are 12.5% and 50%.

Select Select shape.

(Optional) Override the Storage configuration.
Enter the amount of block storage to use between 50 GB and 10, 240 GB (10 TB). You can change the value by 1 GB increments.

Override the environment configuration, that was defined when the job was created, by using these options:

In Environment configuration. select Select. The Set your BYOC environment panel is displayed.

In Repository select a repository from the list. If the repository is in a different compartment, select Change compartment.

In Image select an image from the list.

(Optional) In Entrypoint enter an entry point. To add another, select +Add parameter.

(Optional) In CMD enter a CMD. To add another, select +Add parameter.
Note

Use CMD as arguments to the ENTRYPOINT or the only command to run in the absence of an ENTRYPOINT.

(Optional) In Image digest enter an image digest.

(Optional) In Signature ID, if using signature verification, enter the OCID of the image signature. For example, ocid1.containerimagesignature.oc1.iad.aaaaaaaaab....

Select Select.

(Optional) Override the Logging configuration.

Select a log group from the list. You can change to a different compartment to specify a log group in a different compartment from the job.

Select one of the following to store all stdout and stderr messages:

Enable automatic log creation

Data Science automatically creates a log when the job starts.

Select a log

Select a log to use.

(Optional) Override the start up probe.

Enter a command.

(Optional) To add another command, select +Add command and repeat step a.

(Optional) Enter a value for the initial delay (in seconds).

(Optional) Enter a value of the period.

(Optional) Enter a value for the failure threshold.

Select Save.

(Optional) Select Show advanced options to add tags to the job run.
(Optional) In the Tags section, add one or more tags to the job run. If you have permissions to create a job run, then you also have permissions to apply free-form tags to that job run. To apply a defined tag, you must have permissions to use the tag namespace. For more information about tagging, see Resource Tags. If you're not sure whether to apply tags, skip this option or ask an administrator. You can apply tags later.
Select Start.
Using the Console to Start Multi Node Job Runs
Using the Console to Start Multi Node Job Runs

From the jobs list page, select the name of the job to work with. If you need help finding the list of jobs, see Listing Jobs.
Select Start a job run.
(Optional) Select a different compartment for the job run.
(Optional) Enter a name for the job run (limit of 255 characters). If you don't provide a name, a name is automatically generated.

For example, jobrun20210808222435.

(Optional) Enter a value for the maximum runtime override (in minutes).
Override the networking resource configuration, that was defined when the job was created.
The network type can't be changed.

If Default Networking is configured, nothing can be overridden.

If Custom Networking is configured, you have the option to change the selected VCN and subnet.

(Optional) Override the node group configuration, that was defined when the job was created.

Each node group's configurations can be individually overridden by clicking on the Actions menu (three dots) for the node group and selecting Edit.

(Optional) Override the number of replicas.

(Optional) Override the minimum number of replicas that must succeed.

(Optional) Override the Storage. Enter the amount of block storage to use between 50 GB and 10, 240 GB (10 TB). You can change the value by 1 GB increments.

(Optional) Override the default job configuration using these options:

Enter or select any of the following values:

Custom environment variable key

The environment variables that control the job.

Value

The value for the custom environment variable key.

You can select Additional custom environment key to specify more variables.

Command line arguments

The command line arguments that you want to use for running the job.

Change the Compute shape by selecting Change shape. Then, follow these steps in the Select compute panel.

Select an instance type.

Select a shape series.

Select one of the supported Compute shapes in the series.

Select the shape that best suits how you want to use the resource.

For each OCPU, select up to 64 GB of memory and a maximum total of 512 GB. The minimum amount of memory allowed is either 1 GB or a value matching the number of OCPUs, whichever is greater.

If using burstable VMs, toggle Burstable.
In Baseline utilization per OCPU select the percentage of OCPUs that you usually want to use. The supported values are 12.5% and 50%.

Select Select shape.

(Optional) Override the Probes configuration:

Enter a command.

(Optional) To add another command, select +Add command and repeat step a.

(Optional) Enter a value for the initial delay (in seconds).

(Optional) Enter a value of the period.

(Optional) Enter a value for the failure threshold.

Select Save.

(Optional) Override the Logging configuration.

Select a log group from the list. You can change to a different compartment to specify a log group in a different compartment from the job.

Select one of the following to store all stdout and stderr messages:

Enable automatic log creation

Data Science automatically creates a log when the job starts.

Select a log

Select a log to use.

(Optional) Select Show advanced options to add tags to the job run.
(Optional) In the Tags section, add one or more tags to the job run. If you have permissions to create a job run, then you also have permissions to apply free-form tags to that job run. To apply a defined tag, you must have permissions to use the tag namespace. For more information about tagging, see Resource Tags. If you're not sure whether to apply tags, skip this option or ask an administrator. You can apply tags later.
Select Start.

These environment variables control the job.

Use the Data Science CLI to start job runs as in this example:

Start a job run with:

oci data-science job-run create \
--display-name <job_run_name> \
--compartment-id <compartment_ocid> \
--project-id <project_ocid> \
--job-id <job_ocid> \
--configuration-override-details file://<optional_job_run_configuration_override_json_file> \
--log-configuration-override-details file://<optional_job_run_logging_configuration_override_json_file>

(Optional) Use this job run configuration override JSON file to override the configurations defined on the parent job:

jobEnvironmentConfigurationDetails: {
  jobEnvironmentType: "OCIR_CONTAINER",
    image: "iad.ocir.io/axoxdievda5j/odsc-byod-hello-wrld:0.1.3",
    imageDigest: "sha256",
  cmd: ["ls", "-h"],
  entrypoint: ["-l"],
    imageSignatureId: "ocid1.containerimagesignature.oc1.iad.0.ociodscdev.aaaaaaaaccutw5qdz6twjzkpgmbojdck3qotqqsbn7ph6xcumu4s32o6v5gq",
},
    jobConfigurationDetails: {
        jobType: "DEFAULT",
        environmentVariables: <envar-list-object>},
    ...
}

(Optional) Use this job run logging configuration override JSON file to override the logging configuration defined on the parent job:
```
{
  "enableLogging": true,
  "enableAutoLogCreation": true,
  "logGroupId": "<log_group_ocid>"
}
```

The ADS SDK is also a publicly available Python library that you can install with this command:
```
pip install oracle-ads
```
It provides the wrapper that makes starting job runs from notebooks or on your client machine easy.

Use the ADS SDK to start job runs.

Oracle Cloud Infrastructure Documentation

Starting a Job Run

Using the Console to Start Single Node Job Runs

Using the Console to Start Multi Node Job Runs