Type the following command with the appropriate options:
# qsub options arguments |
The submission of a checkpointing job works in the same way as for regular batch scripts, except for the qsub -ckpt and qsub -c commands. These commands request a checkpointing mechanism. The commands also define the occasions at which checkpoints must be generated for the job.
The -ckpt option takes one argument, which is the name of the checkpointing environment to use. See Configuring Checkpointing Environments in Sun N1 Grid Engine 6.1 Administration Guide.
The -c option is not required. -c also takes one argument. Use the -c option to override the definitions of the when parameter in the checkpointing environment configuration. See the checkpoint(5) man page for details.
The argument to the -c option can be one of the following one-letter selections, or any combination. The argument can also be a time value.
n – No checkpoint is performed. n has the highest precedence.
s – A checkpoint is generated only if the sge_execd on the jobs host is shut down.
m – Generate the checkpoint at the minimum CPU interval defined in the corresponding queue configuration. See the min_cpu_interval parameter in the queue_conf(5) man page.
x – A checkpoint is generated if the job is suspended.
interval – Generate the checkpoint in the given interval but not more frequently than defined by min_cpu_interval. The time value must be specified as hh:mm:ss. This format specifies two digit hours, minutes, and seconds, separated by colons.
The monitoring of checkpointing jobs differs from monitoring regular jobs. Checkpointing jobs can migrate from time to time. Checkpointing jobs are therefore not bound to a single queue. However, the unique job identification number and the job name stay the same.
The deletion of checkpointing jobs works in the same way as described in Monitoring and Controlling Jobs From the Command Line.