The Cluster Queue tab provides a quick overview of all cluster queues that are defined for the cluster. From this tab you can also suspend and resume cluster queues, disable and enable them, and configure them.
Information displayed in the Cluster Queue dialog box is updated periodically. Click Refresh to force an update. Click a cluster queue name to select the queue.
Click Delete, Suspend, Resume, Disable, or Enable to execute the corresponding operation on the cluster queues that you select. The suspend/resume and disable/enable operations require notification of the corresponding sge_execd. If notification is not possible, for example because a host is down, you can force an sge_qmaster internal status change by clicking Force.
The suspend/resume and disable/enable operations require cluster queue owner permission, grid engine manager permission, or operator permission. See Managers, Operators, and Owners for details.
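The suspend, resume, disable, and enable operations are also available from the command line through qmod(1). The following Python sketch only builds the argument list and does not run anything. The helper name qmod_args and the queue name all.q are hypothetical, and the option letters (-sq, -usq, -d, -e, -f) are taken from the qmod(1) man page for Grid Engine 6.x, so check them against your installation before relying on them.

```python
# Hypothetical helper: maps the QMON button names to qmod(1) options.
# Option letters are assumed from the Grid Engine 6.x qmod(1) man page.
QMOD_FLAGS = {
    "suspend": "-sq",
    "resume": "-usq",
    "disable": "-d",
    "enable": "-e",
}


def qmod_args(operation, queues, force=False):
    """Return the argv list for a qmod call without executing it."""
    if operation not in QMOD_FLAGS:
        raise ValueError(f"unsupported operation: {operation}")
    if not queues:
        raise ValueError("at least one cluster queue is required")
    args = ["qmod"]
    if force:
        # Corresponds to the Force button: change the sge_qmaster
        # internal status even if sge_execd cannot be notified.
        args.append("-f")
    # qmod accepts a comma-separated queue list.
    args += [QMOD_FLAGS[operation], ",".join(queues)]
    return args


print(" ".join(qmod_args("suspend", ["all.q"])))              # qmod -sq all.q
print(" ".join(qmod_args("disable", ["all.q"], force=True)))  # qmod -f -d all.q
```

The returned list can be passed to subprocess.run on a host where the grid engine commands are installed. The same permission requirements apply as for the QMON buttons.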
Suspended cluster queues are closed for further jobs. The jobs already running in suspended queues are also suspended, as described in Monitoring and Controlling Jobs With QMON. The cluster queue and its jobs are unsuspended as soon as the queue is resumed.
If a job in a suspended cluster queue was suspended explicitly, the job is not resumed when the queue is resumed. The job must be resumed explicitly.
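The interaction between queue suspension and explicit job suspension can be illustrated with a toy model. This Python sketch is not how the grid engine system is implemented. The class and attribute names are hypothetical and exist only to show why an explicitly suspended job stays suspended after its queue is resumed.

```python
from dataclasses import dataclass, field


@dataclass
class Job:
    name: str
    explicitly_suspended: bool = False  # set when the job itself is suspended


@dataclass
class ClusterQueue:
    name: str
    suspended: bool = False
    jobs: list = field(default_factory=list)

    def accepts_new_jobs(self):
        # A suspended queue is closed for further jobs.
        return not self.suspended

    def job_is_running(self, job):
        # A job runs only if neither the job nor its queue is suspended.
        return not (self.suspended or job.explicitly_suspended)


queue = ClusterQueue("all.q", jobs=[Job("job_a"), Job("job_b")])
queue.suspended = True                      # Suspend on the queue
queue.jobs[1].explicitly_suspended = True   # job_b is also suspended explicitly
queue.suspended = False                     # Resume on the queue

running = [job.name for job in queue.jobs if queue.job_is_running(job)]
print(running)  # ['job_a']
```

In this model job_b starts running again only after its own explicitly_suspended flag is cleared, which mirrors the requirement to resume the job explicitly.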
Disabled cluster queues are closed to new jobs. However, the jobs that are already running in those queues are allowed to continue. Disabling a cluster queue is commonly used to clear a queue. After the cluster queue is enabled, it is eligible to run jobs again. No action is performed on currently running jobs.
Error states are displayed using a red font in the queue list. Click Clear Error to remove an error state from a queue.
Click Reschedule to reschedule all jobs currently running in the selected cluster queues.
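Clear Error and Reschedule also have command-line counterparts in qmod(1). The Python sketch below builds the corresponding argument lists without running them. The function names are hypothetical, and the options -cq (clear queue error state) and -rq (reschedule the jobs running in a queue) are assumed from the Grid Engine 6.x qmod(1) man page.

```python
def clear_error_args(queues):
    """argv for clearing the error state of the given cluster queues."""
    return ["qmod", "-cq", ",".join(queues)]


def reschedule_args(queues):
    """argv for rescheduling all jobs running in the given cluster queues."""
    return ["qmod", "-rq", ",".join(queues)]


print(" ".join(clear_error_args(["all.q"])))   # qmod -cq all.q
print(" ".join(reschedule_args(["all.q"])))    # qmod -rq all.q
```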
To configure cluster queues and queue instances, click Add or Modify on the Cluster Queue dialog box. See Configuring Queues With QMON in Sun N1 Grid Engine 6.1 Administration Guide for details.
Click Done to close the dialog box.
Each row in the cluster queue table represents one cluster queue. For each cluster queue, the table lists the following information:
Cluster Queue – Name of the cluster queue.
Load – Average of the normalized load average of all cluster queue hosts. Only hosts with a load value are considered.
Used – Number of currently used job slots.
Avail – Number of currently available job slots.
Total – Total number of job slots.
aoACD – Number of queue instances that are in at least one of the following states:
    a – Load threshold alarm
    o – Orphaned
    A – Suspend threshold alarm
    C – Suspended by calendar
    D – Disabled by calendar
cdsuE – Number of queue instances that are in at least one of the following states:
    c – Configuration ambiguous
    d – Disabled
    s – Suspended
    u – Unknown
    E – Error
s – Number of queue instances that are in the suspended state.
A – Number of queue instances where one or more suspend thresholds are currently exceeded. No more jobs
S – Number of queue instances that are suspended through subordination to another queue.
C – Number of queue instances that are automatically suspended by the grid engine system calendar.
u – Number of queue instances that are in an unknown state.
a – Number of queue instances where one or more load thresholds are currently exceeded.
d – Number of queue instances that are in the disabled state.
D – Number of queue instances that are automatically disabled by the grid engine system calendar.
c – Number of queue instances whose configuration is ambiguous.
o – Number of queue instances that are in the orphaned state.
E – Number of queue instances that are in the error state.
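The way the summary columns aggregate per-queue-instance data can be sketched in Python. This is an illustration under stated assumptions, not the code that QMON uses. The record layout (load, used, total, states) and the function name summarize are hypothetical, and Total is assumed to be the sum of the slots of all queue instances. The sketch shows two points from the column descriptions: Load averages only the hosts that report a load value, and a queue instance in several states of a group is counted once for that group.

```python
from statistics import mean

ALARM_GROUP = set("aoACD")   # states summarized in the aoACD column
CLOSED_GROUP = set("cdsuE")  # states summarized in the cdsuE column


def summarize(instances):
    """Aggregate hypothetical queue instance records into one table row."""
    # Only hosts that report a load value are considered.
    loads = [qi["load"] for qi in instances if qi["load"] is not None]
    return {
        "load": mean(loads) if loads else None,
        "used": sum(qi["used"] for qi in instances),
        "total": sum(qi["total"] for qi in instances),
        # "At least one of the states": count each instance once per group.
        "aoACD": sum(1 for qi in instances if ALARM_GROUP & set(qi["states"])),
        "cdsuE": sum(1 for qi in instances if CLOSED_GROUP & set(qi["states"])),
    }


sample = [
    {"load": 0.25, "used": 2, "total": 4, "states": ""},
    {"load": 0.75, "used": 0, "total": 4, "states": "aAd"},  # once in each group
    {"load": None, "used": 0, "total": 4, "states": "u"},    # no load value
]
summary = summarize(sample)
print(summary)
# {'load': 0.5, 'used': 2, 'total': 12, 'aoACD': 1, 'cdsuE': 2}
```

The single-letter columns (s, A, S, C, u, a, d, D, c, o, E) would be computed the same way, with a one-letter state set in place of the group.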
See the qstat(1) man page for complete information about cluster queues and their states.