Sun N1 System Manager 1.3 Grid Engine Provisioning and Monitoring Guide

Chapter 6 Working With N1 Grid Engine Queues

This chapter describes how to access information about a grid's queues. You can see a general picture of the performance health of all the queues and view details about a particular queue.

Monitoring Queues

Queue information is available from the Queue Summary tab. You use this page to see whether a queue is functioning and how efficiently it is performing. From this page you can also view extensive details on any queue.

A queue in the N1GE environment is a means of defining a job's execution environment. This context includes features like:

A queue instance is the portion of the queue that exists on a single host.

The information in this tab is presented in a table of queue instances, that is, the portion of the queue that runs on a particular host. Every queue instance that exists in the grid is listed.

Figure 6–1 Queue Summary Page

This page shows you the list of available
queues.

The Queue Summary page show the following information:


Note –

You do not prioritize jobs using an N1GE queue. You define priorities using the extended policy system of the Sun N1 Grid Engine software. For information on job priorities, see the sge_priority(5) man page and Scheduler Policies for Job Prioritization in the Sun N1 Grid Engine 6 System (www.sun.com/blueprints/1005/819-4325.html).


For information on cluster queues, see the Monitoring and Controlling Queues section in the N1GE 6 User's Guide and the qmon man page. For more information on queue states, see the Queue Alerts page.

Viewing Complete Queue Information

The Queue Details page contains complete information for the queue instance that you selected on the Queue Summary page.

Figure 6–2 Queue Details Page

This page shows you the complete
details for a particular queue.

The Queue Details page shows the following information:

These parameters specify per job soft and hard resource limits as implemented by the setrlimit(2) system call. By default, each limit field is set to infinity which means RLIM_INFINITY as described in the setrlimit man page. The value type for the CPU-time limits s_cpu and h_cpu is time. The value type for the other limits is memory.


Note –

Not all systems support the setrlimit command. Also, s_vmem and h_vmem are only available on systems supporting RLIMIT_VMEM (see the setrlimit(2) man page on system hosting the queue).


For more information, see the complex man page.