Sun N1 Grid Engine 6.1 User's Guide
Book Information
Index
Numbers and Symbols, A, B, C, D, E, F, G, H, I, J, K, L, M, N, O, P, Q, R, S, T, U, V, W, X
Preface
Chapter 1 Introduction to the N1 Grid Engine 6.1 Software
What Is Grid Computing?
Managing Workload by Managing Resources and Policies
How the System Operates
Matching Resources to Requests
Jobs and Queues
Usage Policies
Using Tickets to Administer Policies
Using the Urgency Policy to Assign Job Priority
Grid Engine System Components
Hosts
Master Host
Execution Hosts
Administration Hosts
Submit Hosts
Daemons
sge_qmaster – The Master Daemon
sge_schedd – The Scheduler Daemon
sge_execd – The Execution Daemon
Queues
Client Commands
QMON, the Grid Engine System's Graphical User Interface
Chapter 2 Navigating the Grid Engine System
QMON Main Control Window
Launching the QMON Main Control Window
Customizing QMON
Users and User Categories
User Access Permissions
Managers, Operators, and Owners
Displaying Queues and Queue Properties
Displaying a List of Queues
Displaying Queue Properties
How to Display Queue Properties With QMON
Displaying Queue Properties From the Command Line
Interpreting Queue Property Information
Hosts and Host Functionality
Finding the Name of the Master Host
Displaying a List of Execution Hosts
Displaying a List of Administration Hosts
Displaying a List of Submit Hosts
Requestable Attributes
Displaying a List of Requestable Attributes
Chapter 3 Submitting Jobs
Submitting a Simple Job
How to Submit a Simple Job From the Command Line
How to Submit a Simple Job With QMON
Submitting Batch Jobs
About Shell Scripts
Example of a Shell Script
Extensions to Regular Shell Scripts
How a Command Interpreter Is Selected
Output Redirection
Active Comments
Environment Variables
Submitting Extended Jobs and Advanced Jobs
Submitting Extended Jobs With QMON
Extended Job Example
Submitting Extended Jobs From the Command Line
Submitting Advanced Jobs With QMON
Advanced Job Example
Submitting Advanced Jobs From the Command Line
Default Request Files
Defining Resource Requirements
How the Grid Engine System Allocates Resources
Job Dependencies
Submitting Array Jobs
Submitting an Array Job With QMON
Submitting an Array Job From the Command Line
Submitting Interactive Jobs
Submitting Interactive Jobs With QMON
Submitting Interactive Jobs With qsh
Submitting Interactive Jobs With qlogin
Transparent Remote Execution
Remote Execution With qrsh
Invoking Transparent Remote Execution With qrsh
Transparent Job Distribution With qtcsh
qtcsh Usage
Parallel Makefile Processing With qmake
qmake Usage
How Jobs Are Scheduled
Job Priorities
Ticket Policies
Queue Selection
Chapter 4 Monitoring and Controlling Jobs and Queues
Monitoring and Controlling Jobs
Monitoring and Controlling Jobs With QMON
Refreshing the Job Control Display
Selecting Jobs
Managing Jobs
Putting Jobs on Hold
Putting Array Job Tasks on Hold
Changing Job Priority
Verifying Job Consistency
Using the Why? Button to Get Information About Pending Jobs
Clearing Error States
Customizing the Job Control Display
Filtering the Job List
Getting Additional Information About Jobs With the QMON Object Browser
Monitoring and Controlling Jobs From the Command Line
Monitoring Jobs With qstat
Controlling Jobs With qdel and qmod
Monitoring Jobs by Email
Monitoring and Controlling Queues
Monitoring and Controlling Queues With QMON
Monitoring and Controlling Cluster Queues
Cluster Queue Status
Monitoring and Controlling Queue Instances
Queue Instance Status
Displaying Queue Instance Attributes
Filtering Cluster Queues and Queue Instances
Controlling Queues With qmod
Using Job Checkpointing
User-Level Checkpointing
Kernel-Level Checkpointing
Migrating Checkpointing Jobs
Composing a Checkpointing Job Script
Submitting, Monitoring, or Deleting a Checkpointing Job From the Command Line
Submitting a Checkpointing Job With QMON
File System Requirements for Checkpointing
Chapter 5 Accounting and Reporting
Starting the Accounting and Reporting Console
How to Start the Accounting and Reporting Console
Creating and Running Simple Queries
How to Create a Simple Query
How to Create a View Configuration
Defining Data Series for Diagrams
How to Run a Simple Query
How to Edit a Simple Query
Creating and Running Advanced Queries
How to Create an Advanced Query
How to Run an Advanced Query
How to Edit an Advanced Query
Latebindings for Advanced Queries
Chapter 6 Automating Grid Engine Functions Through the Distributed Resource Management Application API
Introduction to Distributed Resource Management Application API (DRMAA)
Developing with the C Language Binding
Important Files for the C Language Binding
Including the DRMAA Header File
Compiling Your C Application
Running Your C Application
How to Use the DRMAA 0.95 C Language Binding
C Application Examples
Developing with the Java Language Binding
Important Files for the Java Language Binding
Importing the DRMAA Java Classes and Packages
Compiling Your Java Application
How to Use DRMAA with NetBeans 5.x
Running Your Java Application
Using the DRMAA 0.5 Java Language Binding
Java Application Examples
Chapter 7 Error Messages and Troubleshooting
How the Software Retrieves Error Reports
Consequences of Different Error or Exit Codes
Running Grid Engine System Programs in Debug Mode
Setting the dbwriter Debug Level
Diagnosing Problems
Pending Jobs Not Being Dispatched
Job or Queue Reported in Error State E
Troubleshooting Common Problems
Typical Accounting and Reporting Console Errors
Appendix A Database Schemas
Schema Tables
sge_job
sge_job_usage
sge_job_request
sge_job_log
sge_share_log
sge_host
sge_host_values
sge_queue
sge_queue_values
sge_department
sge_department_values
sge_project
sge_project_values
sge_user
sge_user_values
sge_group
sge_group_values
List of Predefined Views
view_accounting
view_job_times
view_jobs_completed
view_job_log
view_department_values
view_group_values
view_host_values
view_project_values
view_queue_values
view_user_values
List of Derived Values
Glossary
© 2010, Oracle Corporation and/or its affiliates