Sun N1 Grid Engine 6.1 Administration Guide
Book Information
Index
Preface
Chapter 1 Configuring Hosts and Clusters
About Hosts and Daemons
Migrating qmaster to Another Host
How to Migrate qmaster to Another Host Using a Script
How to Migrate qmaster to Another Host Manually
Configuring Shadow Master Hosts
Shadow Master Host Requirements
Shadow Master Hosts File
Starting Shadow Master Hosts
Configuring Shadow Master Hosts Environment Variables
Configuring Hosts
Configuring Execution Hosts With QMON
Adding or Modifying an Execution Host
Defining Scaling Factors
Defining Resource Attributes
Defining Access Permissions
Defining Reporting Variables
Deleting an Execution Host
Shutting Down an Execution Host Daemon
Configuring Execution Hosts From the Command Line
Configuring Administration Hosts With QMON
Adding an Administration Host
Deleting an Administration Host
Configuring Administration Hosts From the Command Line
Configuring Submit Hosts With QMON
Adding a Submit Host
Deleting a Submit Host
Configuring Submit Hosts From the Command Line
Configuring Host Groups With QMON
Adding or Modifying a Host Group
Deleting a Host Group
Configuring Host Groups From the Command Line
Monitoring Execution Hosts With qhost
Invalid Host Names
Killing Daemons From the Command Line
Restarting Daemons From the Command Line
Basic Cluster Configuration
Displaying a Cluster Configuration With QMON
Displaying the Global Cluster Configuration With QMON
Adding and Modifying Global and Host Configurations With QMON
Deleting a Cluster Configuration With QMON
Displaying the Basic Cluster Configurations From the Command Line
Modifying the Basic Cluster Configurations From the Command Line
Chapter 2 Configuring Queues and Queue Calendars
Configuring Queues
Configuring Queues With QMON
Configuring General Parameters
Configuring Execution Method Parameters
Configuring the Checkpointing Parameters
Configuring Parallel Environments
Configuring Load and Suspend Thresholds
Configuring Limits
Configuring Complex Resource Attributes
Configuring Subordinate Queues
Configuring User Access Parameters
Configuring Project Access Parameters
Configuring Owners Parameters
Configuring Queues From the Command Line
Configuring Queue Calendars
Configuring Queue Calendars With QMON
Configuring Queue Calendars From the Command Line
Chapter 3 Configuring Complex Resource Attributes
Complex Resource Attributes
Configuring Complex Resource Attributes With QMON
Assigning Resource Attributes to Queues, Hosts, and the Global Cluster
Queue Resource Attributes
Host Resource Attributes
Global Resource Attributes
Adding Resource Attributes to the Complex
Consumable Resources
Setting Up Consumable Resources
Examples of Setting Up Consumable Resources
Example 1: Floating Software License Management
Example 2: Space Sharing for Virtual Memory
Example 3: Managing Available Disk Space
Configuring Complex Resource Attributes From the Command Line
Load Parameters
Default Load Parameters
Adding Site-Specific Load Parameters
Writing Your Own Load Sensors
Load Sensor Rules Format
Example of a Load Sensor Script
Chapter 4 Managing User Access
Setting Up a User
Configuring User Access
Configuring Manager Accounts
Configuring Manager Accounts With QMON
Configuring Manager Accounts From the Command Line
Configuring Operator Accounts
Configuring Operator Accounts With QMON
Configuring Operator Accounts From the Command Line
Configuring User Access Lists
Configuring User Access Lists With QMON
Configuring User Access Lists From the Command Line
Defining Usersets As Projects and Departments
Configuring Users
Configuring User Objects With QMON
Configuring User Objects From the Command Line
Defining Projects
Defining Projects With QMON
Defining Projects From the Command Line
Using Path Aliasing
Format of Path-Aliasing Files
How Path-Aliasing Files Are Interpreted
Configuring Default Requests
Format of Default Request Files
Chapter 5 Managing Policies and the Scheduler
Administering the Scheduler
About Scheduling
Scheduling Strategies
Dynamic Resource Management
Tickets
Queue Sorting
Job Sorting
About the Urgency Policy
Resource Reservation and Backfilling
What Happens in a Scheduler Interval
Scheduler Monitoring
Configuring the Scheduler
Default Scheduling
Scheduling Alternatives
Changing the Scheduling Algorithm
Scaling System Load
Selecting Queue by Sequence Number
Selecting Queue by Share
Restricting the Number of Jobs per User or Group
Changing the Scheduler Configuration With QMON
Administering Policies
Configuring Policy-Based Resource Management With QMON
Specifying Policy Priority
Configuring the Urgency Policy
Configuring Ticket-Based Policies
Editing Tickets
Sharing Override Tickets
Sharing Functional Ticket Shares
Tuning Scheduling Run Time
Setting the Ticket Policy Hierarchy
Configuring the Share-Based Policy
The Half-Life Factor
Compensation Factor
Hierarchical Share Tree
Configuring the Share-Tree Policy With QMON
Node Attributes
Share Tree Policy Parameters
About the Special User default
Configuring the Share-Based Policy From the Command Line
How to Create Project-Based Share-Tree Scheduling
Configuring the Functional Policy
Functional Shares
Configuring the Functional Share Policy With QMON
Function Category List
Functional Shares Table
Changing Functional Configurations
Ratio Between Sorts of Functional Tickets
Configuring the Functional Share Policy From the Command Line
How to Create User-Based, Project-Based, and Department-Based Functional Scheduling
Configuring the Override Policy
Configuring the Override Policy With QMON
Override Category List
Override Table
Changing Override Configurations
Configuring the Override Policy From the Command Line
Chapter 6 Managing Resource Quotas
Resource Quota Overview
About Resource Quota Sets
Static and Dynamic Resource Quotas
Managing Resource Quotas With QMON
How to Set Resource Quotas Using QMON
Monitoring Resource Quota Utilization From the Command Line
Configuring Resource Quotas from the Command Line
Example
Performance Considerations
Efficient Rule Sets
Chapter 7 Managing Special Environments
Configuring Parallel Environments
Configuring Parallel Environments With QMON
Displaying Configured Parallel Environment Interfaces With QMON
Configuring Parallel Environments From the Command Line
Parallel Environment Startup Procedure
Termination of the Parallel Environment
Tight Integration of Parallel Environments and Grid Engine Software
Configuring Checkpointing Environments
About Checkpointing Environments
Configuring Checkpointing Environments With QMON
Viewing Configured Checkpointing Environments
Adding a Checkpointing Environment
Modifying Checkpointing Environments
Deleting Checkpointing Environments
Configuring Checkpointing Environments From the Command Line
Chapter 8 Other Administrative Tasks
Gathering Accounting and Reporting Statistics
Report Statistics (ARCo)
About the dbwriter Program
Enabling the Reporting File
Calculating Derived Values With dbwriter
Deleting Outdated Records With dbwriter
Accounting and Usage Statistics (qacct)
Backing Up the Grid Engine System Configuration
How to Perform a Manual Backup
How to Restore from a Backup
Using Files and Scripts for Administration Tasks
Using Files to Add or Modify Objects
Using Files to Modify Queues, Hosts, and Environments
Targeting Queue Instances with the qselect Command
Using Files to Modify a Global Configuration or the Scheduler
Chapter 9 Fine Tuning, Error Messages, and Troubleshooting
Fine-Tuning Your Grid Environment
Scheduler Monitoring
Finished Jobs
Job Validation
Load Thresholds and Suspend Thresholds
Load Adjustments
Immediate Scheduling
Urgency Policy and Resource Reservation
Using DTrace for Performance Tuning
Tuning Performance from the Command Line through DTrace
Analyzing Bottlenecks on the Grid Engine Master
Sample DTrace Output for Bottleneck Analysis
How the Grid Engine Software Retrieves Error Reports
Consequences of Different Error or Exit Codes
Running Grid Engine System Programs in Debug Mode
Setting the dbwriter Debug Level
Diagnosing Problems
Pending Jobs Not Being Dispatched
Job or Queue Reported in Error State E
Troubleshooting Common Problems
Chapter 10 Configuring DBWriter
Setup
Database System
Database Server
Base Directory for Reporting Files
Configuration
Interval
Pid
PidCmd
Continuous Mode
Debug Level
Reporting File
Calculation of Derived Values
Derived Values Format
Examples
Deleting Outdated Records
Examples
© 2010, Oracle Corporation and/or its affiliates