Domain Administration Using the Domain Agent
|
This chapter describes Sun Management Center 3.5 domain administration through the domain agent for Sun Fire 6800/4810/4800/3800 systems.
This chapter contains the following topics.
Setting Up Administrative Domains
This is a general procedure. For instructions, refer to the Sun Management Center 3.5 User's Guide.
Starting and Stopping Agents
Refer to the Sun Management Center 3.5 User's Guide.
Creating a Node
This is a general procedure. For instructions, refer to the Sun Management Center 3.5 User's Guide.
Config-Reader Module
A Config-Reader module, Config-Reader-Sun Fire(3600-6800), is automatically loaded during installation. You can use the Config-Reader module to see the physical view and logical view of your host.
In addition, the Config-Reader module monitors your hardware and alerts you whenever there is a problem. For example, this module checks for dual inline memory module (DIMM) errors.
The Config-Reader icon is located under the Hardware icon in the Details window (see FIGURE 4-3).
To Use the Config-Reader Module
|
1. In the Sun Management Center console, double-click a Sun Fire 6800/4810/4800/3800 system icon.
The Details window is displayed (FIGURE 4-1).
FIGURE 4-1 Domain Details Window
2. Double-click the Hardware icon in the Details window.
The Config-Reader-Sun Fire(3800-6800) and the Sun Fire (3800-6800)-Rules icons are displayed (FIGURE 4-2).
FIGURE 4-2 Config-Reader and Rules Icons
3. You can now choose either to:
- Double-click the Config-Reader-Sun Fire(3800-6800) icon to display all devices in the system (FIGURE 4-3), and then double-click a device icon to display properties and values for that device.
- Double-click the Sun Fire(3800-6800)-Rules icon to display rules icons (FIGURE 4-4), and then double-click a rules icon to display properties and values.
To see the properties and values that are available, see Accessing Tables in the Domain Config-Reader Module. For a list of failures that trigger Config Reader alarms, see Sun Fire 6800/4810/4800/3800 System Rules.
FIGURE 4-3 Config-Reader Devices
FIGURE 4-4 Sun Fire 6800/4810/4800/3800 Systems Rules Tables
Loading the Config-Reader Module
If the icon for the Config-Reader-Sun Fire(3800-6800) module or the Sun Fire(3800-6800)-Rules module is not displayed in the Browser tab of the Details window for your Sun Fire 6800/4810/4800/3800 system, the corresponding module is not loaded. In that case, you can manually load one or both modules, as shown below.
To Load a Module
|
1. In the Sun Management Center console, double-click the Sun Fire 6800/4810/4800/3800 system icon.
The Details window is displayed (FIGURE 4-1).
2. Click the Modules tab in the Details window.
The Modules data is displayed (FIGURE 4-5).
FIGURE 4-5 Modules Tab in the Details Window
3. Select Config-Reader-Sun Fire(3800-6800) or Sun Fire(3800-6800)-Rules in the Available Modules list, then click Load.
The Module Loader pop-up window is displayed.
4. Click OK in the Module Loader pop-up window.
If you have sufficient access privileges, the pop-up window closes, and the module moves into the Modules with Load Status list.
If you do not have sufficient access privileges, the pop-up window displays an error message. See Assigning Users to Groups for information about access privileges.
Accessing Tables in the Domain Config-Reader Module
This section includes the Config-Reader module data property tables:
The following tables describe the data properties contained in each of the domain Config-Reader tables. When selected, the Config-Reader data property tables are displayed in the Browser tab of the Details window. For more information, see the Browser chapter in the Sun Management Center 3.5 User's Guide.
Domain System
TABLE 4-1 provides a brief description of the properties for the Sun Fire 6800/4810/4800/3800 system that contains the domain.
TABLE 4-1 Sun Fire 6800/4810/4800/3800 Domain System
Property
|
Rule (if any)
|
Description
|
Name
|
|
Displays the instance name
|
Operating System
|
|
Displays the operating system running on the machine
|
Operating System Version
|
|
Displays the operating system version
|
System Clock Frequency
|
|
Displays the clock frequency in megahertz (MHz)
|
Architecture
|
|
Displays the architecture of the machine
|
Hostname Of The System
|
|
Displays the host name of the system
|
Machine Name
|
|
Displays the machine type
|
System Platform
|
|
Displays the hardware platform of the system
|
Serial Number
|
|
Displays the serial number of the machine
|
Timestamp
|
|
Displays the time stamp value
|
Raw Timestamp
|
|
Displays the raw time stamp value
|
Total Disks
|
|
Displays the total number of disks present in the system
|
Total Memory
|
|
Displays the total memory present in the system in megabytes (MB)
|
Total Processors
|
|
Displays the total processors present in the system
|
Total Tape Devices
|
|
Displays the total tape devices present in the system
|
Domain Boards
TABLE 4-2 provides a brief description of the properties for boards on a Sun Fire 6800/4810/4800/3800 domain.
TABLE 4-2 Sun Fire 6800/4810/4800/3800 Domain Boards
Property
|
Rule (if any)
|
Description
|
Name
|
|
Displays the system name and slot number for this board, such as board(1), board(3), or board(8)
|
Label Name
|
|
Displays the label name and slot number for this unit, such as system board (SB1 or SB3), or I/O board (IB8)
|
Board No
|
|
Displays the board slot number, such as 1, 3, or 8
|
Fru
|
|
Indicates whether the unit is a field-replaceable unit (yes or no)
|
Hot Plugged
|
|
Indicates whether the board has or has not been hot-plugged into the system (yes or no)
|
Hot Pluggable
|
|
Indicates whether the board is or is not hot-pluggable (yes or no)
|
Memory Size
|
|
Displays the memory size in megabytes (MB)
|
Condition
|
rcrse301
|
Displays the board condition: OK, UNKNOWN, or FAILED
|
Type
|
|
Displays the board type, such as CPU_Board, CPCI I/O board, or PCI_I/O_Board. Includes whether a CPU board is also a COD board (COD_CPU_Board) and whether the board is unknown, such as unknown IO board
|
Domain CPU Units
TABLE 4-3 provides a brief description of the properties for CPU units on a Sun Fire 6800/4810/4800/3800 domain.
TABLE 4-3 Sun Fire 6800/4810/4800/3800 Domain CPU Units
Property
|
Rule (if any)
|
Description
|
Name
|
|
Displays the system name and slot number for this unit, such as cpu-unit(4) or cpu-unit(5)
|
Board No.
|
|
Displays the number of the board where this processor is located
|
Clock Frequency
|
|
Displays the frequency of the timer in megahertz (MHz)
|
Cpu Type
|
|
Displays the processor machine type
|
Dcache Size
|
|
Displays the size of data cache (Dcache) in kilobytes (KB)
|
Ecache Size
|
|
Displays the size of external cache (Ecache) megabytes (MB)
|
Fru
|
|
Indicates whether the unit is a field-replaceable unit (yes or no)
|
Icache Size
|
|
Displays the size of instruction cache (Icache) in kilobytes (KB)
|
Model
|
|
Displays the processor model
|
Processor ID
|
|
Displays the identification number of the processor
|
Status
|
rcrse207
|
Displays the CPU unit status: OK, online, --, noncritical, or offline
|
Unit
|
|
Displays the identification number of the unit
|
Domain DIMMs
TABLE 4-4 provides a brief description of the properties for dual inline memory modules (DIMMs) on a Sun Fire 6800/4810/4800/3800 domain.
TABLE 4-4 Sun Fire 6800/4810/4800/3800 Domain DIMMs
Property
|
Rule (if any)
|
Description
|
Name
|
|
Displays the system name and slot number for this unit, such as dimm(0) or dimm(1)
|
Physical Bank No
|
|
Displays the physical bank number where this DIMM is located
|
Bank Size
|
|
Displays the bank size in megabytes (MB)
|
Bank Status
|
|
Displays the operating status: pass, unpopulated, or fail
|
Fru
|
|
Indicates whether the unit is a field-replaceable unit (yes or no)
|
Dimm Size
|
|
Displays the size of the DIMM in megabytes (MB)
|
Domain I/O Controllers
TABLE 4-5 provides a brief description of the properties for I/O controllers on a Sun Fire 6800/4810/4800/3800 domain.
TABLE 4-5 Sun Fire 6800/4810/4800/3800 Domain I/O Controllers
Property
|
Description
|
Name
|
Displays the system name and slot number for this unit, such as pcisch(8) or pcisch(9)
|
Device Type
|
Displays the device type: pci
|
Instance Number
|
Displays the instance number
|
Model
|
Displays the device model
|
Reg
|
Displays the register address
|
Port id
|
Displays the port identifier
|
Version Number
|
Displays the version number
|
Domain Sun Fire Link ASIC
TABLE 4-6 briefly describes the Sun Fire Link ASIC (WCI) properties for a Sun Fire 6800/4810/4800/3800 domain. Refer to the Sun Fire Link Fabric Administrator's Guide for more information about the Sun Fire Link system.
TABLE 4-6 Sun Fire 6800/4810/4800/3800 Domain Sun Fire Link ASIC (WCI)
Property
|
Description
|
Name
|
Displays the system name for this unit, such as wci(1d) or wci(1f)
|
Number of Parolis
|
Displays the number of Paroli daughter-card assembly (DCA) cards
|
Domain Sun Fire Link Paroli DCA
TABLE 4-7 briefly describes the Sun Fire Link Paroli daughter card assembly (DCA) properties for a Sun Fire 6800/4810/4800/3800 domain. Refer to the Sun Fire Link Fabric Administrator's Guide for more information about the Sun Fire Link system.
Note - Paroli card presence can be determined only if the domain is part of a Sun Fire Link cluster. If the domain is not part of a Sun Fire Link cluster, the Paroli card table will be empty; however, this is not an indication that there is no Paroli card in the domain.
|
TABLE 4-7 Sun Fire 6800/4810/4800/3800 Domain Sun Fire Link Paroli DCA
Property
|
Description
|
Name
|
Displays the name of the Paroli card, such as paroli(0) or paroli(1)
|
Fru
|
Indicates whether the unit is a field-replaceable unit (yes or no)
|
Link Number
|
Identifies the port number link to the Paroli card (0 or 2)
|
Link Validity
|
Indicates whether the link is VALID or INVALID to the Paroli card
|
Link State
|
Displays the current state of the link: LINK UP, LINK DOWN, LINK NOT PRESENT, WAIT FOR SC LINK TAKEDOWN, WAIT FOR SC LINK UP, SC ERROR WAIT FOR LINK DOWN, or UNKNOWN
|
Remote Link Number
|
Identifies the link to the remote Paroli card (0-2)
|
Remote Cluster Member
|
Displays the host name of the cluster member at the remote end of the link
|
Domain I/O Devices
TABLE 4-8 provides a brief description of the properties for I/O devices on a Sun Fire 6800/4810/4800/3800 domain.
TABLE 4-8 Sun Fire 6800/4810/4800/3800 Domain IO Devices
Property
|
Description
|
Name
|
Displays the system name for this unit
|
Device Type
|
Displays the device type
|
Disk Count
|
Displays the number of drives attached to this unit
|
Instance Number
|
Displays the instance number
|
Model
|
Displays the model
|
Network Count
|
Displays the number of networks attached to this unit
|
Reg
|
Displays the register address
|
Tape Count
|
Displays the number of drives attached to this unit
|
Domain Disk Devices
TABLE 4-9 provides a brief description of the properties for disk devices on a Sun Fire 6800/4810/4800/3800 domain.
TABLE 4-9 Sun Fire 6800/4810/4800/3800 Domain Disk Devices
Property
|
Description
|
Name
|
Displays the system name for this unit, such as sd(x), where x is the development index of the disk device
|
Device Type
|
Displays the device type, such as disk or CD-ROM
|
Disk Name
|
Displays the controller name, such as c110d0 or c210d0
|
Fru
|
Indicates whether the unit is a field-replaceable unit (yes or no)
|
Instance Number
|
Displays the instance number
|
Disk Target
|
Displays the disk target
|
Domain Tape Devices
TABLE 4-10 provides a brief description of the properties for tape devices on a Sun Fire 6800/4810/4800/3800 domain.
TABLE 4-10 Sun Fire 6800/4810/4800/3800 Domain Tape Devices
Property
|
Rule (if any)
|
Description
|
Name
|
|
Displays the system name for this unit, such as st(x), where x is the development index of the tape device
|
Device Type
|
|
Displays the device type, such as tape drive
|
Fru
|
|
Indicates whether the unit is a field-replaceable unit (yes or no)
|
Instance Number
|
|
Displays the instance number
|
Model
|
|
Displays the model
|
Tape Name
|
|
Displays the tape name
|
Status
|
rcrse225
|
Displays the operating status, including OK, ok, or drive present, but busy
|
Tape Target
|
|
Displays the tape target number
|
Domain Network Devices
TABLE 4-11 provides a brief description of the properties for network devices on a Sun Fire 6800/4810/4800/3800 domain.
TABLE 4-11 Sun Fire 6800/4810/4800/3800 Domain Network Devices
Property
|
Description
|
Name
|
Displays the system name for this unit, such as hme(5)
|
Device Type
|
Displays the device type: network
|
Ethernet Address
|
Displays the Ethernet address
|
Internet Address
|
Displays the Internet address
|
Interface Name
|
Displays the interface name
|
Symbolic Name
|
Displays the symbolic name
|
Domain Memory Controller
TABLE 4-12 provides a brief description of the properties for a memory controller on a Sun Fire 6800/4810/4800/3800 domain.
TABLE 4-12 Sun Fire 6800/4810/4800/3800 Domain Memory Controller
Property
|
Description
|
Name
|
Displays the system name for this unit, such as memory-controller(14,400000)
|
Compatible
|
Displays compatible software packages
|
Device Type
|
Displays the device type: memory-controller
|
Port Id
|
Displays the port identifier
|
Reg
|
Displays the register address
|
Domain Config-Reader Rules
This section describes the alarm rules for the domain Config-Reader module. The system provides a message with the alarms telling what the current property is and what the limit is.
CPU Unit Status Rule (rcrse207)
The CPU unit status rule generates a critical alarm when the CPU unit status is not OK, online, --, or noncritical.
TABLE 4-13 Sun Fire 6800/4810/4800/3800 Domain Config-Reader CPU Unit Status Rule
Alarm Level
|
Meaning
|
Critical
|
CPU unit is in a critical status.
|
Action:
Contact your Sun service personnel.
Tape Status Rule (rcrse225)
The tape status rule generates a critical alarm when the tape status is not OK, ok, or drive present, but busy.
TABLE 4-14 Sun Fire 6800/4810/4800/3800 Domain Config-Reader Tape Status Rule
Alarm Level
|
Meaning
|
Critical
|
Tape is in a critical status.
|
Action:
Contact your Sun service personnel.
System Board Condition Rule (rcrse301)
The system board condition rule generates an information alarm when the system board condition is not OK.
TABLE 4-15 Sun Fire 6800/4810/4800/3800 Domain Config-Reader System Board Condition Rule
Alarm Level
|
Meaning
|
Info
|
System board condition is not OK.
|
Action:
This alarm is for your information only; no action is needed.
Attachment Point Status Rule (rLnkVld)
The attachment point status rule generates a information alarm if the state is not VALID.
TABLE 4-16 Sun Fire 6800/4810/4800/3800 Domain Config-Reader Attachment Point Status Rule
Alarm Level
|
Meaning
|
Info
|
Attachment point state is not VALID.
|
Action:
This alarm is for your information only; no action is needed.
Sun Fire 6800/4810/4800/3800 System Rules
This section describes the alarm rules for the Sun Fire 6800/4810/4800/3800 system. The system provides a message with the alarms telling what the current property is and what the limit is.
CPU Error Message Rule-Solaris 8, Update 5 and Later (rsr1000)
The CPU error message rule generates a critical alarm, when a CPU correctable error is detected. This alarm applies to Solaris 8, Update 5 Operating Environment and later.
TABLE 4-17 Sun Fire 6800/4810/4800/3800 System CPU Error Message Rule
Alarm Level
|
Meaning
|
Critical
|
CPU correctable error was detected in the /var/adm/messages file.
|
Action:
Contact your Sun service personnel.
CPU Error Message Rule-Pre-Solaris 8, Update 5 (rsr1001)
The CPU error message rule generates a critical alarm, when a error-correcting code (ECC) memory error is detected. This alarm applies to Operating Environments earlier than Solaris 8, Update 5.
TABLE 4-18 Sun Fire 6800/4810/4800/3800 System CPU Error Message Rule
Alarm Level
|
Meaning
|
Critical
|
ECC memory error was detected in the /var/adm/messages file.
|
Action:
Contact your Sun service personnel.
SCSI Warning Message Rule (rsr1002)
The Small Computer System Interface (SCSI) warning message rule generates a warning alarm when a warning is detected because of an invalid magic number.
TABLE 4-19 Sun Fire 6800/4810/4800/3800 System SCSI Warning Message Rule
Alarm Level
|
Meaning
|
Warning
|
SCSI warning was detected in the /var/adm/messages file because of an invalid magic number.
|
Action:
Contact your Sun service personnel.
UNIX Warning Message Rule (rsr1003)
The UNIX warning message rule generates a warning alarm when a warning is detected, because an interrupt has not been serviced.
TABLE 4-20 Sun Fire 6800/4810/4800/3800 System UNISX Warning Message Rule
Alarm Level
|
Meaning
|
Warning
|
UNIX warning was detected in the /var/adm/messages file because an interrupt has not been serviced.
|
Action:
Contact your Sun service personnel.
Genunix Date Warning Message Rule (rsr1004)
The Genunix date warning message rule generates a warning alarm when a warning is detected, because the last shutdown time was later than the time on the time-of-day chip.
TABLE 4-21 Sun Fire 6800/4810/4800/3800 System Genunix Date Warning Message Rule
Alarm Level
|
Meaning
|
Warning
|
Genunix date warning was detected in the /var/adm/messages file, because the last shutdown time was later than the time on the time-of-day chip.
|
Action:
Contact your Sun service personnel.
Genunix Clock Warning Message Rule (rsr1005)
The Genunix clock warning message rule generates a warning alarm when a warning is detected, because the maximum swap space is less than the free space.
TABLE 4-22 Sun Fire 6800/4810/4800/3800 System Genunix Clock Warning Message Rule
Alarm Level
|
Meaning
|
Warning
|
Genunix clock warning was detected in the /var/adm/messages file, because the maximum swap space is less than the free space.
|
Action:
Contact your Sun service personnel.
Fan Plane Warning Message Rule (rsr1006)
The fan plane warning message rule generates a warning alarm when a warning is detected.
TABLE 4-23 Sun Fire 6800/4810/4800/3800 System Fan Plane Warning Message Rule
Alarm Level
|
Meaning
|
Warning
|
Fan plane warning was detected in the /var/adm/messages file.
|
Action:
Contact your Sun service personnel.
LUN Failure Rule (rsr1007)
The logical unit number (LUN) failure rule generates a critical alarm when a LUN failure is detected.
TABLE 4-24 Sun Fire 6800/4810/4800/3800 System LUN Failure Rule
Alarm Level
|
Meaning
|
Critical
|
LUN failure was detected in the /var/adm/messages file.
|
Action:
Contact your Sun service personnel.
PLOGI Failure Rule (rsr1008)
The PLOGI failure rule generates a critical alarm when a PLOGI failure is detected.
TABLE 4-25 Sun Fire 6800/4810/4800/3800 System PLOGI Failure Rule
Alarm Level
|
Meaning
|
Critical
|
PLOGI failure was detected in the /var/adm/messages file.
|
Action:
Contact your Sun service personnel.
ECC Correction Rule (rsr1009)
The ECC correction rule generates an information alarm if the ECC had an error and the ECC data bit has been corrected.
TABLE 4-26 Sun Fire 6800/4810/4800/3800 System ECC Correction Rule
Alarm Level
|
Meaning
|
Info
|
ECC data bit is corrected.
|
Action:
This alarm is for your information only; no action is needed.
Qlogic Error Rule (rsr1010)
The Qlogic error rule generates an alarm when a Qlogic loop error is detected.
TABLE 4-27 Sun Fire 6800/4810/4800/3800 System Qlogic Error Rule
Value
|
Alarm Level
|
Meaning
|
OFFLINE
|
Warning
|
Qlogic loop went offline.
|
Others
|
Info
|
Qlogic loop went online
|
Action:
- Contact your Sun service personnel if you see the Warning alarm.
- The Info alarm is for your information only; no action is needed.
Kernel Correction Rule (rsr1011)
The kernel correction rule generates a warning if a clear ECC warning is detected.
TABLE 4-28 Sun Fire 6800/4810/4800/3800 System Kernel Correction Rule
Alarm Level
|
Meaning
|
Warning
|
Clear ECC warning is detected in the /var/adm/messages file, and the kernel cleared an ECC error.
|
Action:
Contact your Sun service personnel.
SCSI Info Event Rule (rsr1012)
The SCSI information event rule generates an information alarm when a SCSI information event is detected.
TABLE 4-29 Sun Fire 6800/4810/4800/3800 System SCSI Info Event Rule
Alarm Level
|
Meaning
|
Info
|
SCSI disk okay and related messages were detected in the /var/adm/messages file.
|
Action:
This alarm is for your information only; no action is needed.
SCSI Disk Online Rule (rsr1013)
The SCSI disk online rule generates a info alarm when a SCSI disk goes online.
TABLE 4-30 Sun Fire 6800/4810/4800/3800 System SCSI Disk Online Rule
Alarm Level
|
Meaning
|
Info
|
SCSI disk went online.
|
Action:
This alarm is for your information only; no action is needed.
Temperature State Rule (rsr1014)
The temperature state rule generates an alarm when the temperature state value is not 1.
TABLE 4-31 Sun Fire 6800/4810/4800/3800 System Temperature State Rule
Value
|
Alarm Level
|
Meaning
|
1
|
|
Temperature state is okay.
|
2
|
Warning
|
Component temperature crosses the warning level.
|
Others
|
Critical
|
Component temperature crosses the error level.
|
Action:
Contact your Sun service personnel.
Power State Rule (rsr1015)
The power state rule generates an alarm when the power state value is not 1.
TABLE 4-32 Sun Fire 6800/4810/4800/3800 System Temperature State Rule
Value
|
Alarm Level
|
Meaning
|
1
|
|
Power state is okay.
|
2
|
Warning
|
Power supply crosses the warning voltage threshold.
|
Others
|
Critical
|
Power supply fails.
|
Action:
Contact your Sun service personnel.
Physical and Logical Views of a Domain
The Hardware tab in the Details window allows you to view physical and logical hardware configurations of Sun Fire 6800/4810/4800/3800 systems. For instructions, see Physical View and Logical View of a Sun Fire 6800/4810/4800/3800 System.
If the system is divided into multiple domains, as a domain administrator you can see detailed information only for domains to which you can access. If you attempt to view a domain to which you do not have access privilege, the message "Insufficient security privilege to load console info" is displayed at the bottom of the Console window.
FIGURE 4-6 shows a physical view of Paroli cards in a domain. Access this view by clicking on the Hardware tab, clicking on the Views list box, and clicking system under Domain. Be sure you have system - Rear in the Rotate Current View list box.
FIGURE 4-6 Domain Physical View of Paroli Cards (Rear)