C H A P T E R  4

ILOM 2.0 Diagnostic Tools

The ILOM 2.0 software running on your service processor (SP) has a complete suite of diagnostic capabilities. You can view event logs, the state of the sensors on the server node, and a list of critical faults, if any.

This chapter includes the following topics:


Accessing the SP

Prerequisites

At minimum, you need to know the IP address of the Chassis Monitoring Module (CMM). Refer to the Sun Integrated Lights Out Manager User’s Guide for a procedure to obtain the CMM IP address.

After you have obtained the IP address of the CMM, you can obtain the IP address of the SP. The next topic describes a procedure for obtaining the SP IP address through the CMM ILOM web interface. Refer to Sun Blade X6275 Server Module Installation Guide for a procedure using the ILOM CLI.

Choose one of the following methods to log in to the SP:

After you have obtained the IP address of the CMM, you can obtain the IP address of the SP. The next topic describes a procedure for obtaining the SP IP address through the CMM ILOM web interface. Refer to Chapter 3 of the Sun Blade X6275 Server Module Installation Guide for a procedure using the ILOM CLI.


procedure icon  To Log In to the SP Through the CMM ILOM Web Interface

1. Open a browser and type http://<CMM IP address>

The CMM ILOM welcome screen opens.

2. Enter root as your username and changeme as your password (unless you have previously changed your password).

The Chassis View appears in the ILOM web interface, as shown here.

FIGURE 4-1 ILOM 2.0 GUI: Chassis View


Figure showing chassis view in ILOM GUI.



Note - The upper nodes (node 0) in each blade are numbered from 0-11. Node 1 in each blade has the same number as node 0 plus 12. For example, the nodes in the second blade from the left are numbered 1 (node 0) and 13 (node 1).


3. To log in to the SP for a node, click on the node’s name in the left navigation pane or in the image in the right pane.

The home page (System Information tab) for the node appears in the ILOM web interface:

FIGURE 4-2 ILOM 2.0 GUI: System Information > Versions View


Figure showing the System Information view for a blade in the ILOM GUI.



Note - When you are logged in to the SP through the CMM ILOM web interface, you can obtain the IP address of the SP by selecting the Configuration tab and then the Network tab.



procedure icon  To Log In to the SP ILOM Web Interface Directly

1. Open a browser and type http://<SP IP address>

The SP ILOM welcome screen opens.

2. Enter root as your username and changeme as your password (unless you have previously changed your password).

The ILOM web interface presents the home page (System Information tab) for the node.


procedure icon  To Log In to the SP CLI Through the CMM ILOM CLI

1. Open a terminal and type ssh root@<CMM IP address>

2. Enter changeme as your password (unless you have previously changed your password).

You are logged in to the CMM CLI. The CLI prompt appears:

-> 

3. To log in to the CLI for blade 6, for example, enter this command:

-> start /CH/BL6/SP/cli

You are logged in to the CLI for both the node and its SP. The CLI prompt appears:

-> 

4. Change directories to the host or to the SP:

To change directories to the host, type:

cd /SYS

To change directories to the SP, type:

cd /SP

procedure icon  To Log In to the SP ILOM CLI Directly

1. Open a terminal and type:

->ssh root@SP_IP_address

2. Enter changeme as your password (unless you have previously changed your password).

You are logged in to the CLI for both the node and its SP. The CLI prompt appears:

-> 

3. Change directories to the host or to the SP:

To change directories to the host, type:

cd /SYS 

To change directories to the SP, type:

cd /SP


Monitoring the Server Node With ILOM

Choose one of the following methods for monitoring your node with ILOM:

Monitoring Status Using the ILOM Web Interface

You can use the System Information and System Monitoring views of the ILOM web interface to monitor the status of the server module and its components.


procedure icon  To Monitor Status Using the ILOM Web Interface

1. Access a web browser.

2. Log in to the web interface. See one of the following:

To Log In to the SP Through the CMM ILOM Web Interface

To Log In to the SP ILOM Web Interface Directly

The System Information view opens.

3. Select the submenu screens to view system and component information.

TABLE 4-1 lists the System Information submenu tab functions.


TABLE 4-1 The ILOM System Information Tab Submenu Screens and Tasks

System Monitoring Tab

Tasks

Versions

View server board and SP versions.

Session Time-Out

Select an inactivity time-out for your session.

Components

View information about all the components that are present.

Fault Management

See Fault Management.

Identification Information

Change the SP identification information, such as host name and address.


Refer to the Integrated Lights Out Manager Administration Guide for more information about the System Information tab.

4. Click the System Monitoring tab.

The System Monitoring submenu screens appear.

TABLE 4-2 lists the submenu tabs and tasks.


TABLE 4-2 The ILOM System Monitoring Tab Submenu Screens

System Monitoring Tab

Tasks

Sensors Reading

View the name, type and readings of sensors.

Indicators

View the name and status of the LEDs. Shows both front-panel and internal LEDs.

Event Logs

View events, including details such as event ID, class, type, severity, date and time, and description of event.

Power Management

View actual, permitted, and available power.


Refer to the Integrated Lights Out Manager Administration Guide, for more information about the System Monitoring tab.

Monitoring Status Using the ILOM CLI

You can use the ILOM CLI to monitor the server module status and the status of its components


procedure icon  To Monitor Status Using the ILOM CLI

1. Open a terminal.

2. Log in to the CLI. See one of the following:

To Log In to the SP CLI Through the CMM ILOM CLI

To Log In to the SP ILOM CLI Directly).

The CLI prompt appears:

-> 

3. Navigate to the /SYS namespace.

4. Use the CLI cd (change directory) command to navigate to individual components listed below:


Component Name

Type

/SYS

Host system

/SYS/SP

Service Processor

/SYS/SP/NET0

Network interface

/SYS/MB

Motherboard

/SYS/MB/P0

Host processor

/SYS/MB/P0/D1

DIMM

/SYS/MB/P0/D3

DIMM

/SYS/MB/P0/D5

DIMM

/SYS/MB/BIOS

BIOS

/SYS/MB/CPLD

NVRAM

/SYS/MB/NET0

Network interface

/SYS/MIDPLANE

Chassis

/SYS/PS1

Power supply

/SYS/NEM1

Network module

/SYS/CMM

Chassis Monitoring Module


5. For example, from the /SYS namespace, enter cd MB/P0/D1 to look at DIMM 1 of processor 0.

You see the following:

FIGURE 4-3 show Command on a DIMM


Figure showing CLI output for DIMM 1.

Here are other examples:

FIGURE 4-4 show Command on Processor 0


Figure showing CLI output for processor 0.

FIGURE 4-5 showCommand on the Motherboard


Figure showing CLI output for motherboard.

For more information about using the CLI to monitor and manage the server, refer to the Integrated Lights Out Manager Administration Guide.


Fault Management

Your ILOM software includes the ability to diagnose faults as they occur, where faults are defined as system component failures or chassis problems, such as environmental parameters outside acceptable ranges.

ILOM reports faults for server node components that have a fault_state property. You can see which components have this property by selecting the Components tab in the System Information view (tab). Components with the fault_state property show OK or Fault in the Fault Status column of the Component Management Status table. Components that lack the fault_state property show a hyphen (-).

Chassis environmental values are not shown in the Component Management Status table although they can have a faulted status and can be listed in the Faulted Components table of the Fault Management view (tab).

Chassis faults include the following:

The following figure shows a typical list of system components. Only those that have entries other than a hyphen (-) in the Fault Status column can be reported as faults.

FIGURE 4-6 ILOM 2.0 GUI: System Information > Component Management View


Figure showing the Component Management view.

Fault Types

There are three types of faults:

When correctable and uncorrectable faults occur, they do the following:



Note - The SEL contains entries for every event that occurs, such as starting and stopping the system, while the Fault Management list includes events that require service action. The Fault Management list is much more specific than the SEL.


Clearing Faults

When you correct a component fault, you can clear it from the SP ILOM web interface System Information --> Components tab, using the Clear Faults action from the Actions drop-down list box. This also clears the fault from the Fault Management list and turns off the Service Action Required LED. Refer to the Sun Blade X6275 ILOM Supplement for a list of faults that you can clear.

Chassis faults must be cleared from the CMM, at which time they are also cleared from the SP.

Viewing Faults

Choose one of the following methods for viewing faults with ILOM:


procedure icon  To View the Fault Management List With the ILOM Web Interface

1. Log in to the ILOM web interface for the server node’s SP. See one of the following:

To Log In to the SP Through the CMM ILOM Web Interface

To Log In to the SP ILOM Web Interface Directly

The interface opens to the System Information tab.

2. Select the System Information --> Fault Management tab.

The Faulted Components list appears. If there are no faults, the list contains only the entry “No Items To Display.” If there is a fault, the faulted component is listed by component name, as shown in the following figure.

FIGURE 4-7 ILOM 2.0 GUI: System Information > Component Management View


Figure showing the Fault Management view.

3. Click the component name to obtain more information.

A dialog box opens with more detail. The following figure shows a fault on DIMM 7 of CPU 1 on the motherboard (MB).

FIGURE 4-8 ILOM 2.0 GUI: System Information > Fault Management > Fault Dialog Box


Figure showing the Fault dialog box.


procedure icon  To View the Fault Management List With the ILOM CLI

1. Log in to the ILOM CLI for the server node’s SP. See one of the following:

To Log In to the SP CLI Through the CMM ILOM CLI

To Log In to the SP ILOM CLI Directly

The CLI prompt appears:

-> 

2. Type the command show /SP/faultmgmt

To list all details, use the command:

-> show /SP/faultmgmt -level all

Faults in the ILOM System Event Log

Faults are written to the ILOM system event log. For example, an entry for a power supply fault might look like this:

Fault detected at time = Wed Jan 21 21:40:20 2009. The suspect component: /SYS/PS0 has fault.chassis.env.power.loss with probability=100. Refer to http://www.sun.com/msg/SPX86-8000-55 for details.

SPX86-8000-55 is a Knowledge Article that you can look up on www.sun.com/msg.

http://www.sun.com/msg/SPX86-8000-55 is a web page with a description of the fault, how to repair or clear it, and so forth.

Choose one of the following methods for viewing the ILOM system event log:


procedure icon  To View the System Event Log With the ILOM Web Interface

When a fault occurs, an entry is written to the system event log.

1. Log in to the ILOM web interface for the server node’s SP. See one of the following:

To Log In to the SP Through the CMM ILOM Web Interface

To Log In to the SP ILOM Web Interface Directly

The interface opens to the System Information tab.

2. Select the System Monitoring tab.

3. Select the System Monitoring --> Event Logs tab.

The Event Log opens, as shown in the following figure.

FIGURE 4-9 ILOM 2.0 GUI: System Monitoring > Event Logs (SEL) View


Figure showing Faults in the SEL.


procedure icon  To View the System Event Log With the ILOM CLI

1. Log in to the ILOM CLI for the server node’s SP. See one of the following:

To Log In to the SP CLI Through the CMM ILOM CLI

To Log In to the SP ILOM CLI Directly

2. Enter the command:

-> show /SP/logs/event/list


Data Collection Snapshot

ILOM’s Data Collector utility is used to create a “snapshot” of the server node that you can send to Sun Services for diagnosis. The utility collects log files, runs various commands and collects their output, and sends the data collection as a file to a user-defined location.



Note - The purpose of the ILOM Data Collector utility is to collect data for use by Sun Services to diagnose problems. You should not run this utility unless requested to do so by Sun Services.


Choose one of the following methods for creating a data collector snapshot:


procedure icon  To Create a Data Collector Snapshot Using the ILOM Web Interface

1. Log in to the ILOM web interface for the server node’s SP. See one of the following:

To Log In to the SP Through the CMM ILOM Web Interface

To Log In to the SP ILOM Web Interface Directly

The interface opens to the System Information tab.

2. Select the Maintenance tab.

3. Select the Data Collector tab.

The Data Collector panel opens, as shown in the following figure.

FIGURE 4-10 ILOM 2.0 GUI: Maintenance > Service Snapshot View


Figure showing the Data Collector view.

You can choose a basic or extended data set. You can also choose to send only event logs. Event logs include the system event log and several others.

4. To send maximum information, choose the Full and Disable radio buttons and click the Run button.

or

To send minimum information, choose the Normal and Enable radio buttons and click the Run button.

A Save As dialog box appears.

5. Specify the path and filename of the file where you want the results written.

6. Click OK to save the file.


procedure icon  To Create a Data Collection Snapshot Using the ILOM CLI

1. Log in to the ILOM CLI for the server node’s SP. See one of the following:

To Log In to the SP CLI Through the CMM ILOM CLI

To Log In to the SP ILOM CLI Directly

The CLI prompt appears:
->

2. Use the set command:

-> set /SP/diag/snapshot dataset=MODE

For example:

-> set dataset=full

Set ’dataset’ to ’full’

where MODE can be:

normal--(default) Specifies that ILOM, operating system, and hardware information is to be collected.

normal-logonly

full--Specifies that the maximum information is to be collected.

full-logonly



Note - Because normal is the default, in most cases you do not need to set the dataset property.




Note - Using the full option may reset the running host.


3. To initiate the snapshot data collection, enter:

-> set /SP/diag/snapshot dump_uri=URI

where URI specifies the target directory in the format:

protocol://username:password@host/directory

For example:

ftp://username:password@host_ip_address/data 

where password is the actual administrator password for the domain.

And where the directory, data is relative to the user’s login.


procedure icon  To View the Dataset Properties

1. Use the cd command to get to the /SP/diag/snapshot directory:

-> cd SP/diag/snapshot
/SP/diag/snapshot

2. Use the show command to view the dataset property:

-> show
 
 /SP/diag/snapshot
    Targets:
 
    Properties:
        dataset = full
        dump_uri = (Cannot show property)
        result = (none)
 
    Commands:
        cd
        set
        show