JavaScript is required to for searching.
Skip Navigation Links
Exit Print View
Sun SPARC Enterprise T5440 Server Topic Set
search filter icon
search icon

Document Information

Using This Documentation

Related Documentation

Documentation, Support, and Training

Site Planning

SPARC Enterprise T5440 Site Planning Guide

Physical Specifications

Minimum Clearance for Service Access

Environmental Specifications

Power Source Requirements

Acoustic Noise Emissions

Agency Compliance Specifications

Operating Environment Requirements

Installation and Setup

Preparing for Installation

About the SPARC Enterprise T5440 Server

Tools and Equipment Needed

Installation Overview

Cabling Notes

About the Slide Rail Assembly

About the Cable Management Arm

Rack Compatibility Requirements

Safety Precautions

ESD Precautions

Installing the SPARC Enterprise T5440 Server

Installing the Rack Slide Assembly

Installing the Server Into the Cabinet

Installing the Cable Management Arm (CMA)

Connecting the Server Cables

Managing Cables With the CMA

Powering On the System

Powering On the System for the First Time

Enabling the Service Processor Network Management Port

Logging Into the Service Processor

Using the Service Processor for Common Operations

Booting the Solaris Operating System

Updating the Firmware

Updating Service Processor and Server Firmware

Updating the Firmware

Selecting a Boot Device

Select a Boot Device

Administration

Basic Administrative Tasks

Initial System Setup

Communicating With the System

Performing Common Tasks

Manage Disk Volumes

Hardware RAID Support

Creating RAID Volumes

Removing Hardware RAID Volumes

Manage the System

Searching for Faults

Bypassing Minor Faults

Clearing Faults

Managing Devices

Use Logical Domains Software

Understanding Logical Domains Software

Operating the Solaris OS With Logical Domains

ALOM-to-ILOM Command Reference

ILOM and ALOM CMT Command Comparison Tables

Remote Management

Understanding ILOM for the Sun SPARC Enterprise T5440 Server

ILOM Overview

Platform-Specific ILOM Features

ILOM Features Not Supported

Managing the Host

Resetting the Host

Managing Host Boot Mode

Viewing and Configuring Host Control Information

Managing System User Interactions

Managing the Service Processor

Storing Customer Information

Display Console History (CLI)

Change Console Escape Characters (CLI)

Changing Configuration Policy Settings

Managing Network Access

Managing Devices

Managing Virtual Keyswitch Settings

IPMI Sensor Reference

Sensors on Sun SPARC Enterprise T5440 Server

Indicators on the Sun SPARC Enterprise T5440 Server Server

ALOM CMT Compatibility Shell

Limits to Backward Compatibility

Create an ALOM CMT Compatibility Shell

ILOM and ALOM CMT Command Comparison

ALOM CMT Variables

ALOM CMT Variable Comparison

Event Messages Available Through the ALOM Compatibility Shell

Event Message Overview

Event Severity Levels

Service Processor Usage Event Messages

Environmental Monitoring Event Messages

Host Monitoring Event Messages

SCC Backup and Restore

Information Stored on The SCC

Service

Identifying Server Components

Infrastructure Boards and Cables

Front Panel Diagram

Front Panel LEDs

Rear Panel Diagram

Rear Panel LEDs

Ethernet Port LEDs

Managing Faults

Understanding Fault Handling Options

Server Diagnostics Overview

Diagnostic Flowchart

Options for Accessing the Service Processor

Oracle ILOM Overview

ALOM CMT Compatibility Shell Overview

Predictive Self-Healing Overview

Oracle VTS Overview

POST Fault Management Overview

POST Fault Management Flowchart

Memory Fault Handling Overview

Connecting to the Service Processor

Switch From the System Console to the Service Processor (Oracle ILOM or ALOM CMT Compatibility Shell)

Switch From Oracle ILOM to the System Console

Switch From the ALOM CMT Compatibility Shell to the System Console

Displaying FRU Information With Oracle ILOM

Display System Components (Oracle ILOM show components Command)

Display Individual Component Information (Oracle ILOM show Command)

Controlling How POST Runs

POST Parameters

Change POST Parameters

Run POST in Maximum Mode

Detecting Faults

Detecting Faults Using LEDs

Detecting Faults (Oracle ILOM show faulty Command)

Detect Faults (Oracle ILOM show faulty Command)

Detecting Faults (Oracle Solaris OS Files and Commands)

Check the Message Buffer

View System Message Log Files

Detecting Faults (Oracle ILOM Event Log)

View Oracle ILOM Event Log

Detecting Faults (Oracle VTS Software)

About Oracle VTS Software

Verify Installation of Oracle VTS Software

Start the Oracle VTS Browser Environment

Oracle VTS Software Packages

Useful Oracle VTS Tests

Detecting Faults Using POST

Identifying Faults Detected by PSH

Detect Faults Identified by the Oracle Solaris PSH Facility (Oracle ILOM fmdump Command)

Clearing Faults

Clear Faults Detected During POST

Clear Faults Detected by PSH

Clear Faults Detected in the External I/O Expansion Unit

Disabling Faulty Components

Disabling Faulty Components Using Automatic System Recovery

Disable System Components

Re-Enable System Components

Oracle ILOM-to-ALOM CMT Command Reference

Preparing to Service the System

Safety Information

Required Tools

Obtain the Chassis Serial Number

Obtain the Chassis Serial Number Remotely

Powering Off the System

Extending the Server to the Maintenance Position

Remove the Server From the Rack

Perform Electrostatic Discharge – Antistatic Prevention Measures

Remove the Top Cover

Servicing Customer-Replaceable Units

Hot-Pluggable and Hot‐Swappable Devices

Servicing Hard Drives

Servicing Fan Trays

Servicing Power Supplies

Servicing PCIe Cards

Servicing CMP/Memory Modules

Servicing FB-DIMMs

Servicing Field-Replaceable Units

Servicing the Front Bezel

Servicing the DVD-ROM Drive

Servicing the Service Processor

Servicing the IDPROM

Servicing the Battery

Servicing the Power Distribution Board

Servicing the Fan Tray Carriage

Servicing the Hard Drive Backplane

Servicing the Motherboard

Servicing the Flex Cable Assembly

Servicing the Front Control Panel

Servicing the Front I/O Board

Returning the Server to Operation

Install the Top Cover

Install the Server Into the Rack

Slide the Server Into the Rack

Connect the Power Cords to the Server

Power On the Server

Performing Node Reconfiguration

I/O Connections to CMP/Memory Modules

Recovering From a Failed CMP/Memory Module

System Bus Topology

I/O Fabric in 2P Configuration

I/O Fabric in 4P Configuration

Identifying Connector Pinouts

Serial Management Port Connector Pinouts

Network Management Port Connector Pinouts

Serial Port Connector Pinouts

USB Connector Pinouts

Gigabit Ethernet Connector Pinouts

Server Components

Customer-Replaceable Units

Field-Replaceable Units

Index

Diagnostic Flowchart

The following diagnostics are available to troubleshoot faulty hardware. See Change POST Parameters for more information about each diagnostic in this chapter.

Figure 14 Diagnostic Flowchart

Figure showing the diagnostic flowchart.

Table 35 Diagnostic Flowchart Actions 

Action No.
Diagnostic Action
Resulting Action
For more information
1.
Check Power OK and AC Present LEDs on the server.
The Power OK LED is located on the front and rear of the chassis.

The AC Present LED is located on the rear of the server on each power supply.

If these LEDs are not on, check the power source and power connections to the server.

2.
Run the Oracle ILOM show faulty command to check for faults.
The show faulty command displays the following kinds of faults:
  • Environmental faults

  • External I/O Expansion Unit faults

  • Predictive Self-Healing (PSH) detected faults

  • POST-detected faults

Faulty FRUs are identified in fault messages using the FRU name.


Note - If the Oracle ILOM show faulty output includes an error string such as Ext sensor or Ext FRU, it indicates a fault in the External I/O Expansion Unit.


3.
Check the Oracle Solaris log files and Oracle ILOM system event log for fault information.
The Oracle Solaris log files and the Oracle ILOM system event log record system events, and provide information about faults.
  • Browse the Oracle ILOM system event log for major or critical events. Some problems are logged in the event log but not added to the show faulty list

  • If system messages indicate a faulty device, replace the FRU.

  • To obtain more diagnostic information, go to Action No. 4

4.
Run Oracle VTS software.
Oracle VTS is an application you can run to exercise and diagnose FRUs. To run Oracle VTS, the server must be running the Oracle Solaris OS.
  • If Oracle VTS reports a faulty device, replace the FRU.

  • If Oracle VTS does not report a faulty device, go to Action No. 5.

5.
Run POST.
POST performs basic tests of the server components and reports faulty FRUs.
6.
Determine if the fault is an environmental or configuration fault.
Determine if the fault is an environmental fault or a configuration fault.

If the fault listed by the show faulty command displays a temperature or voltage fault, then the fault is an environmental fault. Environmental faults can be caused by faulty FRUs (power supply or fan), or by environmental conditions such as when computer room ambient temperature is too high, or the server airflow is blocked. When the environmental condition is corrected, the fault will automatically clear.

If the fault indicates that a fan or power supply is bad, you can perform a hot-swap of the FRU. You can also use the fault LEDs on the server to identify the faulty FRU (fans and power supplies).

If the FRU displayed by the show faulty command is /SYS, the fault is a configuration problem. /SYS indicates no faulty FRU has been diagnosed, but there is a problem with the system configuration.

7.
Determine if the fault was detected in the External I/O Expansion Unit.
Problems detected in the External I/O Expansion Unit include the text string Ext FRU or Ext Sensor at the beginning of the fault description.
8.
Determine if the fault was detected by PSH.
If the fault displayed included a uuid and sunw‐msg-‐id property, the fault was detected by the Predictive Self-Healing software.

If the fault is a PSH-detected fault, refer to the PSH Knowledge Article web site for additional information. The Knowledge Article for the fault is located at the following link:

http://www.sun.com/msg/message-ID

where message-ID is the value of the sunw-msg-id property displayed by the show faulty command.

After the FRU is replaced, perform the procedure to clear PSH-detected faults.

9.
Determine if the fault was detected by POST.
POST performs basic tests of the server components and reports faulty FRUs. When POST detects a faulty FRU, it logs the fault and if possible, takes the FRU offline. POST detected FRUs display the following text in the fault message:

Forced fail reason

In a POST fault message, reason is the name of the power-on routine that detected the failure.

10.
Contact technical support.
The majority of hardware faults are detected by the server’s diagnostics. In rare cases a problem might require additional troubleshooting. If you are unable to determine the cause of the problem, contact your service representative for support.
Related Information