Sun Logo


Sun Fire X4500/X4540 Servers Diagnostics Guide

819-4363-12



Contents

Preface

Part I Sun Fire X4500 Server Diagnostics Guide

1. Initial Inspection of the Server

Service Visit Troubleshooting Flowchart

Gathering Service Visit Information

Troubleshooting Power Problems

Externally Inspecting the Server

Internally Inspecting the Server

Troubleshooting DIMM Problems

How DIMM Errors Are Handled By the System

Uncorrectable DIMM Errors

Correctable DIMM Errors

BIOS DIMM Error Messages

DIMM Fault LEDs

DIMM Population Rules

Supported DIMM Configurations

Isolating and Correcting DIMM ECC Errors

2. Using SunVTS Diagnostic Software

Running SunVTS Diagnostic Tests

SunVTS Documentation

Diagnosing Server Problems With the Bootable Diagnostics CD

Requirements

Using the Bootable Diagnostics CD

3. Using the ILOM Service Processor GUI to View System Information

Making a Serial Connection to the SP

Viewing ILOM SP Event Logs

Interpreting Event Log Time Stamps

Viewing Replaceable Component Information

Viewing Temperature, Voltage, and Fan Sensor Readings

procedure iconsmall spaceTo View Sensor Readings:

4. Using IPMItool to View System Information

About IPMI

About IPMItool

IPMItool Man Page

Connecting to the Server With IPMItool

Enabling the Anonymous User

Changing the Default Password

Configuring an SSH Key

Using IPMItool to Read Sensors

Reading Sensor Status

Reading All Sensors

Reading Specific Sensors

Using IPMItool to View the ILOM SP System Event Log

Viewing the SEL With IPMItool

Clearing the SEL With IPMItool

Using the Sensor Data Repository (SDR) Cache

Sensor Numbers and Sensor Names in SEL Events

Viewing Component Information With IPMItool

Viewing and Setting Status LEDs

LED Sensor IDs

LED Modes

LED Sensor Groups

Using IPMItool Scripts For Testing

5. Event Logs and POST Codes

Viewing Event Logs

Power-On Self-Test (POST)

How BIOS POST Memory Testing Works

Redirecting Console Output

Changing POST Options

procedure iconsmall spaceTo Change POST Options

POST Codes

POST Code Checkpoints

6. Status Indicator LEDs

External Status Indicator LEDs

Exterior Features, Controls, and Indicators

Front Panel

Rear Panel

Internal Status Indicator LEDs

Disk Drive and Fan Tray LEDs

CPU Board LEDs

7. hd Utility

Overview of the hd Utility

Using the hd Utility

hd Utility Mapping

hd Command Options and Parameters

hd Man page

Options Parameters

Example Using the hd Utility

Sun Fire X4500 Disk Mapping

Pre-ILOM 2.0.2.5 and ILOM 2.0.2.5 and Later

Pre-ILOM 2.0.2.5 and No USB Devices

Pre-ILOM 2.0.2.5 and One USB Device

ILOM 2.0.2.5 or Later and No USB Device

ILOM 2.0.2.5 or Later and One USB Device

ILOM 2.0.2.5 or Later and Three USB Storage Devices

A. Sun Fire X4500 Sensor Locations

B. Error Handling

Handling of Uncorrectable Errors

Handling of Correctable Errors

Handling of Parity Errors (PERR)

Handling of System Errors (SERR)

Handling Mismatching Processors

Hardware Error Handling Summary

Part II Sun Fire X4540 Server Diagnostics Guide

8. Initial Inspection of the Server

Service Visit Troubleshooting Flowchart

Gathering Service Visit Information

Troubleshooting Power Problems

External Inspection of the Server

Internal Inspection of the Server

9. Using SunVTS Diagnostic Software

About SunVTS Diagnostic Software

Accessing SunVTS

SunVTS Documentation

Running SunVTS Diagnostic Tests

Using the Bootable Diagnostics CD

SunVTS Log Files

Requirements

Using the Bootable Diagnostics CD

Reviewing SunVTS Log Files

10. Troubleshooting DIMM Problems

DIMM Population Rules

Supported DIMM Configurations

DIMM Replacement Policy

How DIMM Errors Are Handled by the System

Uncorrectable DIMM Errors

Correctable DIMM Errors

BIOS DIMM Error Messages

DIMM Fault LEDs

Isolating and Correcting DIMM ECC Errors

11. Using the ILOM Service Processor GUI to View System Information

Connecting the SP to a Serial Port

Viewing ILOM SP Event Logs

Interpreting Event Log Time Stamps

Viewing Replaceable Component Information

Viewing Temperature, Voltage, and Fan Sensor Readings

To View Sensor Readings:

12. Using IPMItool to View System Information

About IPMI

About IPMItool

IPMItool Man Page

Connecting to the Server With IPMItool

Enabling the Anonymous User

Changing the Default Password

Configuring an SSH Key

Using IPMItool to Read Sensors

Reading Sensor Status

Reading All Sensors

Reading Specific Sensors

Using IPMItool to View the ILOM SP System Event Log

Viewing the SEL With IPMItool

Clearing the SEL With IPMItool

Using the Sensor Data Repository (SDR) Cache

Sensor Numbers and Sensor Names in SEL Events

Viewing Component Information With IPMItool

Viewing and Setting Status LEDs

LED Sensor IDs

LED Modes

LED Sensor Groups

Using IPMItool Scripts for Testing

13. Event Logs and POST Codes

Viewing Event Logs

About Power-On Self-Test (POST)

BIOS POST Memory Test Overview

Redirecting Console Output

Changing POST Options

procedure iconsmall spaceTo Change POST Options

POST Codes

POST Code Checkpoints

14. Identifying Status and Fault LEDs

Front Panel Features

Rear Panel Features

Internal Status Indicator LEDs

Disk Drive and Fan Tray LEDs

CPU Board LEDs

C. Sun Fire X4540 Sensor Locations

D. Error Handling

Uncorrectable Errors

Correctable Errors

Parity Errors (PERR)

System Errors (SERR)

Handling Mismatched Processors

Hardware Error Handling Summary

Index