Sun Fire X4640 Server Product Documentation     Sun Fire X4640 Server Documentation Library

How to Isolate and Correct DIMM ECC Errors

If the ILOM reports an ECC error or a problem with a DIMM, first complete the steps in the following procedure.

In this example, ILOM reports an error with the DIMM in CPU0, slot 1. The fault LEDs on CPU0, slots 1 and 0, are lit.

Refer to Using the ILOM to Monitor the Host for information on locating component errors.


Caution - Before handling components, attach an antistatic wrist strap to a chassis ground (any unpainted metal surface). The system’s printed circuit boards and hard disk drives contain components that are extremely sensitive to static electricity.


  1. If you have not already done so, shut down your server to standby power mode and remove the cover.

    Refer to the Sun Fire X4640 Server Service Manual.

  2. Inspect the CPU fault LED on each CPU module. The CPU fault LED is lit on the CPU module that has the faulty DIMM.
  3. Disconnect the AC power cords from the server.
  4. Remove the CPU module that has the DIMM problem.

    Refer to the Sun Fire X4640 Server Service Manual.

  5. Inspect the installed DIMMs to ensure that they comply with the DIMM Population Rules in the Sun Fire X4640 Server Service Manual.
  6. Press the Fault Remind button on the CPU module to light the faulty DIMM LEDs.

    See DIMM Fault LEDs for the location of the Fault Remind button and DIMM fault LEDs.

  7. Inspect the fault LEDs on the DIMM slot ejectors.

    If any of these LEDs is lit, it can indicate the component with the fault.

  8. Remove the DIMMs from the CPU module.

    Refer to the Sun Fire X4640 Server Service Manual.

  9. Visually inspect the DIMMs for physical damage, dust, or any other contamination on the connector or circuits.
  10. Visually inspect the DIMM slot for physical damage. Look for cracked or broken plastic on the slot.
  11. Dust off the DIMMs, clean the contacts, and reseat them.
  12. If there is no obvious damage, exchange the individual DIMMs between the two slots of a given pair. Ensure that they are inserted correctly with ejector latches secured. Using the slot numbers from the example:
    1. Remove the DIMMs from CPU0, slots 1 and 0.
    2. Reinstall the DIMM from slot 1 into slot 0.
    3. Reinstall the DIMM from slot 0 into slot 1.
  13. Reinstall the CPU module that has the DIMM problem.

    Refer to the Sun Fire X4640 Server Service Manual.

  14. Reconnect AC power cords to the server.
  15. Power on the server and run the diagnostics test again.
  16. Review the log file.
    • If the error now appears in CPU0, slot 0 (the opposite of the original error in slot 1), the problem is related to the individual DIMM. In this case, return both DIMMs (the pair) to the Support Center for replacement.

    • If the error still appears in CPU0, slot 1 (as the original error did), the problem is not related to an individual DIMM. Instead, it might be caused by CPU0 or by the DIMM slot. Continue with the rest of the procedure.

  17. Shut down the server again and disconnect the AC power cords.
  18. Remove the CPU module that has the DIMM problem, and remove another CPU module that does not indicate a DIMM problem.

    Refer to the Sun Fire X4640 Server Service Manual.

  19. Remove both DIMMs of the pair and install them into paired slots on the second CPU module that did not indicate a DIMM problem.

    Using the slot numbers in the example, install the two DIMMs from CPU0, slots 1 and 0, into either CPU1, slots 1 and 0, or CPU1, slots 3 and 2.

  20. Reinstall both CPU modules that you removed.

    Refer to the Sun Fire X4640 Server Service Manual.

  21. Reconnect AC power cords to the server.
  22. Power on the server and run the diagnostics test again.
  23. Review the log file.
    • If the error now appears under the CPU that manages the DIMM slots where you just installed the DIMMs, the problem is with the DIMMs. Return both DIMMs (the pair) to the Support Center for replacement.

    • If the error remains with the original CPU, there is a problem with that CPU module.
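
The two log reviews in the procedure (steps 16 and 23) follow the same idea: move the DIMMs, then see whether the error moves with them. The following Python sketch shows that decision logic only. It is not part of ILOM or any Sun tool; the names Slot, Verdict, after_pair_swap, and after_module_move are hypothetical, and the INCONCLUSIVE outcome is an assumption added for results that the procedure does not describe.

```python
from enum import Enum
from typing import NamedTuple


class Slot(NamedTuple):
    """Hypothetical identifier for a DIMM location."""
    cpu: int   # CPU module number (0 for CPU0)
    slot: int  # DIMM slot number on that module


class Verdict(Enum):
    REPLACE_DIMM_PAIR = "return both DIMMs of the pair for replacement"
    CONTINUE = "suspect the CPU or the DIMM slot; continue with step 17"
    CPU_MODULE = "the problem is with the original CPU module"
    INCONCLUSIVE = "outcome not covered by the procedure"  # assumption


def after_pair_swap(original: Slot, partner: Slot, observed: Slot) -> Verdict:
    """Step 16: the two DIMMs of one pair have traded slots."""
    if observed == partner:
        # The error followed the DIMM into the other slot of the pair.
        return Verdict.REPLACE_DIMM_PAIR
    if observed == original:
        # The error stayed with the slot, so the DIMM itself is not at fault.
        return Verdict.CONTINUE
    return Verdict.INCONCLUSIVE


def after_module_move(original_cpu: int, target_cpu: int, observed_cpu: int) -> Verdict:
    """Step 23: the DIMM pair now sits in a second CPU module."""
    if observed_cpu == target_cpu:
        # The error followed the DIMMs to the second CPU module.
        return Verdict.REPLACE_DIMM_PAIR
    if observed_cpu == original_cpu:
        # The error stayed behind with the original CPU module.
        return Verdict.CPU_MODULE
    return Verdict.INCONCLUSIVE


if __name__ == "__main__":
    # Example from the procedure: original error in CPU0, slot 1.
    print(after_pair_swap(Slot(0, 1), Slot(0, 0), observed=Slot(0, 0)).value)
    print(after_module_move(0, 1, observed_cpu=0).value)
```

With the example values, the first call models an error that moved from CPU0, slot 1 to CPU0, slot 0 after the swap, and the second models an error that stayed with CPU0 after the pair was moved to CPU1.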