JavaScript is required to for searching.
Skip Navigation Links
Exit Print View
Sun Server X2-8 Product Documentation     Sun Server X2-8 (formerly Sun Fire X4800 M2) Documentation Library
search filter icon
search icon

Document Information

Using This Documentation

Hardware Installation

Operating System Installation

Administration, Diagnostics, and Service

Diagnostics Guide

Overview of the Diagnostics Guide

Introduction to System Diagnostics

Troubleshooting Options

Diagnostic Tools

Troubleshooting the Server

How to Gather Service Visit Information

How to Troubleshoot Power Problems

How to Externally Inspect the Server

How to Internally Inspect the Server

Troubleshooting DIMM Problems

How to Isolate and Replace Faulty DIMM Pairs

How to Isolate and Correct Persistent DIMM Errors

Additional Tests

How to Detect DIMM Errors Using BIOS POST and the Oracle ILOM SEL

How to Detect DIMM Errors Using the CE Log

DIMM Hardware

BIOS POST

Default BIOS Power-On Self-Test (POST) Events

BIOS POST Errors

Using Oracle ILOM to Monitor the Host

Viewing the Oracle ILOM Sensor Readings

How to Use the Oracle ILOM Web Interface to View the Sensor Readings

How to Use the Oracle ILOM Command-Line Interface to View the Sensor Readings

Viewing Fault Status

How to View Fault Status Using the Oracle ILOM Web Interface

How to View Fault Status Using the Oracle ILOM Command-Line Interface

Clearing Faults

How to Clear Faults Using the Oracle ILOM Web Interface

How to Clear Faults Using the Oracle ILOM Command-Line Interface

Viewing the Oracle ILOM System Event Log

How to View the System Event Log Using the Oracle ILOM Web Interface

How to View the System Event Log With the Oracle ILOM Command-Line Interface

Clearing the System Event Log

How to Clear the System Event Log Using the Oracle ILOM Web Interface

How to Clear the System Event Log Using the Oracle ILOM Command-Line Interface

Interpreting Event Log Time Stamps

Resetting the SP

How to Reset the Oracle ILOM SP Using the Web Interface

How to Reset the Oracle ILOM SP Using the Command-Line Interface

Creating a Data Collector Snapshot

How to Create a Snapshot With the Oracle ILOM Web Interface

How to Create a Snapshot With the Oracle ILOM Command-Line Interface

Using SunVTS Diagnostics Software

Introduction to SunVTS Diagnostic Test Suite

SunVTS Log Files

SunVTS Documentation

How to Diagnose Server Problems With the Bootable Diagnostics CD

Performing Pc-Check Diagnostic Tests

Pc-Check Diagnostics Overview

How to Run Pc-Check Diagnostics

Pc-Check Main Menu

System Information Menu

Advanced Diagnostics

Burn-In Testing

Standard Scripts

How to Perform Immediate Burn-In Testing

How to Create and Save Scripts for Deferred Burn-in Testing

Viewing the Pc-Check Results

How to View Pc-Check Files With the Text File Editor

How to View Test Results Using Show Results Summary

How to Print the Results of Diagnostics Tests

U-Boot Diagnostic Start-Up Tests

U-Boot Test Options

Running the U-Boot Diagnostic Tests and Viewing the Results

How to Run the U-Boot Diagnostic Tests

U-Boot Diagnostic Test Output

Sample SP Environmental Variables Showing U-Boot Test Status

Index

How to Detect DIMM Errors Using the CE Log

This task creates a snapshot of the CE log, then uses the snapshot to locate memory errors.

BIOS updates the CE log every five minutes. After changing something, you must wait five minutes to ensure that the data in the CE log reflects the results of that change.

  1. Create a snapshot of the CE log.
    • Use the Oracle ILOM web interface or the CLI to create a snapshot, as described in Creating a Data Collector Snapshot.

    • The data set should be normal. If you are using the web interface, do not select any of the check boxes.

  2. Unzip the zip file created by the snapshot.

    The unzip action produces a directory ilom.

  3. Navigate to the ilom directory in the unzipped output.
  4. Search the file @persist@host_debug_err.log for uncorrectible errors.

    For example, you can search it for the string Uncorrectable.

    It should contain lines similar to the following:

    Mon May  8 02:54:12 2000 ID ffff V ECC No-UE CE Node 2 Branch 0 DIMM Pair(s): D0/D4
    Mon May  8 02:54:12 2000 ID ffff : 12 00 02 00 02 00 03 00 00 00 00 00 00 00 00 00
                                                16 | 00 00 00 00 00

    The following table shows the mapping of nodes to CMODs and CPUs. In the physical system, CMOD 0 is on the bottom, and CMOD 3 is on the top.


    Node
    CMOD
    CPU
    2
    3
    0
    3
    1
    6
    2
    0
    7
    1
    4
    1
    0
    5
    1
    0
    0
    0
    1
    1

    The preceding example identifies the faulty DIMM pair as DIMMs 0,3 in CPU 0 of CMOD 2.