A P P E N D I X  A

Diags Test Results

This section provides additional details about the diagnostics tests that might help to determine the cause of the problems. The details include annotated output of the tests, descriptions of the test algorithms, possible test failure cases, and additional troubleshooting suggestions.


Test Descriptions

This appendix describes the steps that a diagnostics test performs, and the components that might trigger a failure.

Voltage

The voltage limits test verifies that each voltage is within the high and low limits that are defined for that voltage.

Voltage Trimming



Note - If a given voltage supports trimming, the trimming function is exercised.



The voltage trim failure criteria is plus or minus 2 percent from the reading at nominal trim, with the exception of the low trim on the Bulk 3.3V S5 which is trimHi > trimNom > trimLo.

The limits test reads the initial trim setting that is associated with the voltage that is tested, before it reads the voltage from the analog-to-digital conversion (ADC).

If the voltage is within limits, the diagnostic reads the voltage again, with the trim set to nominal, then high, and finally low, and saves the value of each reading as a variable. The actual codes for nominal, low, and high settings vary among VRMs, the CPUs and the power supplies.

Voltage Read

If the nominal values of the monitored voltages is greater than 2.0 volts each, they are normalized to 2.0 volts by a divider network, before they are applied to the inputs of the ADC. The input range of the ADC is 0.0 to 2.5 Volts. The voltage calculation is:

Voltage  = reading *  2.5 / 4096 

The result then is scaled, based on the nominal voltage of the net that is tested. For example,for VCC_120_S0 (12 volts); the voltage (V) would be multiplied by 6.0. (The voltage was divided by 6 in the divider network to obtain the nominal 2.0 volts input.)

The voltage is read 5 (five) times and the result is the average of the 5 (five) readings.

Test Results

Non-Trim Voltage Passed

limits.bulk.v2_5-s0           88             Passed
    Test Details:
        Actual:              2.485
        Nominal:             2.500
        Maximum Limit:       2.625
        Minimum Limit:       2.375
        Sensor:              Bulk 2.5V S0 voltage (ID=bulk.v2_5-s0)
        Component(s):        Motherboard (ID=planar.vpd)

Trim Voltage Passed

limits.bulk.v3_3-s0           91             Passed
    Test Details:
        Actual Trim High:    3.400
        Actual Trim Nominal: 3.321
        Actual Trim Low::    3.237
        Nominal:             3.300
        Maximum Limit:       3.465
        Minimum Limit:       3.135
        Sensor:              Bulk 3.3V S0 voltage (ID=bulk.v3_3-s0)
        Component(s):        Motherboard (ID=planar.vpd)

Voltage Exceeds Limit Failure

limits.bulk.v1_8-s5           46             FAILED
    Failure Details:
        Failure:             Voltage exceeds maximum limit.
        Actual:              1.932
        Nominal:             1.800
        Maximum Limit:       1.890
        Minimum Limit:       1.710
        Sensor:              Bulk 1.8V S5 voltage (ID=bulk.v1_8-s5)
        Component(s):        Motherboard (ID=planar.vpd)

Hardware Failure

limits.bulk.v1_8-s5           46             FAILED
    Failure Details:
        Failure:             Unable to set voltage trim.  No such device or address. 
        Sensor:              Bulk 1.8V S5 voltage (ID=bulk.v1_8-s5)
        Component(s):        Motherboard (ID=planar.vpd)

Fan

This test verifies that the maximum speed of the fan is within spec and that the fan speed can be controlled.

At the start of the test, all fans are set to full on. This bypasses the internal control loop to drive the fans to full speed as rapidly as possible. When the fans have settled at the target speed for 24 seconds, the maximum speed is recorded. Next, the fans are set to run at the low speed. When the fans have settled at the target speed for 24 seconds, the low speed is recorded. The speed readings are compared to the upper and lower limits, to determine if the test failed.

The fans sometimes will approach the set-point asymptotically and hover slightly outside of the desired range. The tandem fans (fan0,1 fan2,3 fan 4,5) exhibit a characteristic increase in speed on the order of 1000 RPM in the downwind fan (fans1,3,5). Each fan controller has an internal clock that is specified at +/-(plus or minus) 10 percent, plus variations for temperature and voltage.

The allowable tolerances for determination of pass/fail currently allow a deviation of -10/+35% (minus 10 to plus 35 percent) for the high limits and -/+15% (minus 15 to plus 15 percent) for the low limits.

Fan Controller Programming

Before it manipulates the fan controller, the test saves the initial state of the fan controller, in order to restore this initial state after completion of the test.

Each controller manages two fans, a primary and a secondary. Each fan has a tachometer output that produces 2 (two) pulses-per-revolution. The master fan's tachometer output (tach0) is the feedback signal for the internal control loop of the controller. The slave fan's tachometer output (tach1) is used only for reading the speed of the slave fan. The controller supports open loop or closed-loop operation, as well as full on and off control.

Test Results

All results are from a Sun Fire V40z Server. A Sun Fire V20z Server has similar output, with fewer fan groups.

Fans Passed

speed.allFans                 2              Passed
    Test Details:        
        fan1.tach         Passed
            Controller:   fan-ctrl2
            High Rated:   8000
            High Actual:  7920
            High Delta:   -1.01%
            High Limits:  -10/+35%
            Low Setpoint: 6160
            Low Expected: 6098
            Low Actual:   6780
            Low Delta:    10.05%
            Low Limits:   -/+15%
            Sensor:       Fan 1 measured speed (ID=fan1.tach)
            Component(s): Fan 1 (ID=NA)
        fan2.tach         Passed
            Controller:   fan-ctrl2
            High Rated:   8000
            High Actual:  8580
            High Delta:   +6.76%
            High Limits:  -10/+35%
            Low Setpoint: 6160
            Low Expected: 6607
            Low Actual:   7320
            Low Delta:    9.75%
            Low Limits:   -/+15%
            Sensor:       Fan 2 measured speed (ID=fan2.tach)
            Component(s): Fan 2 (ID=NA)
        fan3.tach         Passed
            Controller:   fan-ctrl3
            High Rated:   8000
            High Actual:  8100
            High Delta:   +1.23%
            High Limits:  -10/+35%
            Low Setpoint: 6160
            Low Expected: 6237
            Low Actual:   6900
            Low Delta:    9.61%
            Low Limits:   -/+15%
            Sensor:       Fan 3 measured speed (ID=fan3.tach)
            Component(s): Fan 3 (ID=NA)
        fan4.tach         Passed
            Controller:   fan-ctrl3
            High Rated:   8000
            High Actual:  8760
            High Delta:   +8.68%
            High Limits:  -10/+35%
            Low Setpoint: 6160
            Low Expected: 6745
            Low Actual:   7320
            Low Delta:    7.85%
            Low Limits:   -/+15%
            Sensor:       Fan 4 measured speed (ID=fan4.tach)
            Component(s): Fan 4 (ID=NA)

High Speed Failure

speed.allFans                 1              FAILED
    Failure Details:
        fan1.tach         FAILED
            Failure:      fan1 is excessively fast at high speed setting; inlet air path may be obstructed.
            Controller:   fan-ctrl2
            High Rated:   8000
            High Actual:  10900
            High Delta:   +36.25%
            High Limits:  -10/+35%
            Low Setpoint: 6160
            Low Expected: 6329
            Low Actual:   6900
            Low Delta:    8.27%
            Low Limits:   -/+15%
            Sensor:       Fan 1 measured speed (ID=fan1.tach)
            Component(s): Fan 1 (ID=NA)
 
speed.allFans                 2              FAILED
    Test Details:        
        fan1.tach         FAILED
            Failure:      fan1 is too fast at low setting.
            Controller:   fan-ctrl2
            High Rated:   8000
            High Actual:  7920
            High Delta:   -1.01%
            High Limits:  -10/+35%
            Low Setpoint: 6160
            Low Expected: 6098
            Low Actual:   7200
            Low Delta:    16.88%
            Low Limits:   -/+15%
            Sensor:       Fan 1 measured speed (ID=fan1.tach)
            Component(s): Fan 1 (ID=NA)

Low Speed Failure

speed.allFans                 1              FAILED
    Failure Details:
        fan1.tach         FAILED
            Failure:      fan1 is too slow at low setting.
            Controller:   fan-ctrl2
            High Rated:   8000
            High Actual:  8760
            High Delta:   +8.68%
            High Limits:  -10/+35%
            Low Setpoint: 6160
            Low Expected: 6329
            Low Actual:   5200
            Low Delta:    -18.46%
            Low Limits:   -/+15%
            Sensor:       Fan 1 measured speed (ID=fan1.tach)
            Component(s): Fan 1 (ID=NA)
 
speed.allFans                 2              FAILED
    Failure Details:        
        fan1.tach         FAILED
            Failure:      fan1 is too slow at high setting.
            Controller:   fan-ctrl2
            High Rated:   8000
            High Actual:  7000
            High Delta:   -14.28%
            High Limits:  -10/+35%
            Low Setpoint: 6160
            Low Expected: 6098
            Low Actual:   6780
            Low Delta:    10.05%
            Low Limits:   -/+15%
            Sensor:       Fan 1 measured speed (ID=fan1.tach)
            Component(s): Fan 1 (ID=NA)

Memory

The memory tests are implemented as a loadable kernel module and a user-space component. The kernel module implements ioctl functions that actually perform most of the tests.



Note - For versions earlier than 2.x.x.x, ensure that interleaving is disabled in BIOS setup before you run the memory tests. For 2.x.x.x and higher versions, do not disable interleaving in BIOS.



March Test

Data is written to memory from bottom to top. First it writes all 0x0. Then, as it reads and checks the 0x0, it writes 0x5. As it reads and checks the 0x5, it writes 0x0. Then it starts reading from the top, moving toward the bottom. As it reads and checks for 0x0, it writes 0x5. As it reads and checks 0x5, it writes 0x0. Then it reads and checks the 0x0. This entire process is repeated, but 0x5 is replaced with 0xa.

RandAddr Test

Data is written to memory starting from the bottom, moving toward the top. The location of the memory data is written in the address space. Then, the test performs a random check on an address space. If the space does not contain data that is the same as the address, the test fails.

Retention Test

Data is written to memory starting from the bottom, moving toward the top. The test fills the memory, alternating between 0x5 and 0xa, to achieve a checkerboard layout in memory. It pauses for 100 ms, then reads and checks the memory for 0x5 and 0xa.

Test Results

Memory Tests Passed

march.allDimms                PF3            Passed
    Test Details:
        Memory Configuration: Total: 3072Mb
        CPU0-1024Mb CPU1-2048Mb
        CPU0: Width[128] Addr 0 - 3fffffff
         DIMM 0  256Mb Addr 0000000000 - 001fffffff Even Quad Word
         DIMM 1  256Mb Addr 0000000000 - 001fffffff Odd Quad Word
         DIMM 2  256Mb Addr 0020000000 - 003fffffff Even Quad Word
         DIMM 3  256Mb Addr 0020000000 - 003fffffff Odd Quad Word
 
randaddr.allDimms             PF4            Passed
    Test Details:
        Memory Configuration: Total: 3072Mb
        CPU0-1024Mb CPU1-2048Mb
        CPU0: Width[128] Addr 0 - 3fffffff
         DIMM 0  256Mb Addr 0000000000 - 001fffffff Even Quad Word
         DIMM 1  256Mb Addr 0000000000 - 001fffffff Odd Quad Word
         DIMM 2  256Mb Addr 0020000000 - 003fffffff Even Quad Word
         DIMM 3  256Mb Addr 0020000000 - 003fffffff Odd Quad Word
 
retention.allDimms            PF5            Passed
    Test Details:
        Memory Configuration: Total: 3072Mb
        CPU0-1024Mb CPU1-2048Mb
        CPU0: Width[128] Addr 0 - 3fffffff
         DIMM 0  256Mb Addr 0000000000 - 001fffffff Even Quad Word
         DIMM 1  256Mb Addr 0000000000 - 001fffffff Odd Quad Word
         DIMM 2  256Mb Addr 0020000000 - 003fffffff Even Quad Word
         DIMM 3  256Mb Addr 0020000000 - 003fffffff Odd Quad Word

Service Failure

retention.allDimms            PF1            FAILED
    Failure Details:
        Failure: Unable to load services.

ECC Failure

march.allDimms                1              FAILED
    Test Details:
        Failure: ECC ERROR @ Address:0x01a000e700:CPU1, DIMM - 2
CPU 1 DIMM 2 (ID=cpu1.mem2.vpd)
        Correctable, Syndrome 0x18, Multiple Errors Occurred
        CPU1 Function 3 ECC Registers:
00            MCA NB STAT LOW: 85080a13  MCA NB STAT HIGH: 85080a13
        Memory Configuration: Total: 7680Mb
        CPU0-2560Mb CPU1-5120Mb
        CPU0: Width[128] Addr 0 - 9fffffff
         DIMM 0 0256Mb Addr 0080000000 - 009fffffff Even Quad Word
         DIMM 1  256Mb Addr 0080000000 - 009fffffff Odd Quad Word
         DIMM 2 1024Mb Addr 0000000000 - 007fffffff Even Quad Word
         DIMM 3 1024Mb Addr 0000000000 - 007fffffff Odd Quad Word
        CPU1: Width[128] Addr a0000000 - 1dfffffff
         DIMM 0 2048Mb Addr 00a0000000 - 019fffffff Even Quad Word
         DIMM 1 2048Mb Addr 00a0000000 - 019fffffff Odd Quad Word
        *DIMM 2  512Mb Addr 01a0000000 - 01dfffffff Even Quad Word
         DIMM 3  512Mb Addr 01a0000000 - 01dfffffff Odd Quad Word

BIOS Setting Failure

march.allDimms                1              FAILED
    Test Details:
        Failure: Need to disable interleaving in BIOS setup before running memory tests.
Data Comparison Failure
march.allDimms                1              FAILED
    Test Details:
        Failure: Data Miscompare @ Addr 0x1a0000008, CPU 1 DIMM 3
        Expected   : [5555555555555555]
        Observed   : [5555555555505555]
        Difference : [50000]
 
        Memory Configuration: Total: 7168Mb
        CPU0-2560Mb CPU1-4608Mb
        CPU0: Width[128] Addr 0 - 9fffffff
         DIMM 0  256Mb Addr 0080000000 - 009fffffff Even Quad Word
         DIMM 1  256Mb Addr 0080000000 - 009fffffff Odd Quad Word
         DIMM 2 1024Mb Addr 0000000000 - 007fffffff Even Quad Word
         DIMM 3 1024Mb Addr 0000000000 - 007fffffff Odd Quad Word
        CPU1: Width[128] Addr a0000000 - 1bfffffff
         DIMM 0 2048Mb Addr 00a0000000 - 019fffffff Even Quad Word
         DIMM 1 2048Mb Addr 00a0000000 - 019fffffff Odd Quad Word
         DIMM 2  256Mb Addr 01a0000000 - 01bfffffff Even Quad Word
        *DIMM 3  256Mb Addr 01a0000000 - 01bfffffff Odd Quad Word
 
randaddr.allDimms             2              Passed
    Test Details:
        Memory Configuration: Total: 7168Mb
        CPU0-2560Mb CPU1-4608Mb
        CPU0: Width[128] Addr 0 - 9fffffff
         DIMM 0  256Mb Addr 0080000000 - 009fffffff Even Quad Word
         DIMM 1  256Mb Addr 0080000000 - 009fffffff Odd Quad Word
         DIMM 2 1024Mb Addr 0000000000 - 007fffffff Even Quad Word
         DIMM 3 1024Mb Addr 0000000000 - 007fffffff Odd Quad Word
        CPU1: Width[128] Addr a0000000 - 1bfffffff
         DIMM 0 2048Mb Addr 00a0000000 - 019fffffff Even Quad Word
         DIMM 1 2048Mb Addr 00a0000000 - 019fffffff Odd Quad Word
         DIMM 2  256Mb Addr 01a0000000 - 01bfffffff Even Quad Word
         DIMM 3  256Mb Addr 01a0000000 - 01bfffffff Odd Quad Word
 
retention.allDimms            3              FAILED
    Test Details:
        Failure: Data Miscompare @ Addr 0x1a0000008, CPU 1 DIMM 3
        Expected   : [5555555555555555]
        Observed   : [5555555555505555]
        Difference : [50000]
 
        Memory Configuration: Total: 7168Mb
        CPU0-2560Mb CPU1-4608Mb
        CPU0: Width[128] Addr 0 - 9fffffff
         DIMM 0  256Mb Addr 0080000000 - 009fffffff Even Quad Word
         DIMM 1  256Mb Addr 0080000000 - 009fffffff Odd Quad Word
         DIMM 2 1024Mb Addr 0000000000 - 007fffffff Even Quad Word
         DIMM 3 1024Mb Addr 0000000000 - 007fffffff Odd Quad Word
        CPU1: Width[128] Addr a0000000 - 1bfffffff
         DIMM 0 2048Mb Addr 00a0000000 - 019fffffff Even Quad Word
         DIMM 1 2048Mb Addr 00a0000000 - 019fffffff Odd Quad Word
         DIMM 2  256Mb Addr 01a0000000 - 01bfffffff Even Quad Word
        *DIMM 3  256Mb Addr 01a0000000 - 01bfffffff Odd Quad Word

NIC

The NIC phyLoop test performs a loopback test at the PHY. To accomplish this, the test sets the PHY loopback mode in the NIC device driver, initializes a 1500-byte data buffer with an incrementing byte pattern [0x00,0x01,0x02...0xff], and writes the data to the NIC. Next, the test initializes a second buffer with a fixed 0xe5 pattern and read 1500 bytes from the NIC. The test compares the data that is written from that read. Finally, the test sets the loopback mode off.

Test Results

NIC Passed

phyLoop.Nic.0                 PF1            Passed
    Test Details:
        Component(s): Motherboard (ID=planar.vpd)
phyLoop.Nic.1                 PF2            Passed
    Test Details:
        Component(s): Motherboard (ID=planar.vpd)    

Service Failure

Unable to load the driver bcm5700.
phyLoop.Nic.0                 PF1            FAILED
    Failure Details:
        Failure:      Unable to load service.
        Component(s): Motherboard (ID=planar.vpd)  

Link Down Failure

The link status of the device is down.
phyLoop.Nic.0                 PF1            FAILED
    Failure Details:
        Failure:      Link is down.
        Component(s): Motherboard (ID=planar.vpd)  

Link Setting Mismatch Failure

The link status of the device is mismatched.
phyLoop.Nic.0                 PF1            FAILED
    Failure Details:
        Failure:      Link setting mismatch.
        Component(s): Motherboard (ID=planar.vpd)  

Link Status Unknown Failure

The link status of the device is unknown.
phyLoop.Nic.0                 PF1            FAILED
    Failure Details:
        Failure:      Link status unknown.
        Component(s): Motherboard (ID=planar.vpd)  

Loopback Failure

The loopback is off.
phyLoop.Nic.0                 PF1            FAILED
    Failure Details:
        Failure:      Loopback is off.
        Component(s): Motherboard (ID=planar.vpd)  

Write Failure

Unable to write to loopback device.
phyLoop.Nic.0                 PF1            FAILED
    Failure Details:
        Failure:      Write error. Tried to write <X> bytes, only wrote <Y>.
        Component(s): Motherboard (ID=planar.vpd)  

Read Failure

Unable to read the loopback device.
phyLoop.Nic.0                 PF1            FAILED
    Failure Details:
        Failure:      Read error. Tried to read <X> bytes, only read <Y>.
        Component(s): Motherboard (ID=planar.vpd)  

Comparison Failure

A reading from the device is different from what was just written. The offset of the failure is the hex offset from the beginning of the data buffer. Expected and actual are the first hex byte that mismatched.

phyLoop.Nic.1                 PF2            FAILED
    Failure Details:
        Failure:      Compare error. At offset 343 expected 43, got bc.
        Component(s): Motherboard (ID=planar.vpd)  

Storage

The storage tests issue self-test commands to SCSI devices. There are both long and short forms of the self-test command. The execution time of the tests is a function of the device itself.

If the device is present, a SCSI subsystem control block that contains either the short or the long form of the Send Diagnostic command is passed to the SCSI driver for execution.

Test Results

Storage Passed

long.SCSI_0                   PF1            Passed
    Test Details:
        Device: SEAGATE  ST336607LC
        Version: 0004
        Serial number: 3JA0KJF6000073248EGM
        Device type: disk
        Component(s): Hard disk drive 0 (ID=NA)

Device Does Not Recognize Self-test Command Failure

The device does not recognize the self-test command.
short.SCSI_1                  PF2            FAILED
    Failure Details:          
        Failure:      Error starting DST background short test: Illegal Request       
        Component(s): Hard disk drive 1 (ID=NA)

Device Cannot Process Self-test Command Failure

The device is unable to accept and process self-test commands. The output includes the  SCSI sense key.
short.SCSI_1                  PF2            FAILED
    Failure Details:          
        Failure:      SCSI command failed: Sense Key[3]: Not Ready
        Component(s): Hard disk drive 1 (ID=NA)
The following is the list of sense keys.
No Sense
Recovery Data
Not Ready
Medium Error
Hardware Error
Illegal Request
Unit Attention
Data Protect
Blank Check
Vendor Specific
Copy Aborted
Volume Overflow
Miscompare
Reserved

Self-test Failure

The self-test command fails. The "Address of first failure" represents which segment of the vendor's test failed.

short.SCSI_1                  PF2            FAILED
    Failure Details:          
        Failure:      Test failed (Failing segment) Address of first failure{0x0).
        Component(s): Hard disk drive 1 (ID=NA)

Self-test Corruption Failure

This self-test command returned garbage.

short.SCSI_1                  PF2            FAILED
    Failure Details:          
        Failure:      Invalid Self-Test Results Page Returned by System.
        Component(s): Hard disk drive 1 (ID=NA)

Flash Memory

The flash memory diagnostic verifies that the SP flash memory can be written. Each iteration of the diagnostic flips 2 bits (1 in each chip) from their initial erased state (1) to a 0. Eventually, all of the "1" bits in the diagnostic area of the flash are "used up" and the diagnostic sector of the flash is erased during the test, before it writes the 2 bits.

Test Results

Most of the possible failures are related to difficulties accessing the flash part through the MTD driver. These are unlikely to occur and most likely point to a software problem. If these persist, the first attempt at remediation is to erase and re-program the entire flash of the Service Processor. See the Systems Management Guide for information about how to update the Service Processor.

Flash Memory Passed

write.flash                   2              Passed
    Test Details:
        Component(s): Motherboard (ID=planar.vpd)

Open System Failure

Unable to open the flash sector for read/write access.

write.flash                   1              FAILED
    Failure Details:
        Failure:      Unable to open flash driver: <errno string>
        Component(s): Motherboard (ID=planar.vpd)

Read System Failure

Unable to read the flash sector.

write.flash                   1              FAILED
    Failure Details:
        Failure:      Unable to read flash memory: <errno string>
        Component(s): Motherboard (ID=planar.vpd)

Access Device Failure

Unable to determine the size of the flash sector.

write.flash                   1              FAILED
    Failure Details:
        Failure:      Can't determine erase size of device: <errno string>
        Component(s): Motherboard (ID=planar.vpd)

Write System Failure

Unable to write to the flash sector.

write.flash                   1              FAILED
    Failure Details:
        Failure:      Unable to write flash memory: <errno string>
        Component(s): Motherboard (ID=planar.vpd)

Erase Failure

Unable to erase the flash sector. This error could indicate a defective part or other hardware error.

write.flash                   1              FAILED
    Failure Details:
        Failure:      Erase operation failure: <errno string>
        Component(s): Motherboard (ID=planar.vpd)

Magic Number Failure

The magic number at offset 0x0 of the flash reserved partition is not correct. Expected value is 0x44494147 or 0xffffffff (erased). This could indicate that some process inadvertently wrote to the diagnostics area. Re-flash the Service Processor and re-test. See the Systems Management Guide for information about how to update the Service Processor.

write.flash                   1              FAILED
    Failure Details:
        Failure:      Magic number of diagnostics area incorrect, Expected [0x44494147], Actual [0xNNNNNNNN].
        Component(s): Motherboard (ID=planar.vpd)

Data Comparison Failure

A reading of the flash sector is different from what was just written. This is probably a hardware failure. Re-flash the Service Processor and re-test. See the Systems Management Guide for information about how to update the Service Processor.

write.flash                   1              FAILED
    Failure Details:
        Failure:      Data Miscompare: Expected [0xNNNNNNNN], Actual [0xNNNNNNNN].
        Component(s): Motherboard (ID=planar.vpd)

LED

This group of tests verifies functionality of the LED drivers. The test consists of reading the bit of interest in the I2C-connected LED driver chip, inverting the value of that bit, writing it to the chip, reading the new value, verifying that it is indeed toggled, writing the initial value and again verifying that it is correct. The LED does not change to indicate this behavior.

Test Results

The following examples are from a 4300 server. Although a 2100 server has fewer and different LED components, the test output is similar. The examples comprise a condensed version of the report.

LED Passed

toggleLED.allLeds             3              Passed         
    Test Details:        
        cd                Passed
            Sensor:       CDROM Light path location LED (ID=cd.lp)
            Component(s): CD ROM drive (ID=NA)
        cpu0              Passed
            Sensor:       CPU 0 Light path location LED (ID=cpu0.lp)
            Component(s): CPU 0 (ID=cpu0.vpd)
        cpu0.mem0         Passed
            Sensor:       CPU 0 Dimm 0 Light path location LED (ID=cpu0.mem0.lp)
            Component(s): CPU 0 DIMM 0 (ID=cpu0.mem0.vpd)
        cpu0.mem1         Passed
            Sensor:       CPU 0 Dimm 1 Light path location LED (ID=cpu0.mem1.lp)
            Component(s): CPU 0 DIMM 1 (ID=cpu0.mem1.vpd)
        cpu0.mem2         Passed
            Sensor:       CPU 0 Dimm 2 Light path location LED (ID=cpu0.mem2.lp)
            Component(s): CPU 0 DIMM 2 (ID=cpu0.mem2.vpd)
        cpu0.mem3         Passed
            Sensor:       CPU 0 Dimm 3 Light path location LED (ID=cpu0.mem3.lp)
            Component(s): CPU 0 DIMM 3 (ID=cpu0.mem3.vpd)
        cpu0.memvrm       Passed
            Sensor:       CPU 0 Memory VRM Light path location LED (ID=cpu0.memvrm.lp)
            Component(s): CPU 0 memory VRM (ID=cpu0.memvrm.vpd)
        cpu0.vrm          Passed
            Sensor:       CPU 0 VRM Light path location LED (ID=cpu0.vrm.lp)
            Component(s): CPU 0 VRM (ID=cpu0.vrm.vpd)
        cpuplanar         Passed
            Sensor:       Daughtercard Light path location LED (ID=cpuplanar.lp)
            Component(s): CPU Daughter Card (ID=cpuplanar.vpd)
        fault             Passed
            Sensor:       System Fault Indication (ID=faultswitch)
            Component(s): Fault light (ID=NA)
        floppy            Passed
            Sensor:       Floppy Light path location LED (ID=floppy.lp)
            Component(s): Floppy disk drive (ID=NA)
        oppanel           Passed
            Sensor:       LCD Light path location LED (ID=frontpanel.lp)
            Component(s): Front panel (ID=pic.vpd)
        identify          Passed
            Sensor:       Identify switch (ID=identifyswitch)
            Component(s): Identify light (ID=NA)
        front-fans        Passed
            Sensor:       Fan Board Light path location LED (ID=pcifan.lp)
            Component(s): Front Fan backplane (ID=NA)
        planar            Passed
            Sensor:       Motherboard Light path location LED (ID=planar.lp)
            Component(s): Motherboard (ID=planar.vpd)
        disk-backplane    Passed
            Sensor:       SCSI Backplane Light path location LED (ID=scsibp.lp)
            Component(s): SCSI backplane (ID=scsibp.vpd)

Device Not Present Warning

toggleLED.allLeds             3              Warning 
    Test Details:        
        cd                Not Present
            Sensor:       CDROM Light path location LED (ID=cd.lp)
            Component(s): CD ROM drive (ID=NA)

Read Failure

Unable to read the device.
toggleLED.allLeds             3              FAILED         
    Test Details:        
        planar            FAILED
            Failure:      Unable to read LED. <errno string>
            Sensor:       Motherboard Light path location LED (ID=planar.lp)
            Component(s): Motherboard (ID=planar.vpd)

Write Failure

Unable to write to the device.
toggleLED.allLeds             3              FAILED         
    Test Details:        
        planar            FAILED
            Failure:      Unable to write to LED. <errno string>
            Sensor:       Motherboard Light path location LED (ID=planar.lp)
            Component(s): Motherboard (ID=planar.vpd)

Temperature

Each temperature sensor is initialized and the current temperature is read from the device. The temperature then is compared against critical and warning thresholds. If a threshold is exceeded, a failure is indicated.

Use the SP command sensor get to view the current threshold settings for a given temperature.

localhost # sensor get -i cpu0.temp -cwWC
Identifier   Crit Low Warn Low Warn High Crit High
cpu0.memtemp NA       NA          68.00     70.00

Test Results

If the temperature is within allowable ranges, the reading is displayed in degrees Celsius.

Temperature Passed

read.cpu0.memtemp             1            Passed
    Test Details:
        Temperature:  67.3
        Sensor:       CPU 0 Memory temperature (ID=cpu0.memtemp)
        Component(s): Motherboard (ID=planar.vpd)

Reading Exceeds Warning Threshold Passed

The temperature reading is either above or below the warning threshold.

read.ambient.temp                2              Passed
    Test Details:
        Temperature:  26.8
        Warning:      Temperature exceeds the warning threshold of 22.0, but is still safe.
        Sensor:       Ambient air temp (ID=ambienttemp)
        Component(s): Box (enclosure) (ID=NA)

Device Not Present Warning

The device is not present.

Read.cpu0.memtemp             1            Warning
    Failure Details:
        Failure:      Device not present.
        Sensor:       CPU 0 Memory temperature (ID=cpu0.memtemp)
        Component(s): Motherboard (ID=planar.vpd)

Read Failure

Unable to read the device.

read.cpu0.memtemp             1            FAILED
    Failure Details:
       Failure:     Unable to read device temperature.
        Sensor:       CPU 0 Memory temperature (ID=cpu0.memtemp)
        Component(s): Motherboard (ID=planar.vpd)

Reading Exceeds Critical Threshold Failure

The temperature reading is either above or below the critical threshold.

read.cpu0.temp                2              FAILED
    Test Details:
        Failure:      Sensor is below critical threshold: 29.2 < 30.0     
        Sensor:       CPU 0 temperature (ID=cpu0.dietemp)
        Component(s): CPU 0 (ID=cpu0.vpd)
 
read.cpu1.temp                23             FAILED
    Test Details:
        Failure:      Sensor exceeds critical threshold: 30.0 > 29.2     
        Sensor:       CPU 0 temperature (ID=cpu1.dietemp)
        Component(s): CPU 0 (ID=cpu1.vpd)

Operator Panel

The Operator Panel test module reads and saves the current contents of the display buffer. Then it performs a write/read/compare of five different data patterns (0xFF 0xAA 0x55 0x66 0x99) to the display buffer. After the test is complete, the initial contents of the display are restored.

Test Results

OpPanel Passed

write.opPanel                 1              Passed
    Test Details:
        Sensor:       Operator Panel virtual device (ID=oppanel)
        Component(s): Front panel

Read Failure

Unable to read the display buffer.

write.opPanel                 2              FAILED
    Failure Details:
        Failure:      Unable to read OpPanel. <errno string>
        Sensor:       Operator Panel virtual device (ID=oppanel)
        Component(s): Front panel

Write Failure

Unable to write to the display buffer.

write.opPanel                 2              FAILED
    Failure Details:
        Failure:      Unable to write to OpPanel. <errno string>
        Sensor:       Operator Panel virtual device (ID=oppanel)
        Component(s): Front panel

Data Comparison Failure

Data in the display buffer is different from what was just written.

write.opPanel                 2              FAILED
    Failure Details:
        Failure:      Compare failed at line 2, char 12. Expected AA and got 23.
        Sensor:       Operator Panel virtual device (ID=oppanel)
        Component(s): Front panel

Power Supply

The power test module is only available on Sun Fire V40z Servers that have dual power supplies.

The power supply test verifies the presence of each power supply and reads the status register on the power back plane. If a supply is present, the enabled and power good status bits of the PRS are read. The VPD for that supply is read and the checksum is verified. For each supply, a pass status is determined by these criteria:

If supply is not present, absence of alert status or presence of a power good status is considered an error. A missing power supply does not constitute a failure, but is a warning. Power supply status for each power supply is either "Passed," "FAILED," or "Warning." The NPUI (Net Power Usage Indicator) on the power supply motherboard is read to determine the present power consumption of each supply. Each supply produces an output signal. The voltage of this signal is proportional to the power consumption of that supply. This voltage is an input to a PCF8591 dual ADC on the power supply motherboard. The converted value is read from this and used to calculate the power consumption.

Test Results

Power Passed

read.allPowerSupplies         9              Passed
    Test Details:
        Power Supply 1 Status:   Passed
            Presence Detect:     Present
            Enabled:             True
            Power Good:          True
            Part Number:         S00440
            ECN:                 A01
            Serial Number:       PM16768
            Manufacturer:        CHEROKEE
            Date of Manufacture: 12-24-03
            Component(s):        Power supply 1 (ID=ps1.vpd)
        Power Supply 2 Status:   Not Present

Power Good Failure

read.allPowerSupplies         9              FAILED
    Failure Details:
        Power Supply 1 Status:   Power good indicator is false.
            Presence Detect:     Present
            Enabled:             True
            Power Good:          False
            Part Number:         S00440
            ECN:                 A01
            Serial Number:       PM16768
            Manufacturer:        CHEROKEE
            Date of Manufacture: 12-24-03
            Component(s):        Power supply 1 (ID=ps1.vpd)
        Power Supply 2 Status:   Passed
            Presence Detect:     Present
            Enabled:             True
            Power Good:          True
            Part Number:         S00440
            ECN:                 A01
            Serial Number:       PM16769
            Manufacturer:        CHEROKEE
            Date of Manufacture: 12-24-03
            Component(s):        Power supply 2 (ID=ps2.vpd)

Read Failure

read.allPowerSupplies         9              FAILED
    Failure Details:
        Failure: Unable to read device. (Power supply 2)