Server Module (Blade) Faults

Fault management can detect the following faults on hardware and/or the environment of the server modules (blades):

fault.cpu.heatsink@bl/sys/p

  • Fault cause:

    • A CPU's temperature has reached or exceeded 82oC (179.6oF), and

    • the blade's ambient temperature is within acceptable limits (less than or equal to 35oC (95oF)) , and

    • all of the rear fan modules are operating normally.

  • Action in response to fault:

    • The blade's Service Action Required LED is lit.

    • The chassis Service Action Required LEDs are lit.

    • The ILOM management interfaces are updated to reflect the fault.

    • The fault is recorded in the event log.

    • The blade's SP attempts a graceful shutdown of the blade. The host has two minutes in which to shut down gracefully; after two minutes, the SP forces an immediate shutdown.

  • Fault clearing:

    • The blade must be replaced or repaired (the failed heatsink replaced).

fault.memory.dimm_ue@bl/sys/p/d

  • Fault cause:

    • BIOS POST (or Pc-Check) has encountered an uncorrectable ECC error on a DIMM.

  • Action in response to fault:

    • The blade's Service Action Required LED is lit.

    • The chassis Service Action Required LEDs are lit.

    • The ILOM management interfaces are updated to reflect the fault.

    • The fault is recorded in the event log.

  • Fault clearing:

    • The DIMM on the blade must be replaced or repaired, or an operator must manually clear the fault.

fault.memory.dimm_ce@bl/sys/p/d

  • Fault cause:

    • BIOS POST (or Pc-Check) has encountered a correctable ECC error on a DIMM.

  • Action in response to fault:

    • The blade's Service Action Required LED is lit.

    • The chassis Service Action Required LEDs are lit.

    • The ILOM management interfaces are updated to reflect the fault.

    • The fault is recorded in the event log.

  • Fault clearing:

    • The blade (DIMM) must be replaced or repaired, or an operator must manually clear the fault.

fault.bios.no_memory@bl/sys

  • Fault cause:

    • BIOS POST cannot find memory for this blade host.

  • Action in response to fault:

    • The blade's Service Action Required LED is lit.

    • The chassis Service Action Required LEDs are lit.

    • The ILOM management interfaces are updated to reflect the fault.

    • The fault is recorded in the event log.

    • The blade host will not boot.

  • Fault clearing:

    • The blade must be replaced or repaired, or an operator must manually clear the fault.

fault.bios.keyboard@bl/sys

  • Fault cause:

    • BIOS POST is unable to initialize the keyboard.

  • Action in response to fault:

    • The blade's Service Action Required LED is lit.

    • The chassis Service Action Required LEDs are lit.

    • The ILOM management interfaces are updated to reflect the fault.

    • The fault is recorded in the event log.

  • Fault clearing:

    • The blade must be replaced or repaired, or an operator must manually clear the fault.

fault.bios.video@bl/sys

  • Fault cause:

    • BIOS POST is unable to find a video controller.

  • Action in response to fault:

    • The blade's Service Action Required LED is lit.

    • The chassis Service Action Required LEDs are lit.

    • The ILOM management interfaces are updated to reflect the fault.

    • The fault is recorded in the event log.

    • The blade host will not boot.

  • Fault clearing:

    • The blade must be replaced or repaired, or an operator must manually clear the fault.

fault.bios.rom.corrupt@bl/sys

  • Fault cause:

    • BIOS POST detects a firmware ROM corruption.

  • Action in response to fault:

    • The blade's Service Action Required LED is lit.

    • The chassis Service Action Required LEDs are lit.

    • The ILOM management interfaces are updated to reflect the fault.

    • The fault is recorded in the event log.

    • The blade host will not boot.

  • Fault clearing:

    • The blade must be replaced or repaired, or an operator must manually clear the fault.

fault.power.current@bl/sys

  • Fault cause:

    • One of three current sensors on the blade is reading a low or zero value.

  • Action in response to fault:

    • Check the current sensors on the blade using either the ILOM web interface, command-line interface, or IPMItool:

      /SYS/I_0_+48V | 2.040 | Amps

      /SYS/I_1_+48V | 1.860 | Amps

      /SYS/I_2_+48V | 0.0 | Amps

  • The chassis Service Action Required LEDs are lit.

  • The fault is recorded in the event log.

  • The blade Service Action Required LED is lit.

Fault clearing:

  • The blade must be replaced. The blade will continue to function until it is replaced.