Oracle ILOM enables you to remotely run diagnostics, such as power-on self-test (POST), that would otherwise require physical proximity to the server’s serial port. You can also configure Oracle ILOM to send email alerts of hardware failures, hardware warnings, and other events related to the server or to Oracle ILOM.
The service processor runs independently of the server, using the server’s standby power. Therefore, Oracle ILOM firmware and software continue to function when the server OS goes offline or when the server is powered off.
Note - Refer to the Oracle Integrated Lights Out Manager 3.0 Concepts Guide for comprehensive Oracle ILOM information.
Faults detected by Oracle ILOM, POST, the Predictive Self-Healing (PSH) technology, and the External I/O Expansion Unit (if attached) are forwarded to Oracle ILOM for fault handling (Oracle ILOM Fault Management).
In the event of a system fault, Oracle ILOM ensures that the Service Required LED is lit, FRUID PROMs are updated, the fault is logged, and alerts are displayed. Faulty FRUs are identified in fault messages using the FRU name.
Figure 15 Oracle ILOM Fault Management
The service processor can detect when a fault is no longer present and clear the fault in several ways:
Fault recovery – The system automatically detects that the fault condition is no longer present. The service processor extinguishes the Service Required LED and updates the FRU’s PROM, indicating that the fault is no longer present.
Fault repair – The fault has been repaired by human intervention. In most cases, the service processor detects the repair and extinguishes the Service Required LED. If the service processor does not perform these actions, you must perform these tasks manually by setting the Oracle ILOM component_state or fault_state of the faulted component.
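The two clearing paths above can be illustrated with a small model. This is only a sketch of the behavior described in this section, not Oracle ILOM code: the `Fru` and `ServiceProcessor` classes, the `how` labels, and the FRU names `PS0` and `FAN1` are hypothetical placeholders chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Fru:
    """Toy model of a FRU as tracked by the service processor (hypothetical)."""
    name: str
    faulted: bool = False  # stands in for the fault record kept in the FRU's PROM
    log: list = field(default_factory=list)


class ServiceProcessor:
    """Toy model: tracks faults per FRU and derives the Service Required LED."""

    def __init__(self, frus):
        self.frus = {fru.name: fru for fru in frus}

    @property
    def service_required_led(self):
        # The LED stays lit while any FRU still carries a fault.
        return any(fru.faulted for fru in self.frus.values())

    def report_fault(self, name, reason):
        fru = self.frus[name]
        fru.faulted = True
        fru.log.append(f"fault: {reason}")

    def clear_fault(self, name, how):
        # `how` is an illustrative label: "recovery" (the condition went away),
        # "repair" (detected after service), or "manual" (cleared by an operator).
        fru = self.frus[name]
        fru.faulted = False
        fru.log.append(f"cleared: {how}")


sp = ServiceProcessor([Fru("PS0"), Fru("FAN1")])  # placeholder FRU names
sp.report_fault("PS0", "input power lost")
led_after_fault = sp.service_required_led    # True: one FRU is faulted
sp.clear_fault("PS0", "recovery")
led_after_clear = sp.service_required_led    # False: no faults remain
```

In this model the LED is derived from the per-FRU fault records rather than stored separately, which mirrors the description above: clearing the fault on the FRU is what extinguishes the LED.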
The service processor can detect the removal of a FRU, in many cases even if the FRU is removed while the service processor is powered off (for example, if the system power cables are unplugged during service procedures). This function enables Oracle ILOM to know that a fault, diagnosed to a specific FRU, has been repaired.
Note - Oracle ILOM does not automatically detect hard drive replacement.
Many environmental faults can recover automatically. For example, a temperature that exceeds a threshold might return to within normal limits, or an unplugged power supply might be plugged back in. Oracle ILOM detects the recovery of environmental faults automatically.
Note - No Oracle ILOM command is needed to manually repair an environmental fault.
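Automatic recovery of a threshold-based environmental fault can be pictured as follows. This is a minimal sketch of the idea only: the function name, the sample readings, and the threshold of 50 are assumptions made for illustration and do not reflect actual sensor limits or Oracle ILOM internals.

```python
def track_threshold_fault(readings, threshold):
    """Return (index, event) pairs marking where a fault starts and recovers.

    A fault is active while a reading exceeds the threshold and clears by
    itself as soon as a reading is back within limits.
    """
    events = []
    faulted = False
    for index, value in enumerate(readings):
        over_limit = value > threshold
        if over_limit and not faulted:
            events.append((index, "fault"))
        elif not over_limit and faulted:
            events.append((index, "recovered"))
        faulted = over_limit
    return events


# Illustrative temperature readings and an assumed threshold of 50.
events = track_threshold_fault([40, 52, 58, 49, 41], threshold=50)
print(events)  # [(1, 'fault'), (3, 'recovered')]
```

No operator action appears anywhere in the sketch, which matches the note above: the fault clears once the monitored value returns to normal limits.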
The Predictive Self-Healing technology does not monitor hard drives for faults. As a result, the service processor does not recognize hard drive faults and does not light the fault LEDs on either the chassis or the hard drive itself. Use the Oracle Solaris message files to view hard drive faults.
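Because hard drive faults surface only in the Oracle Solaris message files, a small filter can help when reviewing them. The following is a sketch under stated assumptions: it reads `/var/adm/messages` (the standard Solaris message file) and assumes that relevant entries contain `WARNING:` together with a disk driver instance such as `(sd1)`. The pattern and the sample lines are illustrative, not taken from a real system, and should be adapted to the messages the server actually logs.

```python
import re

# Assumed pattern: kernel warnings that name an sd/ssd disk instance, e.g. "(sd1)".
DISK_WARNING = re.compile(r"WARNING:.*\((?:sd|ssd)\d+\)")


def find_disk_warnings(lines):
    """Return the lines that look like disk driver warnings."""
    return [line.rstrip("\n") for line in lines if DISK_WARNING.search(line)]


def scan_message_file(path="/var/adm/messages"):
    """Scan a Solaris message file for disk warnings (run on the server itself)."""
    with open(path, errors="replace") as handle:
        return find_disk_warnings(handle)


# Made-up sample lines, for illustration only.
sample = [
    "Jan  1 00:00:01 host scsi: WARNING: /example/disk@1 (sd1): illustrative error text\n",
    "Jan  1 00:00:02 host sshd[123]: illustrative unrelated message\n",
]
matches = find_disk_warnings(sample)
```

Here `matches` holds only the first sample line. On a live system, `scan_message_file()` would be called instead of passing sample data.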
Sun SPARC Enterprise T5440 Server Installation and Setup Guide
Sun SPARC Enterprise T5440 Server Administration Guide
Oracle Integrated Lights Out Manager 3.0 Supplement for the Sun SPARC Enterprise T5440 Server