Oracle ILOM enables you to remotely run diagnostics, such as POST, that would otherwise require physical proximity to the server's serial port. You can also configure Oracle ILOM to send email alerts of hardware failures, hardware warnings, and other events related to the server or to Oracle ILOM.
The SP runs independently of the server, using the server's standby power. Therefore, Oracle ILOM firmware and software continue to function when the server OS goes offline or when the server is powered off.
Figure 38 Fault Reporting Through the Oracle ILOM Fault Manager
The Oracle ILOM fault manager evaluates error messages the manager receives to determine whether the condition being reported should be classified as an alert or a fault.
Alerts -- When the fault manager determines that an error condition being reported does not indicate a faulty FRU, the fault manager classifies the error as an alert.
Alert conditions are often caused by environmental conditions, such as computer room temperature, which might improve over time. Alerts might also be caused by a configuration error, such as the wrong DIMM type being installed.
If the conditions responsible for the alert go away, the fault manager detects the change and stops logging alerts for that condition.
Faults -- When the fault manager determines that a particular FRU has an error condition that is permanent, that error is classified as a fault. This classification causes the Service Required LEDs to be turned on, the FRUID PROMs updated, and a fault message logged. If the FRU has status LEDs, the Service Required LED for that FRU is also turned on.
A FRU identified as having a fault condition must be replaced.
The SP can automatically detect when a FRU has been replaced. In many cases, the SP performs this action even if the FRU is removed while the system is not running (for example, if the system power cables are unplugged during service procedures). This function enables Oracle ILOM to sense that a fault, diagnosed to a specific FRU, has been repaired.
Note - Oracle ILOM does not automatically detect hard drive replacement.
PSH does not monitor hard drives for faults. As a result, the SP does not recognize hard drive faults and does not light the fault LEDs on either the chassis or the hard drive itself. Use the Oracle Solaris message files to view hard drive faults.
For general information about Oracle ILOM, refer to the Oracle ILOM documentation.
For detailed information about Oracle ILOM features that are specific to this server, refer to Server Administration.