The Oracle ILOM firmware enables you to remotely run diagnostics, such as POST, that would otherwise require physical proximity to the server's serial port. You can also configure Oracle ILOM to send email alerts of hardware failures, hardware warnings, and other events related to the server or to Oracle ILOM.
The SP runs independently of the server, using the server's standby power. Therefore, Oracle ILOM firmware and software continue to function when the server OS goes offline or when the server is powered off.
Figure 16 Fault Reporting Through the Oracle ILOM Fault Manager
The Oracle ILOM fault manager evaluates error messages it receives to determine whether the condition being reported should be classified as an alert or a fault.
Alerts – When the fault manager determines that an error condition being reported does not indicate a faulty FRU, it classifies the error as an alert.
Alert conditions are often caused by environmental conditions, such as computer room temperature, which may improve over time. They may also be caused by a configuration error, such as the wrong DIMM type being installed.
If the conditions responsible for the alert go away, the fault manager will detect the change and will stop logging alerts for that condition.
Faults – When the fault manager determines that a particular FRU has an error condition that is permanent, that error is classified as a fault. This causes the Service Required LEDs to be turned on, the FRUID PROMs updated, and a fault message logged. If the FRU has status LEDs, the Service Required LED for that FRU will also be turned on.
A FRU identified as having a fault condition must be replaced.
The SP can automatically detect when a FRU has been replaced. In many cases, it does this even if the FRU is removed while the system is not running (for example, if the system power cables are unplugged during service procedures). This function enables Oracle ILOM to sense that a fault, diagnosed to a specific FRU, has been repaired.
Note - Oracle ILOM does not automatically detect hard drive replacement.
The Oracle Solaris PSH technology does not monitor hard drives for faults. As a result, the SP does not recognize hard drive faults and will not light the fault LEDs on either the chassis or the hard drive itself. Use the Oracle Solaris message files to view hard drive faults.
For general information about Oracle ILOM, see the Oracle Integrated Lights Out Manager (ILOM) 3.0 Concepts Guide.
For detailed information about Oracle ILOM features that are specific to this server, see the SPARC T4 Series Servers Administration Guide.