ILOM enables you to remotely run diagnostics, such as power-on self-test (POST), that would otherwise require physical proximity to the server’s serial port. You can also configure ILOM to send email alerts about hardware failures, hardware warnings, and other events related to the server or to ILOM.
The service processor runs independently of the server, using the server’s standby power. Therefore, ILOM firmware and software continue to function when the server OS goes offline or when the server is powered off.
Figure 55 Fault Reporting Through the ILOM Fault Manager
In the event of a system fault, ILOM ensures that the Service Required LED is turned on, FRUID PROMs are updated, the fault is logged, and alerts are displayed. Faulty FRUs are identified in fault messages using the FRU name.
The service processor can detect when a fault is no longer present. When this happens, it clears the fault state in the FRU PROM and extinguishes the Service Required LED.
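The fault lifecycle described above can be pictured with a small model. The following Python sketch is illustrative only and is not ILOM code: the class names, the FRU name `/SYS/EXAMPLE_FRU`, and the rule that the Service Required LED stays lit while any FRU still has a recorded fault are assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class FruState:
    """Hypothetical stand-in for one FRU's record (not an ILOM data structure)."""
    name: str
    prom_fault: bool = False  # models the fault state stored in the FRU PROM


@dataclass
class FaultManagerModel:
    """Toy model of the fault handling steps described in the text."""
    frus: dict = field(default_factory=dict)
    log: list = field(default_factory=list)
    alerts: list = field(default_factory=list)

    @property
    def service_required_led(self) -> bool:
        # Assumption: the LED is lit while at least one FRU has a fault recorded.
        return any(fru.prom_fault for fru in self.frus.values())

    def report_fault(self, fru_name: str, message: str) -> None:
        """On a fault: update the FRU PROM state, log the fault, raise an alert."""
        fru = self.frus.setdefault(fru_name, FruState(fru_name))
        fru.prom_fault = True
        entry = f"{fru_name}: {message}"  # faulty FRUs are identified by FRU name
        self.log.append(entry)
        self.alerts.append(entry)

    def fault_cleared(self, fru_name: str) -> None:
        """When the fault is no longer present: clear the FRU PROM fault state."""
        fru = self.frus.get(fru_name)
        if fru is not None:
            fru.prom_fault = False


if __name__ == "__main__":
    fm = FaultManagerModel()
    fm.report_fault("/SYS/EXAMPLE_FRU", "over temperature")
    print("LED after fault:", fm.service_required_led)
    fm.fault_cleared("/SYS/EXAMPLE_FRU")
    print("LED after recovery:", fm.service_required_led)
```

The LED is modeled as a derived property rather than a stored flag so that it cannot drift out of step with the per-FRU fault records.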
A fault condition can be removed in two ways:
Many environmental faults can recover automatically. For example, a temporary condition might cause the computer room temperature to rise above the maximum threshold, producing an overtemperature fault in the server. If the computer room temperature then returns to the normal range and the server’s internal temperature also drops back to an acceptable level, the service processor detects the new fault-free condition. It then extinguishes the Service Required LED and clears the fault state from the FRU PROM.
Note - No ILOM command is needed to manually repair an environmental fault.
The service processor can automatically detect when a FRU is removed. In many cases, it does this even if the FRU is removed while the service processor is not running.
Note - If the service processor does not automatically clear a fault state after the fault is corrected, you must clear the fault manually. The procedure for clearing faults manually is described in Clear FRU Faults Manually.
Note - ILOM does not automatically detect hard drive replacement.
The Solaris Predictive Self-Healing technology does not monitor hard drives for faults. As a result, the service processor does not recognize hard drive faults and will not light the fault LEDs on either the chassis or the hard drive itself. Use the Solaris message files to view hard drive faults.
For general information about ILOM, see the Oracle Integrated Lights Out Manager (ILOM) 3.0 Concepts Guide.
For detailed information about ILOM features that are specific to this server, see the Oracle Integrated Lights Out Manager (ILOM) 3.0 Supplement for SPARC Enterprise T5140 and T5240 Servers.
Before you can run ILOM commands, you must connect to the service processor. You can do this using either of the following methods:
Connect an ASCII terminal directly to the serial management port.
Use the ssh command to connect to the service processor through an Ethernet connection on the network management port.
Note - Refer to the Oracle Integrated Lights Out Manager (ILOM) 3.0 Supplement for SPARC Enterprise T5140 and T5240 Servers for instructions on configuring and connecting to ILOM.
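As a rough illustration of the second connection method, the following Python sketch builds and runs an ssh command line for an interactive service processor session. It is a minimal sketch, not an Oracle-provided tool: the function names, the host name `sp.example.com`, and the account name `admin` are placeholders chosen for the example, so substitute the address and account configured for your service processor.

```python
import subprocess


def build_ssh_command(sp_host: str, user: str) -> list:
    """Return the argv list for an interactive ssh session to the service processor.

    sp_host and user are placeholders supplied by the caller; nothing here is
    specific to ILOM beyond using ssh over the network management port.
    """
    if not sp_host or not user:
        raise ValueError("both sp_host and user are required")
    return ["ssh", f"{user}@{sp_host}"]


def open_sp_session(sp_host: str, user: str) -> int:
    """Start the ssh client and return its exit status when the session ends."""
    # The argv list is passed directly (no shell), so the values are not
    # reinterpreted by a command interpreter.
    return subprocess.call(build_ssh_command(sp_host, user))


if __name__ == "__main__":
    # Hypothetical host and account, shown only to illustrate the call.
    print(" ".join(build_ssh_command("sp.example.com", "admin")))
```

Calling `open_sp_session` hands the terminal to the ssh client. Once logged in, you type ILOM commands at the `->` prompt as usual.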
The following example shows the character sequence (hash-period) used to change from the system console to the service processor:
#.
The following example shows the ILOM command for changing from the service processor to the system console:
-> start /SP/console
If you are connected to the service processor using the ALOM CMT shell, use the following command to change to the system console: