You can use a variety of diagnostic tools, commands, and indicators to monitor and troubleshoot a server:
LEDs – Provide a quick visual notification of the status of the server and of some of the FRUs.
Oracle ILOM 3.0 – Runs on the SP. In addition to providing the interface between the hardware and OS, Oracle ILOM also tracks and reports the health of key server components. Oracle ILOM works closely with POST and PSH to keep the system running even when there is a faulty component.
Power-on self-test (POST) – Performs diagnostics on system components upon system reset to ensure the integrity of those components. POST is configurable and works with Oracle ILOM to take faulty components offline if needed.
PSH - Continuously monitors the health of the CPU, memory, and other components, and works with Oracle ILOM to take a faulty component offline if needed. The PSH technology enables systems to accurately predict component failures and mitigate many serious problems before they occur.
Log files and command interface – Provide the standard Oracle Solaris OS log files and investigative commands that can be accessed and displayed on the device of your choice.
Oracle VTS – Exercises the system, provides hardware validation, and discloses possible faulty components with recommendations for repair.
The LEDs, Oracle ILOM, PSH, and many of the log files and console messages are integrated. For example, when the Oracle Solaris OS detects a fault, the software displays the fault, logs the fault, and passes the information to Oracle ILOM, where the fault is also logged. Depending on the fault, one or more LEDs might also be illuminated.
The diagnostic flowchart in Diagnostics Process illustrates an approach for using the server diagnostics to identify a faulty FRU. The diagnostics you use, and the order in which you use them, depend on the nature of the problem you are troubleshooting. So you might perform some actions and not others.