Troubleshooting and Diagnostic Information
You can use a variety of diagnostic tools, commands, and indicators to monitor and troubleshoot the server:
LEDs – These indicators provide a quick visual notification of the status of the server and of some of the CRUs and FRUs.
Oracle ILOM firmware – This firmware runs on the service processor and provides a comprehensive service portal through a command-line interface (CLI) and a browser user interface (BUI). It supports lights-out management (remote power-on and power-off), health monitoring of the environmental subsystems (power, fans, temperature, interlock), and fault management with automated diagnosis both during server initialization (QuickPath Interconnect code and Memory Reference code) and at server runtime.
Diagnostics – Accessed through Oracle ILOM, the DOS-based Pc-Check utility tests motherboard components such as the processor, memory, and I/O, as well as ports and slots. If enabled through Oracle ILOM, this utility runs each time the system powers on. For information about Pc-Check, refer to the Oracle x86 Servers Diagnostics, Applications, and Utilities Guide for Servers With Oracle ILOM 3.1 at http://www.oracle.com/goto/x86AdminDiag/docs.
POST – Power-on self-test (POST) performs diagnostics on system components upon system power-on and resets to ensure the integrity of those components. POST messages are displayed and logged in the BIOS event logs. POST works with Oracle ILOM to take faulty components offline, if needed.
SNMP – Simple Network Management Protocol traps are generated by the SNMP agents that are installed on the SNMP devices being managed by Oracle ILOM. Oracle ILOM receives the SNMP traps and converts them into SNMP event messages that appear in the event log.
Oracle Solaris OS Diagnostic Tools
Oracle Solaris OS Predictive Self-Healing (PSH) – The PSH technology provides automated diagnosis of error events encountered with the processor, memory subsystem, and integrated I/O subsystem during runtime. Because PSH can take faulty processors offline and retire memory pages during runtime, it enhances system availability and helps prevent future interruptions. Together, Oracle Solaris PSH, Oracle ILOM, and BIOS provide an extensive fault management architecture for taking processors offline and disabling DIMMs.
Log files and console messages – These items provide the standard Solaris OS log files and investigative commands that can be accessed and displayed on the device of your choice.
Oracle VTS software – This application exercises the system, provides hardware validation, and discloses possible faulty components with recommendations for repair.
The LEDs, Oracle ILOM, Oracle Solaris OS PSH, and many of the log files and console messages are integrated. For example, when Oracle Solaris software detects a fault, it displays the fault, logs it, and passes the information to Oracle ILOM. Oracle ILOM logs the fault in turn and, depending on the fault, might light one or more LEDs.
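The integration described above can be pictured with a small conceptual model. The following Python sketch is only an illustration of the reporting order (display, log, forward to the service processor, optionally light an LED). It is not Oracle code and does not use any Oracle ILOM or Oracle Solaris interface. The class names, fields, and sample fault descriptions are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Fault:
    """Hypothetical fault record; the fields are illustrative only."""
    component: str
    description: str
    lights_led: bool  # assumption: whether this kind of fault lights an LED


@dataclass
class ServiceProcessor:
    """Toy stand-in for Oracle ILOM: keeps an event log and LED state."""
    event_log: List[str] = field(default_factory=list)
    lit_leds: List[str] = field(default_factory=list)

    def receive(self, fault: Fault) -> None:
        # The service processor logs every fault it is told about.
        self.event_log.append(f"{fault.component}: {fault.description}")
        # Only some faults cause an LED to light.
        if fault.lights_led:
            self.lit_leds.append(fault.component)


@dataclass
class HostOS:
    """Toy stand-in for the operating system's fault reporting path."""
    sp: ServiceProcessor
    console: List[str] = field(default_factory=list)
    log: List[str] = field(default_factory=list)

    def report(self, fault: Fault) -> None:
        message = f"{fault.component}: {fault.description}"
        self.console.append(message)  # 1. display the fault
        self.log.append(message)      # 2. log it locally
        self.sp.receive(fault)        # 3. pass it to the service processor


sp = ServiceProcessor()
host = HostOS(sp)
host.report(Fault("DIMM", "example memory fault", lights_led=True))
host.report(Fault("I/O", "example I/O fault", lights_led=False))

print(host.console)
print(sp.event_log)
print(sp.lit_leds)
```

In this model both faults appear on the console, in the host log, and in the service processor event log, but only the first one lights an LED. This mirrors the "depending on the fault" wording above.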