Sun Netra T5440 Server

Exit Print View

Updated: September 2015
 
 

1.6 Using the Solaris Predictive Self-Healing Feature

The Solaris Predictive Self-Healing (PSH) technology enables the server to diagnose problems while the Solaris OS is running, and mitigate many problems before they negatively affect operations.

The Solaris OS uses the fault manager daemon, fmd(1M), which starts at boot time and runs in the background to monitor the system. If a component generates an error, the daemon handles the error by correlating the error with data from previous errors and other related information to diagnose the problem. Once diagnosed, the fault manager daemon assigns the problem a Universal Unique Identifier (UUID) that distinguishes the problem across any set of systems. When possible, the fault manager daemon initiates steps to self-heal the failed component and take the component offline. The daemon also logs the fault to the syslogd daemon and provides a fault notification with a message ID (MSGID). You can use the message ID to get additional information about the problem from Sun's knowledge article database.

The Predictive Self-Healing technology covers the following server components:

  • UltraSPARC T2 multicore processor

  • Memory

  • I/O bus

The PSH console message provides the following information:

  • Type

  • Severity

  • Description

  • Automated response

  • Impact

  • Suggested action for system administrator

If the Solaris PSH facility detects a faulty component, use the fmdump command to identify the fault. Faulty FRUs are identified in fault messages using the FRU name.


Note - Additional Predictive Self-Healing information is available at: http://www.sun.com/msg