The Oracle Solaris Predictive Self-Healing (PSH) technology enables the server module to diagnose problems while the Oracle Solaris OS is running and to mitigate many problems before they negatively affect operations.
The Oracle Solaris OS uses the fault manager daemon, fmd(1M), which starts at boot time and runs in the background to monitor the server module. If a component generates an error, the daemon correlates the error with data from previous errors and other relevant information to diagnose the problem. Once the problem is diagnosed, the fault manager daemon assigns a universally unique identifier (UUID) to the error. This value distinguishes the error across any set of server modules.
When possible, the fault manager daemon initiates steps to self-heal the failed component and take the component offline. The daemon also logs the fault to the syslogd daemon and provides a fault notification with a message ID (sometimes labeled MSG-ID). You can use the message ID to get additional information about the problem from the knowledge article database.
The PSH technology covers the following server module components:
The PSH console message provides the following information about each detected fault:
Suggested action for a system administrator
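As a rough illustration of how the message ID and UUID might be pulled out of a console message, the following Python sketch parses a message laid out as `KEY: value` pairs separated by commas. The field names (SUNW-MSG-ID, EVENT-ID, and so on), the layout, and every value in the sample are assumptions made for this example. The sample is placeholder text, not output captured from a server module, and the helper name parse_psh_message is hypothetical.

```python
import re

# Placeholder message for illustration only. The field names and layout are
# assumptions; the values are dummies, not output from a real system.
SAMPLE_MESSAGE = """\
SUNW-MSG-ID: EXAMPLE-0000-XX, TYPE: Fault, VER: 1, SEVERITY: Minor
EVENT-ID: 00000000-0000-0000-0000-000000000000
DESC: Example description text.
"""

# A key is an uppercase token at the start of a line or right after a comma,
# followed by a colon.
KEY_RE = re.compile(r"(?:^|,\s*)([A-Z][A-Z0-9-]*):\s*")


def parse_psh_message(text):
    """Return a dict of the KEY: value pairs found in a console message."""
    fields = {}
    for line in text.splitlines():
        matches = list(KEY_RE.finditer(line))
        # Lines with no recognizable key (for example, wrapped text) are skipped.
        for i, match in enumerate(matches):
            # A value runs up to the next key on the same line, or to end of line.
            end = matches[i + 1].start() if i + 1 < len(matches) else len(line)
            fields[match.group(1)] = line[match.end():end].strip()
    return fields


if __name__ == "__main__":
    parsed = parse_psh_message(SAMPLE_MESSAGE)
    print("Message ID:", parsed.get("SUNW-MSG-ID"))
    print("UUID:", parsed.get("EVENT-ID"))
```

Free-form text that itself contains a comma followed by an uppercase word and a colon would be split incorrectly, so treat this as a starting point rather than a robust parser.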
If PSH detects a faulty component, use the fmadm faulty command to display information about the fault. Alternatively, you can use the Oracle ILOM command show faulty for the same purpose.
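If you want to collect the fmadm faulty output from a script, a minimal Python sketch might look like the following. It assumes that it runs on the Oracle Solaris host with sufficient privileges and that fmadm is on the search path. The function name show_faults is hypothetical. The Oracle ILOM show faulty command is entered at the Oracle ILOM prompt and is not covered by this sketch.

```python
import shutil
import subprocess


def show_faults():
    """Return the text printed by `fmadm faulty`, or None if fmadm is not found."""
    fmadm = shutil.which("fmadm")
    if fmadm is None:
        # Not a Solaris host, or the directory holding fmadm is not in PATH.
        return None
    # check=False: a nonzero exit status does not raise; the caller decides
    # how to handle whatever text was printed.
    result = subprocess.run(
        [fmadm, "faulty"],
        capture_output=True,
        text=True,
        check=False,
    )
    return result.stdout


if __name__ == "__main__":
    output = show_faults()
    if output is None:
        print("fmadm not found on this system")
    elif output.strip():
        print(output)
    else:
        print("fmadm faulty printed nothing")
```

The sketch returns the raw text without interpreting it, so that no particular output layout is assumed.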