Go to main content

Managing Faults, Defects, and Alerts in Oracle® Solaris 11.3

Exit Print View

Updated: March 2018
 
 

Receiving Notification of Faults, Defects, and Alerts

The Fault Manager daemon notifies you that a fault or defect has been detected and diagnosed and alerts you to other changes to your system.

Configuring When and How You Will Be Notified

Use the svcs -n and svccfg listnotify commands to show event notification parameters, as shown in Showing Event Notification Parameters in Managing System Services in Oracle Solaris 11.3. Settings for notification parameters for FMA events are stored in properties in svc:/system/fm/notify-params:default. System-wide notification parameters for SMF state transition events are stored in svc:/system/svc/global:default.

Use the svccfg setnotify command to configure FMA event notification, as shown in Configuring Notification of State Transition and FMA Events in Managing System Services in Oracle Solaris 11.3. For example, the following command creates a notification that sends an SMTP message when an FMA-managed problem is repaired:

$ svccfg setnotify problem-repaired smtp:

You can configure notification of fault management error events to use the Simple Mail Transfer Protocol (SMTP) or the Simple Network Management Protocol (SNMP).

FMA event tags include problem-diagnosed, problem-updated, problem-repaired, and problem-resolved. These tags correspond to the problem lifecycle stages described in Fault Management Overview.

Event notification and FMA event tags are also described in the Notification Parameters section in the smf(5) man page. For more information about the notification daemons, see the snmp-notify(1M), smtp-notify(1M), and asr-notify(1M) man pages.

Events generated by SMF state transitions are stored in the service or in the transitioning service instance.

Understanding Messages From the Fault Manager Daemon

The Fault Manager daemon sends messages to both the console and the /var/adm/messages file. Messages from the Fault Manager daemon use the format shown in the following example except that lines in the following example that do not begin with a date actually belong with the preceding line that begins with a date:

Apr 17 15:57:35 bur-7430 fmd: [ID 377184 daemon.error] SUNW-MSG-ID: FMD-8000-CV,
TYPE: Alert, VER: 1, SEVERITY: Minor
Apr 17 15:57:35 bur-7430 EVENT-TIME: Fri Apr 17 15:56:28 EDT 2015
Apr 17 15:57:35 bur-7430 PLATFORM: SUN SERVER X4-4, CSN: 1421NM900G, HOSTNAME: bur-7430
Apr 17 15:57:35 bur-7430 SOURCE: software-diagnosis, REV: 0.1
Apr 17 15:57:35 bur-7430 EVENT-ID: b22c3c73-77d7-4f4e-8030-c589bf057bb9
Apr 17 15:57:35 bur-7430 DESC: FRU '/SYS/HDD0' has been removed from the system.
Apr 17 15:57:35 bur-7430 AUTO-RESPONSE: FMD topology will be updated.
Apr 17 15:57:35 bur-7430 IMPACT: System impact depends on the type of FRU.
Apr 17 15:57:35 bur-7430 REC-ACTION: Use 'fmadm faulty' to provide a more detailed
view of this event. Please refer to the associated reference document at
http://support.oracle.com/msg/FMD-8000-CV for the latest service procedures and
policies regarding this diagnosis.

When you are notified of a diagnosis, consult the recommended knowledge article for additional details. The recommended knowledge article is listed in the last line of the output, which is labeled REC-ACTION for recommended action. The knowledge article might contain actions that you or a service provider should take in addition to other actions listed in the REC-ACTION line.