APPENDIX D

Event Messages Available Through the ALOM Compatibility Shell

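This appendix contains information about event messages. Topics include an overview of event messages, event severity levels, and the usage, environmental monitoring, and host monitoring event messages sent by the service processor.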


Event Message Overview

The firmware on the service processor (known in ALOM CMT as the SC or system controller) sends event messages to several destinations.


Event Severity Levels

Each event has a severity level and a corresponding number.

ALOM compatibility shell configuration parameters use these severity levels to determine which event messages are displayed.
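
The way a severity threshold selects which messages are displayed can be sketched as follows. This is a minimal illustration, not the firmware's implementation. The numeric values (Critical = 1, Major = 2, Minor = 3) and the rule that a threshold shows its own level plus every more severe level are assumptions made for the example, as are the names Severity and should_display.

```python
from enum import IntEnum


class Severity(IntEnum):
    # Assumed numbering for illustration only: a lower number is more severe.
    CRITICAL = 1
    MAJOR = 2
    MINOR = 3


def should_display(event_severity: Severity, threshold: Severity) -> bool:
    """Return True when the event is at least as severe as the threshold."""
    return event_severity <= threshold


# Sample events; the message texts are taken from the tables in this appendix.
events = [
    (Severity.CRITICAL, "Host Watchdog timeout."),
    (Severity.MAJOR, "System poweron is disabled."),
    (Severity.MINOR, "Keyswitch position has been changed to keyswitch_position."),
]

# With a threshold of MAJOR, Critical and Major events pass and Minor ones do not.
shown = [message for severity, message in events
         if should_display(severity, Severity.MAJOR)]

for message in shown:
    print(message)
```

Under these assumptions, a threshold of Minor displays every event and a threshold of Critical displays only Critical events.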


Service Processor Usage Event Messages

The following table displays usage event messages from the service processor (system controller).


TABLE D-1   System Controller Usage Event Messages
| Severity | Message | Description |
|---|---|---|
| Critical | Host has been powered off | ALOM compatibility shell sends this message whenever the SC requests a host power off, including when a user types the poweroff command. |
| Critical | Host has been powered off | ALOM compatibility shell sends this message when the SC requires an immediate host power off, including when a user types the poweroff -f command. |
| Critical | Host has been powered off | ALOM compatibility shell sends this message when the host power has turned off. It is also normal for this event to be sent when the host has reset itself. |
| Major | Host has been powered on | ALOM compatibility shell sends this message when the SC requests a host power on, either because of sc_powerstatememory or when a user types the poweron command. |
| Critical | Host has been reset | ALOM compatibility shell sends one of these three messages (Host has been reset, Host has been powered off, or Host has been powered on) when the SC requests a host reset, including when a user types the reset command. |
| Critical | Host has been powered off | See the description of the Host has been reset message. |
| Critical | Host has been powered on | See the description of the Host has been reset message. |
| Critical | Host System has Reset. | ALOM compatibility shell sends this message when the SC detects that the host has reset. This message is followed immediately by the Host has been powered off event message because reset is implemented as a power cycle on these systems. |
| Minor | "root : Set : object = /clock/datetime : value = "datetime": success | ALOM compatibility shell sends this message when a user types the setdate command to modify the SC date or time. |
| Major | Upgrade succeeded | ALOM compatibility shell sends this message after the SC firmware has been reloaded following a flashupdate command. |
| Minor | "root : Set : object = /HOST/bootmode/state: value = "bootmode-value": success | ALOM compatibility shell sends this message after a user changes the boot mode to normal using the bootmode normal command. |
| Minor | "root : Set : object = /HOST/bootmode/state: value = "reset_nvram": success | ALOM compatibility shell sends this message after a user changes the boot mode to reset_nvram with the bootmode command. |
| Minor | "root : Set : object = /HOST/bootmode/script: value = "text": success | ALOM compatibility shell sends this message after a user changes the boot mode boot script. In the message, "text" is the text of the boot script provided by the user. |
| Minor | Keyswitch position has been changed to keyswitch_position. | ALOM compatibility shell sends this message after a user changes the keyswitch position with the setkeyswitch command. keyswitch_position is the new keyswitch position. |
| Minor | "user" : open session : object = /session/type: value = www/shell: success | ALOM compatibility shell sends this message when users log in. user is the name of the user who just logged in. |
| Minor | "user" : close session : object = /session/type: value = www/shell: success | ALOM compatibility shell sends this message when users log out. user is the name of the user who just logged out. |
| Minor | "root : Set: object = /HOST/send_break_action: value = dumpcore : success | ALOM compatibility shell sends this message when an ALOM compatibility shell user sends a request to the host to dump core by typing the break -D command. |
| Critical | Host Watchdog timeout. | ALOM compatibility shell sends this message when the host watchdog has timed out and the sys_autorestart variable has been set to none. The SC does not perform any corrective measures. |
| Critical | SP Request to Dump core Host due to Watchdog. | ALOM compatibility shell sends this message when the host watchdog has timed out and the sys_autorestart variable has been set to dumpcore. The SC attempts to perform a core dump of the host to capture error state information. The dump core feature is not supported by all OS versions. |
| Critical | SP Request to Reset Host due to Watchdog. | ALOM compatibility shell sends this message when the host watchdog has timed out and the sys_autorestart variable has been set to reset. The SC then attempts to reset the host. |
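
Several of the usage messages in this table share a common shape: a user name, an action, an object path, a value, and a result, separated by colons. The following sketch shows one way a log-processing script might split such a message into fields. It is an illustration under stated assumptions, not a documented parser: the pattern is inferred only from the message texts in this table, the function name parse_audit_message is hypothetical, and the user name admin in the usage example is made up.

```python
import re

# Pattern inferred from the message texts in Table D-1 (an assumption, not a
# published grammar). The quotation marks around the user name and the value
# are optional because the table shows them inconsistently.
AUDIT_RE = re.compile(
    r'^"?(?P<user>[^":]+?)"?\s*:\s*'
    r'(?P<action>[^:]+?)\s*:\s*'
    r'object\s*=\s*(?P<object>\S+?)\s*:\s*'
    r'value\s*=\s*(?P<value>.+?)\s*:\s*'
    r'(?P<result>\w+)\s*$'
)


def parse_audit_message(message):
    """Split a colon-separated usage message into a dict, or return None."""
    match = AUDIT_RE.match(message.strip())
    if match is None:
        return None
    fields = match.groupdict()
    # Drop the quotation marks that surround some values.
    fields["value"] = fields["value"].strip('"')
    return fields


# Hypothetical login event for a user named admin.
login = parse_audit_message(
    '"admin" : open session : object = /session/type: value = www/shell: success'
)
print(login)
```

Messages that do not follow this shape, such as Host has been powered off, return None and can be handled separately.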


Environmental Monitoring Event Messages

The following table displays environmental monitoring event messages from the service processor (system controller).


TABLE D-2   Environmental Monitoring Event Messages
| Severity | Message | Description |
|---|---|---|
| Critical | SP detected fault at time time. Chassis cover removed. | ALOM compatibility shell sends this message if the chassis cover has been removed. The platform hardware turns managed system power off immediately as a precautionary measure. The event message System poweron is disabled should accompany this message to prevent the use of the poweron command while the chassis cover is removed. |
| Major | System poweron is disabled. | ALOM compatibility shell sends this message when the SC refuses to power on the system, either through the user poweron command or by the front panel power button. The SC disables power on because of an accompanying event, such as the event indicated by the message Chassis cover removed. Other possibilities include a device failure or insufficient fan cooling. |
| Major | System poweron is enabled. | ALOM compatibility shell sends this message after the condition that caused power on to be disabled (indicated by the preceding System poweron is disabled message) has been rectified, for example, by replacing the chassis cover or installing sufficient fans to cool the system. |
| Major | SP detected fault at time time "fault_type 'fault' at location asserted" | ALOM compatibility shell sends this message when a failure or a fault is detected. A fault is a lower priority condition that indicates the system is operating in a degraded mode. fault_type is the type of failure that has occurred, such as temperature, voltage, current, or power supply. The location is the location and name of the device that has the error condition. The location and name of the device match the output of the ALOM compatibility shell showenvironment command. This fault event message appears in the output of the ALOM compatibility shell showfaults command. |
| Minor | SP detected fault cleared at time time current fault at device asserted. | ALOM compatibility shell sends this message to indicate that a prior fault or failure has recovered or been repaired. The fields (time and device) are the same as in the prior fault or failure event. |
| Major | Device_type at location has exceeded low warning threshold. | ALOM compatibility shell sends this message and the five threshold messages that follow when analog measurement sensors have exceeded the specified threshold. The threshold that was exceeded is included in the message. Device_type is the type of device that has failed, such as VOLTAGE_SENSOR or TEMP_SENSOR. The location is the location and name of the device that has the error condition. The location and name of the device match the output of the ALOM compatibility shell showenvironment command. For TEMP_SENSOR events, this message could indicate a problem outside of the server, such as the temperature in the room or blocked airflow in or out of the server. For VOLTAGE_SENSOR events, this message indicates a problem with the platform hardware or possibly with add-on cards installed. These fault event messages appear in the output of the ALOM compatibility shell showfaults command. |
| Critical | Device_type at location has exceeded low critical shutdown threshold. | See the description of the low warning threshold message. |
| Critical | Device_type at location has exceeded low nonrecoverable shutdown threshold | See the description of the low warning threshold message. |
| Major | Device_type at location has exceeded high warning threshold | See the description of the low warning threshold message. |
| Critical | Device_type at location has exceeded high soft shutdown threshold | See the description of the low warning threshold message. |
| Critical | Device_type at location has exceeded high hard shutdown threshold | See the description of the low warning threshold message. |
| Minor | Device_type at location is within normal range. | ALOM compatibility shell sends this message when an analog measurement sensor no longer exceeds any warning or failure thresholds. This message is sent only if the sensor reading recovers sufficiently within the boundaries of the failure parameters. The message might not match the current output of the ALOM compatibility shell showenvironment command. |
| Critical | Critical temperature value: host should be shut down | ALOM compatibility shell sends this message to indicate that the SC has started a shutdown because there are not enough working fans to keep the system cooled. The number of fans necessary to maintain system cooling depends on the platform. See your platform documentation for more information. |
| Critical | Host system failed to power off. | ALOM compatibility shell sends this message if the SC is unable to power off the system. This message indicates a problem with either the platform hardware or the SC hardware. The system should be manually unplugged to prevent damage to the platform hardware. This fault event message appears in the output of the ALOM compatibility shell showfaults command. |
| Major | FRU_type at location has been removed. | ALOM compatibility shell sends this message or the inserted message that follows to indicate that a FRU has been removed or inserted. The field FRU_type indicates the type of FRU, such as SYS_FAN, PSU, or HDD. The field location indicates the location and name of the FRU, as shown in the output of the showenvironment command. |
| Minor | FRU_type at location has been inserted. | See the description of the FRU_type at location has been removed message. |
| Major | Input power unavailable for PSU at location. | ALOM compatibility shell sends this message to indicate that a power supply is not receiving input power. This message normally indicates that the power supply is not plugged in to AC power. If the power cords are plugged in to an outlet that is supplying power, this message indicates a problem with the power supply itself. This fault event message appears in the output of the ALOM compatibility shell showfaults command. |
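
The six threshold messages in this table differ only in direction (low or high) and threshold name, and each combination carries a fixed severity. The following sketch shows how a monitoring script might recover those fields from a message. It is an illustration only: the function name classify_threshold_event is hypothetical, the pattern assumes that Device_type and location contain no spaces, and the sensor locations in the examples (MB/T_AMB, MB/V_CORE) are made up.

```python
import re

# Severity for each (direction, threshold name) pair, as listed in Table D-2.
THRESHOLD_SEVERITY = {
    ("low", "warning"): "Major",
    ("low", "critical shutdown"): "Critical",
    ("low", "nonrecoverable shutdown"): "Critical",
    ("high", "warning"): "Major",
    ("high", "soft shutdown"): "Critical",
    ("high", "hard shutdown"): "Critical",
}

# Assumption: Device_type and location are single tokens without spaces.
THRESHOLD_RE = re.compile(
    r"^(?P<device_type>\S+) at (?P<location>\S+) has exceeded "
    r"(?P<direction>low|high) (?P<level>.+?) threshold\.?$"
)


def classify_threshold_event(message):
    """Return the parsed fields and table severity, or None if no match."""
    match = THRESHOLD_RE.match(message.strip())
    if match is None:
        return None
    fields = match.groupdict()
    severity = THRESHOLD_SEVERITY.get((fields["direction"], fields["level"]))
    if severity is None:
        return None  # threshold name not listed in the table
    fields["severity"] = severity
    return fields


# Hypothetical sensor location used only for this example.
event = classify_threshold_event(
    "TEMP_SENSOR at MB/T_AMB has exceeded high soft shutdown threshold."
)
print(event)
```

Messages outside this family, such as the within normal range message, return None.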


Host Monitoring Event Messages

The following table displays host monitoring event messages from the service processor (system controller).


TABLE D-3   Host Monitoring Event Messages
| Severity | Message | Description |
|---|---|---|
| Critical | SP detected fault at time time component disabled | ALOM compatibility shell sends this message when a component has been disabled, either automatically by POST discovering a fault or by a user typing the disablecomponent command. component is the disabled component, which is an entry in the output of the platform showcomponent command. This fault event message appears in the output of the ALOM compatibility shell showfaults command. |
| Minor | SP detected fault cleared at component reenabled | ALOM compatibility shell sends this message when a component is enabled. A component can be enabled by a user typing the enablecomponent command or by FRU replacement if the component itself is a FRU (such as a DIMM). component is the name of the component shown in the output of the platform showcomponent command. |
| Major | Host detected fault, MSGID: SUNW-MSG-ID | ALOM compatibility shell sends this message when the Solaris PSH software diagnoses a fault. The SUNW-MSG-ID of the fault is an ASCII identifier that can be entered at http://www.sun.com/msg for more information about the nature of the fault and the steps to repair. This fault event message appears in the output of the ALOM compatibility shell showfaults command. |
| Major | Location has been replaced;faults cleared. | ALOM compatibility shell sends this message after the replacement of a FRU that contained a host-detected fault. Location is the location and name of the FRU that was replaced. This event can be received at SC boot or after FRUs have been swapped and the chassis cover is closed. |
| Major | Existing faults detected in FRU_PROM at location. | ALOM compatibility shell sends this message to indicate that the SC has detected a new FRU with pre-existing faults logged in its FRU PROM. This event can occur when either a FRU or the SC card is moved from one system to another. The location is the name of the SEEPROM on the replaced FRU, such as MB/SEEPROM. The most recent existing fault is imported from the FRU PROM onto the showfaults list. The entry on the showfaults list is the imported fault, not this message. |
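
Because the Host detected fault message carries an identifier that is meant to be looked up, a script that scans saved event output may want to collect those identifiers. The following sketch is one possible approach, not a documented tool. The function name extract_msgids is hypothetical, the assumption that an identifier consists of letters, digits, and hyphens is inferred from the ASCII identifier wording above, and the identifiers in the sample lines are placeholders for illustration.

```python
import re

# Assumption: the identifier is a run of ASCII letters, digits, and hyphens.
MSGID_RE = re.compile(r"Host detected fault, MSGID:\s*(?P<msgid>[A-Za-z0-9-]+)")


def extract_msgids(lines):
    """Return the distinct message IDs found in the lines, in first-seen order."""
    found = []
    for line in lines:
        match = MSGID_RE.search(line)
        if match and match.group("msgid") not in found:
            found.append(match.group("msgid"))
    return found


# Sample lines with placeholder identifiers (illustrative, not real output).
sample = [
    "Host detected fault, MSGID: SUN4V-8000-E2",
    "System poweron is enabled.",
    "Host detected fault, MSGID: SUN4V-8000-E2",
    "Host detected fault, MSGID: EXAMPLE-0000-XX",
]

msgids = extract_msgids(sample)
print(msgids)
```

Each collected identifier can then be entered at the site named in the table to read about the fault and its repair steps.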