SNMP and PET Traps
This section describes Simple Network Management Protocol (SNMP) and Platform Event Trap (PET)
messages that are generated by devices being monitored by Oracle ILOM.
SNMP traps are generated by SNMP agents that are enabled on the SNMP
devices being managed by Oracle ILOM. Oracle ILOM receives the SNMP traps and
converts them into SNMP event messages that appear in the event log.
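As a rough illustration of the conversion described above, the sketch below models one received trap as a record and renders it as a single event-log style line. The `TrapRecord` class, its field names, and the output format are hypothetical and are not taken from Oracle ILOM; only the trap name, event message, severity, and sensor values come from the tables in this section.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrapRecord:
    """One received trap (hypothetical structure, for illustration only)."""
    trap: str           # SNMP trap name, e.g. "sunHwTrapPowerSupplyFault"
    event_message: str  # message text recorded in the event log
    severity: str       # e.g. "Major" or "Informational"
    sensor: str         # sensor or component name, e.g. "/PS"


def to_event_log_line(record: TrapRecord) -> str:
    """Render a trap as one log line. The layout is an assumption, not ILOM's format."""
    return f"[{record.severity}] {record.trap}: {record.event_message} ({record.sensor})"


record = TrapRecord(
    trap="sunHwTrapPowerSupplyFault",
    event_message="fault.chassis.env.power.loss",
    severity="Major",
    sensor="/PS",
)
print(to_event_log_line(record))
```

Keeping the four table columns as separate fields makes it straightforward to filter or group events by severity or by sensor later.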
Table 7 Memory SNMP Events
| SNMP Trap Message | Oracle ILOM Event Message | Severity and Description | Sensor Name |
| --- | --- | --- | --- |
| sunHwTrapMemoryFault | fault.memory.channel.misconfigured | Major; A memory component is suspected of causing a fault. | /MB/P/D |
| sunHwTrapMemoryFaultCleared | fault.memory.channel.misconfigured | Informational; A memory component fault has been cleared. | /MB/P/D |
| sunHwTrapComponentFault | fault.memory.intel.dimm.none | Major; A memory component is suspected of causing a fault. | /MB |
| | fault.memory.controller.inputinvalid | | |
| | fault.memory.controller.initfailed | | |
| | fault.memory.intel.dimm.population-invalid | | |
| sunHwTrapComponentFaultCleared | fault.memory.intel.dimm.none | Informational; A memory component fault has been cleared. | /MB |
| | fault.memory.controller.inputinvalid | | |
| | fault.memory.controller.initfailed | | |
| | fault.memory.intel.dimm.population-invalid | | |
| sunHwTrapMemoryFault | fault.memory.intel.dimm.incompatible | Major; A memory component is suspected of causing a fault. | /MB/P/D |
| | fault.memory.intel.dimm.incompatible-maxranks | | |
| | fault.memory.intel.dimm.incompatible-quadrank | | |
| sunHwTrapMemoryFaultCleared | fault.memory.intel.dimm.incompatible | Informational; A memory component fault has been cleared. | /MB/P/D |
| | fault.memory.intel.dimm.incompatible-maxranks | | |
| | fault.memory.intel.dimm.incompatible-quadrank | | |
Table 8 Environmental SNMP Events
| SNMP Trap Message | Oracle ILOM Event Message | Severity and Description | Sensor Name |
| --- | --- | --- | --- |
| sunHwTrapPowerSupplyFault | fault.chassis.env.power.loss | Major; A power supply component is suspected of causing a fault | /PS |
| sunHwTrapPowerSupplyFaultCleared | fault.chassis.env.power.loss | Informational; A power supply component fault has been cleared | /PS |
| sunHwTrapComponentFault | fault.chassis.env.temp.over-fail | Major; A component is suspected of causing a fault | /SYS/ |
| sunHwTrapComponentFaultCleared | fault.chassis.env.temp.over-fail | Informational; A component fault has been cleared | /SYS/ |
| sunHwTrapTempCritThresholdExceeded | Lower critical threshold exceeded | Major; A temperature sensor has reported that its value has gone above an upper critical threshold setting or below a lower critical threshold setting | /DBP/T_AMB |
| sunHwTrapTempCritThresholdDeasserted | Lower critical threshold no longer exceeded | Informational; A temperature sensor has reported that its value is in the normal operating range | /DBP/T_AMB |
| sunHwTrapTempNonCritThresholdExceeded | Upper noncritical threshold exceeded | Minor; A temperature sensor has reported that its value has gone above an upper critical threshold setting or below a lower critical threshold setting | /DBP/T_AMB |
| sunHwTrapTempOk | Upper noncritical threshold no longer exceeded | Informational; A temperature sensor has reported that its value is in the normal operating range | /DBP/T_AMB |
| sunHwTrapTempFatalThresholdExceeded | Lower fatal threshold exceeded | Critical; A temperature sensor has reported that its value has gone above an upper fatal threshold setting or below a lower fatal threshold setting | /DBP/T_AMB |
| sunHwTrapTempFatalThresholdDeasserted | Lower fatal threshold no longer exceeded | Informational; A temperature sensor has reported that its value has gone below an upper fatal threshold setting or above a lower fatal threshold setting | /DBP/T_AMB |
| sunHwTrapTempFatalThresholdExceeded | Upper fatal threshold exceeded | Critical; A temperature sensor has reported that its value has gone above an upper fatal threshold setting or below a lower fatal threshold setting | /T_AMB |
| sunHwTrapTempCritThresholdExceeded | Upper critical threshold exceeded | Major; A temperature sensor has reported that its value has gone above an upper critical threshold setting or below a lower critical threshold setting | /T_AMB |
| sunHwTrapTempCritThresholdDeasserted | Upper critical threshold no longer exceeded | Informational; A temperature sensor has reported that its value is in the normal operating range | /T_AMB |
| sunHwTrapTempFatalThresholdDeasserted | Upper fatal threshold no longer exceeded | Informational; A temperature sensor has reported that its value has gone below an upper fatal threshold setting or above a lower fatal threshold setting | /T_AMB |
| sunHwTrapComponentError | Assert | Major; A power supply sensor has detected an error | /HOT /PSn/Sn/V_OUT_OK |
| sunHwTrapComponentOk | Deassert | Informational; A power supply sensor has returned to its normal state | |
Table 9 Device SNMP Events
| SNMP Trap Message | Oracle ILOM Event Message | Severity and Description | Sensor Name |
| --- | --- | --- | --- |
| sunHwTrapComponentFault | fault.chassis.device.missing | Major; A component is suspected of causing a fault | /SYS/ |
| sunHwTrapComponentFaultCleared | fault.chassis.device.missing | Informational; A component fault has been cleared | /SYS/ |
| sunHwTrapComponentFault | fault.chassis.device.fail | Major; A component is suspected of causing a fault | /CMM |
| sunHwTrapComponentFaultCleared | fault.chassis.device.fail | Informational; A component fault has been cleared | /CMM |
| sunHwTrapIOFault | fault.chassis.device.fail | Major; A component in the IO subsystem is suspected of causing a fault | /NEM |
| sunHwTrapIOFaultCleared | fault.chassis.device.fail | Informational; An IO subsystem component fault has been cleared | /NEM |
Table 10 Power Supply SNMP Events
| SNMP Trap Message | Oracle ILOM Event Message | Severity and Description | Sensor Name |
| --- | --- | --- | --- |
| sunHwTrapPowerSupplyError | Assert | Major; A power supply sensor has detected an error | /PWRBS |
| sunHwTrapPowerSupplyOk | Deassert | Informational; A power supply sensor has returned to its normal state | /PWRBS |
| sunHwTrapPowerSupplyFault | fault.chassis.env.power.loss | Major; A power supply component is suspected of causing a fault | /PS |
| sunHwTrapPowerSupplyFaultCleared | fault.chassis.env.power.loss | Informational; A power supply component fault has been cleared | /PS |
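The SNMP tables above list traps in pairs: a fault or error trap, followed by the trap that reports the condition as cleared. The sketch below shows one way a monitoring script might use that pairing to work out which conditions are still open after a sequence of traps. The `CLEARING_TRAP` mapping is read off the table rows, while `open_conditions` and its input format are hypothetical helpers, not part of Oracle ILOM or any SNMP library.

```python
# Asserting trap -> clearing trap, as paired in the SNMP tables above.
CLEARING_TRAP = {
    "sunHwTrapMemoryFault": "sunHwTrapMemoryFaultCleared",
    "sunHwTrapComponentFault": "sunHwTrapComponentFaultCleared",
    "sunHwTrapPowerSupplyFault": "sunHwTrapPowerSupplyFaultCleared",
    "sunHwTrapIOFault": "sunHwTrapIOFaultCleared",
    "sunHwTrapComponentError": "sunHwTrapComponentOk",
    "sunHwTrapPowerSupplyError": "sunHwTrapPowerSupplyOk",
}
# Reverse lookup: clearing trap -> asserting trap.
ASSERTING_TRAP = {clear: fault for fault, clear in CLEARING_TRAP.items()}


def open_conditions(traps):
    """Replay (trap name, sensor) pairs in order and return those still asserted.

    Traps that appear in neither mapping are ignored.
    """
    still_open = set()
    for name, sensor in traps:
        if name in CLEARING_TRAP:
            still_open.add((name, sensor))
        elif name in ASSERTING_TRAP:
            still_open.discard((ASSERTING_TRAP[name], sensor))
    return still_open


history = [
    ("sunHwTrapPowerSupplyFault", "/PS"),
    ("sunHwTrapIOFault", "/NEM"),
    ("sunHwTrapPowerSupplyFaultCleared", "/PS"),
]
print(open_conditions(history))
```

In this example the power supply fault on /PS is cleared by its matching trap, so only the IO fault on /NEM remains open.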
Platform Event Trap (PET) events are generated by systems with Alert Standard Format
(ASF) or an IPMI baseboard management controller. The PET events provide advance warning
of possible system failures.
Table 11 System Power Events
| PET Trap Message | Oracle ILOM Event Message | Severity and Description | Sensor Name |
| --- | --- | --- | --- |
| petTrapPowerUnitStateDeassertedAssert | PowerSupply sensor ASSERT | Critical; A run-time power fault has occurred | /PWRBS |
| petTrapPowerSupplyStateAssertedAssert | PowerSupply sensor DEASSERT | Informational; Power supply is connected to AC Power | /PWRBS |
Table 12 Entity Present Events
| PET Trap Message | Oracle ILOM Event Message | Severity and Description | Sensor Name |
| --- | --- | --- | --- |
| petTrapProcessorPresenceDetectedDeassert | EntityPresence Insert | Critical; A processor is absent or has been removed. | /HOSTPOWER /CMM/PRSNT /MB/REM/PRSNT /MB/FEM0/PRSNT /MB/FEM1/PRSNT /PEM0/PRSNT /PEM1/PRSNT /MB/P0/PRSNT /MB/P1/PRSNT /MB/P0/D0/PRSNT /MB/P0/D1/PRSNT /MB/P0/D2/PRSNT /MB/P0/D3/PRSNT /MB/P0/D4/PRSNT /MB/P0/D5/PRSNT /MB/P0/D6/PRSNT /MB/P0/D7/PRSNT /MB/P0/D8/PRSNT /MB/P1/D0/PRSNT /MB/P1/D1/PRSNT /MB/P1/D2/PRSNT /MB/P1/D3/PRSNT /MB/P1/D4/PRSNT /MB/P1/D5/PRSNT /MB/P1/D6/PRSNT /MB/P1/D7/PRSNT /MB/P1/D8/PRSNT /HDD0/PRSNT /HDD1/PRSNT /HDD2/PRSNT /HDD3/PRSNT /NEM0/PRSNT /NEM1/PRSNT /BL0/PRSNT /BL1/PRSNT /BL2/PRSNT /BL3/PRSNT /PS0/PRSNT /PS1/PRSNT /PS2/PRSNT /PS3/PRSNT |
| petTrapEntityPresenceDeviceInsertedAssert | EntityPresence Remove | Informational; A device is present or has been inserted | /HOSTPOWER /CMM/PRSNT /MB/REM/PRSNT /MB/FEM0/PRSNT /MB/FEM1/PRSNT /PEM0/PRSNT /PEM1/PRSNT /MB/P0/PRSNT /MB/P1/PRSNT /MB/P0/D0/PRSNT /MB/P0/D1/PRSNT /MB/P0/D2/PRSNT /MB/P0/D3/PRSNT /MB/P0/D4/PRSNT /MB/P0/D5/PRSNT /MB/P0/D6/PRSNT /MB/P0/D7/PRSNT /MB/P0/D8/PRSNT /MB/P1/D0/PRSNT /MB/P1/D1/PRSNT /MB/P1/D2/PRSNT /MB/P1/D3/PRSNT /MB/P1/D4/PRSNT /MB/P1/D5/PRSNT /BL0/PRSNT /MB/P1/D6/PRSNT /MB/P1/D7/PRSNT /MB/P1/D8/PRSNT /HDD0/PRSNT /HDD1/PRSNT /HDD2/PRSNT /HDD3/PRSNT /NEM0/PRSNT /NEM1/PRSNT /BL1/PRSNT /BL2/PRSNT /BL3/PRSNT /PS0/PRSNT /PS1/PRSNT /PS2/PRSNT /PS3/PRSNT |
Table 13 Environmental Events
| PET Trap Message | Oracle ILOM Event Message | Severity and Description | Sensor Name |
| --- | --- | --- | --- |
| petTrapTemperatureStateDeassertedDeassert | Temperature sensor ASSERT | Informational; Temperature event occurred | /HOT |
| petTrapTemperatureStateDeassertedDeassert | Temperature sensor DEASSERT | Critical; Temperature event occurred | /HOT |
| petTrapTemperatureUpperNonRecoverableGoingLowDeassert | Temperature Upper non-critical threshold has been exceeded | Major; Temperature has decreased below upper non-recoverable threshold | /MB/T_AMB |
| petTrapTemperatureStateAssertedAssert | Temperature Upper non-critical threshold no longer exceeded | Critical; Temperature event occurred. Possible cause: CPU is too hot. | /MB/T_AMB |
| petTrapTemperatureUpperCriticalGoingHigh | Temperature Lower fatal threshold has been exceeded | Major; Temperature has increased above upper critical threshold | /MB/T_AMB |
| petTrapTemperatureUpperCriticalGoingLowDeassert | Temperature Lower fatal threshold no longer exceeded | Warning; Temperature has decreased below upper critical threshold | /MB/T_AMB |
| petTrapTemperatureLowerNonCriticalGoingLow | | Warning; Temperature has decreased below lower non-critical threshold | /MB/T_AMB |
| | Temperature Lower critical threshold no longer exceeded | Informational; Temperature has returned to normal | /MB/T_AMB |
| petTrapTemperatureUpperNonCriticalGoingHigh | Temperature Upper critical threshold has been exceeded | Warning; Temperature has increased above upper non-critical threshold | /MB/T_AMB |
| petTrapTemperatureUpperNonCriticalGoingLowDeassert | Temperature Upper critical threshold no longer exceeded | Informational; Temperature has returned to normal | /MB/T_AMB |
| petTrapTemperatureLowerCriticalGoingLow | Temperature Lower fatal threshold has been exceeded | Major; Temperature has decreased below lower critical threshold | /MB/T_AMB |
| petTrapTemperatureLowerCriticalGoingHighDeassert | Temperature Lower fatal threshold no longer exceeded | Warning; Temperature has increased above lower critical threshold | /MB/T_AMB |
| petTrapTemperatureLowerNonRecoverableGoingHighDeassert | Temperature Lower non-critical threshold has been exceeded | Major; Temperature has increased above lower non-recoverable threshold | /MB/T_AMB |
| petTrapTemperatureUpperNonRecoverableGoingHigh | Temperature Lower non-critical threshold no longer exceeded | Critical; Temperature has increased above upper non-recoverable threshold | |
Table 14 Component, Device, and Firmware Events
| PET Trap Message | Oracle ILOM Event Message | Severity and Description | Sensor Name |
| --- | --- | --- | --- |
| petTrapOEMStateDeassertedAssert | | Informational; A fault has occurred (OEM State Deasserted assert) | /MB/FEMn/FAULT |
| petTrapOEMPredictiveFailureAsserted | OEMReserved sensor DEASSERT | Major; OEM Predictive Failure Asserted | /MB/FEMn/FAULT |
| petTrapOEMPredictiveFailureDeasserted | OEMReserved reporting Predictive Failure | Informational; OEM Predictive Failure Deasserted | /CMM/ERR /NEMn/ERR /BLn/ERR |
| petTrapSystemFirmwareError | OEMReserved Return to normal | Informational; System Firmware Error reported | |
| petTrapModuleBoardTransitionToRunningAssert | Module Transition to Running assert | Informational | /NEMn/STATE /BLn/STATE |
| petTrapModuleBoardTransitionToInTestAssert | Module Transition to In Test assert | Informational | |
| petTrapModuleBoardTransitionToPowerOffAssert | Module Transition to Power Off assert | Informational | |
| petTrapModuleBoardTransitionToOnLineAssert | Module Transition to On Line assert | Informational | |
| Undocumented PET 1378820 | Module Transition to Off Line assert | Informational | |
| petTrapModuleBoardTransitionToOffDutyAssert | Module Transition to Off Duty assert | Informational | /NEMn/STATE /BLn/STATE |
| petTrapModuleBoardTransitionToDegradedAssert | Module Transition to Degraded assert | Informational | |
| petTrapModuleBoardTransitionToPowerSaveAssert | Module Transition to Power Save assert | Informational | |
| petTrapModuleBoardInstallErrorAssert | Module Install Error assert | Informational | |
| Undocumented PET 132097 | Voltage reporting Predictive Failure | Informational | /PSn/V_IN_ERR |
| Undocumented PET 132096 | Voltage Return to normal | Informational | |
Table 15 Power Supply Events
| PET Trap Message | Oracle ILOM Event Message | Severity and Description | Sensor Name |
| --- | --- | --- | --- |
| petTrapVoltageStateDeassertedDeassert | Voltage sensor ASSERT | Informational; Voltage event occurred | /PSn/V_OUT_OK |
| petTrapVoltageStateAssertedDeassert | Voltage sensor DEASSERT | | |
Table 16 Fan Events
| PET Trap Message | Oracle ILOM Event Message | Severity and Description | Sensor Name |
| --- | --- | --- | --- |
| petTrapFanPredictiveFailureDeasserted | Fan reporting Predictive Failure | Informational; Fan Predictive Failure state has been cleared | /FMn/ERR |
| petTrapFanLowerNonRecoverableGoingLow | Fan Return to normal | Critical; Fan speed has decreased below lower non-recoverable threshold. Fan failed or removed. | |
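The PET tables above record severity and description in a single cell, separated by a semicolon (for example, "Critical; A run-time power fault has occurred"). A minimal sketch of splitting such a cell and picking the most severe entry from a batch follows. The ranking in `SEVERITY_RANK` is an assumed ordering of the severity labels that appear in this section, not one defined by Oracle ILOM, and both function names are hypothetical.

```python
# Assumed ordering, least to most severe. Not taken from Oracle documentation.
SEVERITY_RANK = {
    "Informational": 0,
    "Warning": 1,
    "Minor": 2,
    "Major": 3,
    "Critical": 4,
}


def split_severity(cell: str):
    """Split 'Severity; description' into (severity, description).

    A cell holding only a severity, such as 'Informational', yields an
    empty description.
    """
    severity, _, description = cell.partition(";")
    return severity.strip(), description.strip()


def most_severe(cells):
    """Return the (severity, description) pair with the highest assumed rank.

    Raises KeyError if a cell uses a severity label missing from SEVERITY_RANK.
    """
    return max((split_severity(c) for c in cells),
               key=lambda pair: SEVERITY_RANK[pair[0]])


cells = [
    "Informational; Power supply is connected to AC Power",
    "Critical; A run-time power fault has occurred",
    "Informational",
]
print(most_severe(cells))
```

Using `str.partition` rather than `str.split` keeps any later semicolons inside the description intact.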