SNMP and PET Traps
This section describes the Simple Network Management Protocol (SNMP) and Platform Event Trap (PET) messages generated by devices that Oracle ILOM monitors.
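SNMP traps are generated by the SNMP agents enabled on the SNMP devices that Oracle ILOM manages. Oracle ILOM receives the SNMP traps and converts them into SNMP event messages that appear in the event log.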
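
Because most fault traps in the tables of this section have a matching "Cleared" trap with the same event message and sensor, an event-log reader can work out which faults are still open. The following is a minimal sketch of that idea, not part of Oracle ILOM: the function name `outstanding_faults`, the tuple layout, and the sample log are hypothetical, and only the "...Fault" / "...FaultCleared" naming pattern is assumed. Pairs such as `sunHwTrapComponentError` / `sunHwTrapComponentOk` are not handled.

```python
# Hypothetical helper (not an Oracle ILOM API): track which faults are still
# open in a sequence of SNMP event-log entries. It assumes only the
# "...Fault" / "...FaultCleared" naming pattern used in this section.

def outstanding_faults(entries):
    """Return {(event_message, sensor): trap_name} for faults not yet cleared.

    `entries` is an iterable of (trap_name, event_message, sensor) tuples
    in arrival order.
    """
    active = {}
    for trap_name, event_message, sensor in entries:
        key = (event_message, sensor)
        if trap_name.endswith("Cleared"):
            # A "...Cleared" trap closes the matching fault, if one is open.
            active.pop(key, None)
        else:
            active[key] = trap_name
    return active


# Illustrative log built from rows in the memory and power supply tables.
sample_log = [
    ("sunHwTrapMemoryFault", "fault.memory.channel.misconfigured", "/MB/P/D"),
    ("sunHwTrapPowerSupplyFault", "fault.chassis.env.power.loss", "/PS"),
    ("sunHwTrapMemoryFaultCleared", "fault.memory.channel.misconfigured", "/MB/P/D"),
]

for (message, sensor), trap in outstanding_faults(sample_log).items():
    print(f"{sensor}: {message} ({trap})")
```

With the sample log, only the power supply fault on `/PS` remains open, because the memory fault is followed by its cleared trap.
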
Table 6 Memory SNMP Events

| SNMP Trap Message | ILOM Event Message | Severity and Description | Sensor Name |
|---|---|---|---|
| sunHwTrapMemoryFault | fault.memory.channel.misconfigured | Major; a memory component is suspected of causing a fault. | /MB/P/D |
| sunHwTrapMemoryFaultCleared | fault.memory.channel.misconfigured | Informational; a memory component fault has been cleared. | /MB/P/D |
| sunHwTrapComponentFault | fault.memory.intel.dimm.none | Major; a memory component is suspected of causing a fault. | /MB |
| | fault.memory.conroller.inputinvalid | | |
| | fault.memory.controller.initfailed | | |
| | fault.memory.intel.dimm.population-invalid | | |
| sunHwTrapComponentFaultCleared | fault.memory.intel.dimm.none | Informational; a memory component fault has been cleared. | /MB |
| | fault.memory.conroller.inputinvalid | | |
| | fault.memory.controller.initfailed | | |
| | fault.memory.intel.dimm.population-invalid | | |
| sunHwTrapMemoryFault | fault.memory.intel.dimm.incompatible | Major; a memory component is suspected of causing a fault. | /MB/P/D |
| | fault.memory.intel.dimm.incompatible-maxranks | | |
| | fault.memory.intel.dimm.incompatible-quadrank | | |
| sunHwTrapMemoryFaultCleared | fault.memory.intel.dimm.incompatible | Informational; a memory component fault has been cleared. | /MB/P/D |
| | fault.memory.intel.dimm.incompatible-maxranks | | |
| | fault.memory.intel.dimm.incompatible-quadrank | | |

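Table 7 Environmental SNMP Events

| SNMP Trap Message | ILOM Event Message | Severity and Description | Sensor Name |
|---|---|---|---|
| sunHwTrapPowerSupplyFault | fault.chassis.env.power.loss | Major; a power supply component is suspected of causing a fault | /PS |
| sunHwTrapPowerSupplyFaultCleared | fault.chassis.env.power.loss | Informational; a power supply component fault has been cleared | /PS |
| sunHwTrapComponentFault | fault.chassis.env.temp.over-fail | Major; a component is suspected of causing a fault | /SYS/ |
| sunHwTrapComponentFaultCleared | fault.chassis.env.temp.over-fail | Informational; a component fault has been cleared | /SYS/ |
| sunHwTrapTempCritThresholdExceeded | Lower critical threshold exceeded | Major; a temperature sensor has reported that its value has gone above an upper critical threshold setting or below a lower critical threshold setting | /DBP/T_AMB |
| sunHwTrapTempCritThresholdDeasserted | Lower critical threshold no longer exceeded | Informational; a temperature sensor has reported that its value is within the normal operating range | /DBP/T_AMB |
| sunHwTrapTempNonCritThresholdExceeded | Upper noncritical threshold exceeded | Minor; a temperature sensor has reported that its value has gone above an upper non-critical threshold setting or below a lower non-critical threshold setting | /DBP/T_AMB |
| sunHwTrapTempOk | Upper noncritical threshold no longer exceeded | Informational; a temperature sensor has reported that its value is within the normal operating range | /DBP/T_AMB |
| sunHwTrapTempFatalThresholdExceeded | Lower fatal threshold exceeded | Critical; a temperature sensor has reported that its value has gone above an upper fatal threshold setting or below a lower fatal threshold setting | /DBP/T_AMB |
| sunHwTrapTempFatalThresholdDeasserted | Lower fatal threshold no longer exceeded | Informational; a temperature sensor has reported that its value has gone below an upper fatal threshold setting or above a lower fatal threshold setting | /DBP/T_AMB |
| sunHwTrapTempFatalThresholdExceeded | Upper fatal threshold exceeded | Critical; a temperature sensor has reported that its value has gone above an upper fatal threshold setting or below a lower fatal threshold setting | /T_AMB |
| sunHwTrapTempCritThresholdExceeded | Upper critical threshold exceeded | Major; a temperature sensor has reported that its value has gone above an upper critical threshold setting or below a lower critical threshold setting | /T_AMB |
| sunHwTrapTempCritThresholdDeasserted | Upper critical threshold no longer exceeded | Informational; a temperature sensor has reported that its value is within the normal operating range | /T_AMB |
| sunHwTrapTempFatalThresholdDeasserted | Upper fatal threshold no longer exceeded | Informational; a temperature sensor has reported that its value has gone below an upper fatal threshold setting or above a lower fatal threshold setting | /T_AMB |
| sunHwTrapComponentError | Assert | Major; a power supply sensor has detected an error | /HOT /PSn/Sn/V_OUT_OK |
| sunHwTrapComponentOk | Deassert | Informational; a power supply sensor has returned to its normal state | |
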
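Table 8 Device SNMP Events

| SNMP Trap Message | ILOM Event Message | Severity and Description | Sensor Name |
|---|---|---|---|
| sunHwTrapComponentFault | fault.chassis.device.missing | Major; a component is suspected of causing a fault | /SYS/ |
| sunHwTrapComponentFaultCleared | fault.chassis.device.missing | Informational; a component fault has been cleared | /SYS/ |
| sunHwTrapComponentFault | fault.chassis.device.fail | Major; a component is suspected of causing a fault | /CMM |
| sunHwTrapComponentFaultCleared | fault.chassis.device.fail | Informational; a component fault has been cleared | /CMM |
| sunHwTrapIOFault | fault.chassis.device.fail | Major; a component in the IO subsystem is suspected of causing a fault | /NEM |
| sunHwTrapIOFaultCleared | fault.chassis.device.fail | Informational; an IO subsystem component fault has been cleared | /NEM |
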
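Table 9 Power Supply SNMP Events

| SNMP Trap Message | ILOM Event Message | Severity and Description | Sensor Name |
|---|---|---|---|
| sunHwTrapPowerSupplyError | Assert | Major; a power supply sensor has detected an error | /PWRBS |
| sunHwTrapPowerSupplyOk | Deassert | Informational; a power supply sensor has returned to its normal state | /PWRBS |
| sunHwTrapPowerSupplyFault | fault.chassis.env.power.loss | Major; a power supply component is suspected of causing a fault | /PS |
| sunHwTrapPowerSupplyFaultCleared | fault.chassis.env.power.loss | Informational; a power supply component fault has been cleared | /PS |
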
Platform Event Trap (PET) events are generated by systems that have Alert Standard Format (ASF) or an IPMI baseboard management controller. PET events provide advance warning of possible system failures.
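
Since PET events serve as early warnings, a monitoring script would typically want the most severe ones first. The sketch below splits a "severity; description" cell, as used in the tables of this section, and sorts rows by severity. It is an illustration only: the names `SEVERITY_RANK`, `split_severity`, and `most_severe_first` are hypothetical, the severity labels are English renderings of those in the tables, and the relative order of Warning and Minor is an assumption.

```python
# Assumed ordering, most severe first. Treating "Warning" and "Minor" as
# equal is an assumption made for this sketch, not something the tables state.
SEVERITY_RANK = {
    "Critical": 0,
    "Major": 1,
    "Warning": 2,
    "Minor": 2,
    "Informational": 3,
}


def split_severity(cell):
    """Split a 'Severity; description' cell into (severity, description)."""
    severity, _, description = cell.partition(";")
    return severity.strip(), description.strip()


def most_severe_first(rows):
    """Sort (trap_name, severity_cell) rows by severity; unknown labels go last."""
    def rank(row):
        severity, _ = split_severity(row[1])
        return SEVERITY_RANK.get(severity, len(SEVERITY_RANK))
    return sorted(rows, key=rank)


# Illustrative rows taken from the system power events table.
sample_rows = [
    ("petTrapPowerSupplyStateAssertedAssert",
     "Informational; power supply is connected to AC power"),
    ("petTrapPowerUnitStateDeassertedAssert",
     "Critical; a run-time power fault has occurred"),
]

for trap_name, cell in most_severe_first(sample_rows):
    print(split_severity(cell)[0], trap_name)
```

Cells that carry only a severity label, such as the module transition rows, yield an empty description.
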
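Table 10 System Power Events

| PET Trap Message | ILOM Event Message | Severity and Description | Sensor Name |
|---|---|---|---|
| petTrapPowerUnitStateDeassertedAssert | PowerSupply sensor ASSERT | Critical; a run-time power fault has occurred | /PWRBS |
| petTrapPowerSupplyStateAssertedAssert | PowerSupply sensor DEASSERT | Informational; the power supply is connected to AC power | /PWRBS |
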
Table 11 Entity Presence Events

| PET Trap Message | ILOM Event Message | Severity and Description | Sensor Name |
|---|---|---|---|
| petTrapProcessorPresenceDetectedDeassert | EntityPresence Insert | Critical; a processor is absent or has been removed. | /HOSTPOWER /CMM/PRSNT /MB/REM/PRSNT /MB/FEM0/PRSNT /MB/FEM1/PRSNT /PEM0/PRSNT /PEM1/PRSNT /MB/P0/PRSNT /MB/P1/PRSNT /MB/P0/D0/PRSNT /MB/P0/D1/PRSNT /MB/P0/D2/PRSNT /MB/P0/D3/PRSNT /MB/P0/D4/PRSNT /MB/P0/D5/PRSNT /MB/P0/D6/PRSNT /MB/P0/D7/PRSNT /MB/P0/D8/PRSNT /MB/P1/D0/PRSNT /MB/P1/D1/PRSNT /MB/P1/D2/PRSNT /MB/P1/D3/PRSNT /MB/P1/D4/PRSNT /MB/P1/D5/PRSNT /MB/P1/D6/PRSNT /MB/P1/D7/PRSNT /MB/P1/D8/PRSNT /HDD0/PRSNT /HDD1/PRSNT /HDD2/PRSNT /HDD3/PRSNT /NEM0/PRSNT /NEM1/PRSNT /BL0/PRSNT /BL1/PRSNT /BL2/PRSNT /BL3/PRSNT /PS0/PRSNT /PS1/PRSNT /PS2/PRSNT /PS3/PRSNT |
| petTrapEntityPresenceDeviceInsertedAssert | EntityPresence Remove | Informational; a device is present or has been inserted | /HOSTPOWER /CMM/PRSNT /MB/REM/PRSNT /MB/FEM0/PRSNT /MB/FEM1/PRSNT /PEM0/PRSNT /PEM1/PRSNT /MB/P0/PRSNT /MB/P1/PRSNT /MB/P0/D0/PRSNT /MB/P0/D1/PRSNT /MB/P0/D2/PRSNT /MB/P0/D3/PRSNT /MB/P0/D4/PRSNT /MB/P0/D5/PRSNT /MB/P0/D6/PRSNT /MB/P0/D7/PRSNT /MB/P0/D8/PRSNT /MB/P1/D0/PRSNT /MB/P1/D1/PRSNT /MB/P1/D2/PRSNT /MB/P1/D3/PRSNT /MB/P1/D4/PRSNT /MB/P1/D5/PRSNT /MB/P1/D6/PRSNT /MB/P1/D7/PRSNT /MB/P1/D8/PRSNT /HDD0/PRSNT /HDD1/PRSNT /HDD2/PRSNT /HDD3/PRSNT /NEM0/PRSNT /NEM1/PRSNT /BL0/PRSNT /BL1/PRSNT /BL2/PRSNT /BL3/PRSNT /PS0/PRSNT /PS1/PRSNT /PS2/PRSNT /PS3/PRSNT |

Table 12 Environmental Events

| PET Trap Message | ILOM Event Message | Severity and Description | Sensor Name |
|---|---|---|---|
| petTrapTemperatureStateDeassertedDeassert | Temperature sensor ASSERT | Informational; a temperature event occurred | /HOT |
| petTrapTemperatureStateDeassertedDeassert | Temperature sensor DEASSERT | Critical; a temperature event occurred | /HOT |
| petTrapTemperatureUpperNonRecoverableGoingLowDeassert | Temperature Upper non-critical threshold has been exceeded | Major; the temperature has decreased below the upper non-recoverable threshold | /MB/T_AMB |
| petTrapTemperatureStateAssertedAssert | Temperature Upper non-critical threshold no longer exceeded | Critical; a temperature event occurred. Possible cause: the CPU is too hot. | /MB/T_AMB |
| petTrapTemperatureUpperCriticalGoingHigh | Temperature Lower fatal threshold has been exceeded | Major; the temperature has increased above the upper critical threshold | /MB/T_AMB |
| petTrapTemperatureUpperCriticalGoingLowDeassert | Temperature Lower fatal threshold no longer exceeded | Warning; the temperature has decreased below the upper critical threshold | /MB/T_AMB |
| petTrapTemperatureLowerNonCriticalGoingLow | petTrapTemperatureLowerNonCriticalGoingLow | Warning; the temperature has decreased below the lower non-critical threshold | /MB/T_AMB |
| Warning; the temperature has decreased below the lower non-critical threshold | Temperature Lower critical threshold no longer exceeded | Informational; the temperature has returned to normal | /MB/T_AMB |
| petTrapTemperatureUpperNonCriticalGoingHigh | Temperature Upper critical threshold has been exceeded | Warning; the temperature has increased above the upper non-critical threshold | /MB/T_AMB |
| petTrapTemperatureUpperNonCriticalGoingLowDeassert | Temperature Upper critical threshold no longer exceeded | Informational; the temperature has returned to normal | /MB/T_AMB |
| petTrapTemperatureLowerCriticalGoingLow | Temperature Lower fatal threshold has been exceeded | Major; the temperature has decreased below the lower critical threshold | /MB/T_AMB |
| petTrapTemperatureLowerCriticalGoingHighDeassert | Temperature Lower fatal threshold no longer exceeded | Warning; the temperature has increased above the lower critical threshold | /MB/T_AMB |
| petTrapTemperatureLowerNonRecoverableGoingHighDeassert | Temperature Lower non-critical threshold has been exceeded | Major; the temperature has increased above the lower non-recoverable threshold | /MB/T_AMB |
| petTrapTemperatureUpperNonRecoverableGoingHigh | Temperature Lower non-critical threshold no longer exceeded | Critical; the temperature has increased above the upper non-recoverable threshold | |

Table 13 Component, Device, and Firmware Events

| PET Trap Message | ILOM Event Message | Severity and Description | Sensor Name |
|---|---|---|---|
| petTrapOEMStateDeassertedAssert | petTrapOEMStateDeassertedAssert | Informational; a fault has occurred (OEM State Deasserted assert) | /MB/FEMn/FAULT |
| petTrapOEMPredictiveFailureAsserted | OEMReserved sensor DEASSERT | Major; an OEM predictive failure was asserted | /MB/FEMn/FAULT |
| petTrapOEMPredictiveFailureDeasserted | OEMReserved reporting Predictive Failure | Informational; the OEM predictive failure was deasserted | /CMM/ERR /NEMn/ERR /BLn/ERR |
| petTrapSystemFirmwareError | OEMReserved Return to normal | Informational; a system firmware error was reported | |
| petTrapModuleBoardTransitionToRunningAssert | Module Transition to Running assert | Informational | /NEMn/STATE /BLn/STATE |
| petTrapModuleBoardTransitionToInTestAssert | Module Transition to In Test assert | Informational | |
| petTrapModuleBoardTransitionToPowerOffAssert | Module Transition to Power Off assert | Informational | |
| petTrapModuleBoardTransitionToOnLineAssert | Module Transition to On Line assert | Informational | |
| Undocumented PET 1378820 | Module Transition to Off Line assert | Informational | |
| petTrapModuleBoardTransitionToOffDutyAssert | Module Transition to Off Duty assert | Informational | /NEMn/STATE /BLn/STATE |
| petTrapModuleBoardTransitionToDegradedAssert | Module Transition to Degraded assert | Informational | |
| petTrapModuleBoardTransitionToPowerSaveAssert | Module Transition to Power Save assert | Informational | |
| petTrapModuleBoardInstallErrorAssert | Module Install Error assert | Informational | |
| Undocumented PET 132097 | Voltage reporting Predictive Failure | Informational | /PSn/V_IN_ERR |
| Undocumented PET 132096 | Voltage Return to normal | Informational | |

Table 14 Power Supply Events

| PET Trap Message | ILOM Event Message | Severity and Description | Sensor Name |
|---|---|---|---|
| petTrapVoltageStateDeassertedDeassert | Voltage sensor ASSERT | Informational; a voltage event occurred | /PSn/V_OUT_OK |
| petTrapVoltageStateAssertedDeassert | Voltage sensor DEASSERT | | |

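Table 15 Fan Events

| PET Trap Message | ILOM Event Message | Severity and Description | Sensor Name |
|---|---|---|---|
| petTrapFanPredictiveFailureDeasserted | Fan reporting Predictive Failure | Informational; the fan predictive failure state has been cleared | /FMn/ERR |
| petTrapFanLowerNonRecoverableGoingLow | Fan Return to normal | Critical; the fan speed has decreased below the lower non-recoverable threshold. The fan has failed or been removed. | |
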
Related Information