12 Integrated Lights Out Manager Metrics

This chapter provides information about the Integrated Lights Out Manager (ILOM) metrics.

For each metric, it provides the following information:

  • Description

  • Metric table

    The metric table can include some or all of the following: target version, default collection frequency, default warning threshold, default critical threshold, and alert text.

The Oracle ILOM plug-in monitors the Oracle ILOM service processor in a compute node for hardware events and records sensor data to the Oracle Enterprise Manager Repository.

The ILOM plug-in is deployed to the Oracle Management Agent on the first compute node in an Oracle Database system. Only that Management Agent communicates with the Oracle Management Server and Repository for all ILOM database server service processors in the Oracle Database system.

12.1 Component Fault

This metric category describes component failure alerts.

12.1.1 Fault Status

This metric provides the component failure status.

The following table shows how often the metric's value is collected and compared against the default thresholds.

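Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      Not Defined                CRITICAL                    Component %ComponentName% has a fault.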
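The alert text in these tables contains placeholders such as %ComponentName% that are replaced with values from the collected metric row. The following Python sketch illustrates that substitution under stated assumptions: the render_alert helper and the sample component name are hypothetical and are not part of the plug-in.

```python
import re

# Matches placeholders of the form %Name% as they appear in the alert text.
_PLACEHOLDER = re.compile(r"%(\w+)%")


def render_alert(template: str, values: dict) -> str:
    """Replace each %Name% token with values[Name]; leave unknown tokens as-is."""
    def _substitute(match: re.Match) -> str:
        key = match.group(1)
        return str(values.get(key, match.group(0)))

    return _PLACEHOLDER.sub(_substitute, template)


# Illustrative value only; real component names come from the ILOM.
message = render_alert(
    "Component %ComponentName% has a fault.",
    {"ComponentName": "/SYS/MB/P0"},
)
print(message)  # Component /SYS/MB/P0 has a fault.
```

Leaving unknown placeholders untouched makes a missing value visible in the rendered message instead of silently dropping it.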

12.2 Fan Fault

This metric category describes fan failure alerts.

12.2.1 Fault Status

This metric provides the fan failure status.

The following table shows how often the metric's value is collected and compared against the default thresholds.

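Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      Not Defined                CRITICAL                    The fan %FanName% has a fault.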

12.3 Fan Sensors

This metric category describes the fan sensor metrics.

12.3.1 Sensor Speed (RPM)

This is the speed of the ILOM fan, in revolutions per minute (RPM).

The following table shows how often the metric's value is collected and compared against the default thresholds.

Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      Not Defined                -1                          Fan %SensorName% encountered a fault: %SensorSpeed% (-1: Predictive Failure, -2: Fan Missing/Removed, -3: Fan Not Readable/Not Present, -4: General Fault, -5: Not Spinning/Obstructed).
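The alert text lists the negative codes that the Sensor Speed value can carry when a fan is faulted. As a minimal sketch, the following Python snippet decodes those codes. It assumes that non-negative values are ordinary RPM readings, which the table does not state explicitly; the dictionary and function names are hypothetical.

```python
# Code meanings are copied from the alert text; everything else is illustrative.
FAN_FAULT_CODES = {
    -1: "Predictive Failure",
    -2: "Fan Missing/Removed",
    -3: "Fan Not Readable/Not Present",
    -4: "General Fault",
    -5: "Not Spinning/Obstructed",
}


def describe_fan_speed(value: int) -> str:
    """Return a readable description of a Sensor Speed value."""
    if value in FAN_FAULT_CODES:
        return f"fault: {FAN_FAULT_CODES[value]}"
    if value < 0:
        # Negative value that the alert text does not document.
        return f"fault: unknown code {value}"
    # Assumption: non-negative values are speeds in RPM.
    return f"{value} RPM"


for sample in (4200, -2, -5):
    print(sample, "->", describe_fan_speed(sample))
```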

Data Source

The data for this metric is collected using the operating system (OS) line token fetchlet by running the FanSensorStatus.pl script.

User Action

No user action is required.

12.3.2 Sensor State

This metric reports the status of the ILOM fan.

The following table shows how often the metric's value is collected and compared against the default thresholds.

Target Version                       12c
Evaluation and Collection Frequency  Every 5 Minutes
Default Warning Threshold            FAULT_DIAGNOSED|FAULT_SUSPECTED|WARNING
Default Critical Threshold           CRITICAL|ERROR|FAILED|FAULTED|NOT_PRESENT|NON_RECOVERABLE|PREDICTIVE_FAILURE_ASSERTED|LOWER_CRITICAL|UPPER_CRITICAL|LOWER_NON_RECOVERABLE|UPPER_NON_RECOVERABLE
Alert Text                           Fan %SensorName% rotating at %SensorSpeed%(rpm) encountered a fault: %SensorState%
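The warning and critical thresholds for this metric are lists of state names separated by vertical bars. The following Python sketch shows one way to classify a reported state against them. It assumes that each threshold behaves as an alternation matched against the whole state string and that the critical list takes precedence; the actual matching semantics used by Enterprise Manager may differ, and classify_state is a hypothetical name.

```python
import re

# Threshold strings copied verbatim from the table above.
WARNING_PATTERN = re.compile(r"FAULT_DIAGNOSED|FAULT_SUSPECTED|WARNING")
CRITICAL_PATTERN = re.compile(
    r"CRITICAL|ERROR|FAILED|FAULTED|NOT_PRESENT|NON_RECOVERABLE|"
    r"PREDICTIVE_FAILURE_ASSERTED|LOWER_CRITICAL|UPPER_CRITICAL|"
    r"LOWER_NON_RECOVERABLE|UPPER_NON_RECOVERABLE"
)


def classify_state(state: str) -> str:
    """Return 'critical', 'warning', or 'clear' for a Sensor State value."""
    # Assumption: critical is checked first, and the whole string must match.
    if CRITICAL_PATTERN.fullmatch(state):
        return "critical"
    if WARNING_PATTERN.fullmatch(state):
        return "warning"
    return "clear"


for sample in ("OK", "FAULT_SUSPECTED", "LOWER_CRITICAL"):
    print(sample, "->", classify_state(sample))
```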

Data Source

The data for this metric is collected using the OS line token fetchlet by running the FanSensorStatus.pl script.

User Action

No user action is required.

12.4 Hard Disk Status

The metrics in this category provide information about the hard disk status.

12.4.1 Fault Status (0 - cleared, 1 - critical)

This metric reports the status of the hard disk. 0 indicates Cleared, 1 indicates Critical.

The following table shows how often the metric's value is collected and compared against the default thresholds.

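Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      Not Defined                1                           The hard disk %HardDiskName% has a fault. Fault code is %FaultCode%.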

Data Source

The data for this metric is collected using the OS line token fetchlet by running the NodeStatusCheck.pl - ping script.

User Action

No user action is required.

12.5 HCA Port State (For Alerts)

The metrics in this category describe the host channel adapter (HCA) port state.

12.5.1 Is Port Disabled?

This metric indicates whether the HCA port is disabled.

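Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      Not Defined                1                           Port %PortNumber%(%ca_disp_name%) is disabled.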

12.5.2 Is Port in 'polling' state?

This metric indicates whether the HCA port is checking or polling for a peer port.

Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      Not Defined                1                           Port %PortNumber%(%ca_disp_name%) is polling for peer port. This could happen when the cable is unplugged from one of the ends or the other end port is disabled.

12.6 ILOM Temperatures

This metric category contains the ILOM temperature metrics.

12.6.1 Inlet Ambient Temperature

This metric shows the inlet ambient temperature for the ILOM target in degrees Celsius.

The following table shows how often the metric's value is collected and compared against the default thresholds.

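Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      Not Defined                Not Defined                 Current Inlet temperature for ILOM %target% is %value% degree Celcius.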

12.6.2 Outlet Ambient Temperature

This metric shows the outlet ambient temperature for the ILOM target in degrees Celsius.

The following table shows how often the metric's value is collected and compared against the default thresholds.

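Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      Not Defined                Not Defined                 Current Outlet temperature for ILOM %target% is %value% degree Celcius.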

12.6.3 System Ambient Temperature

This metric shows the system ambient temperature for the ILOM target in degrees Celsius.

The following table shows how often the metric's value is collected and compared against the default thresholds.

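Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      Not Defined                Not Defined                 Current System temperature for ILOM %target% is %value% degree Celcius.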
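The three temperature metrics in this category ship with both default thresholds set to Not Defined, so a reading is compared only against thresholds that an administrator supplies. The following Python sketch models that behavior under stated assumptions: the evaluate function, the greater-than-or-equal comparison, and the sample threshold values are all hypothetical and are not defaults of the plug-in.

```python
from typing import Optional


def evaluate(reading: float,
             warning: Optional[float] = None,
             critical: Optional[float] = None) -> str:
    """Classify a temperature reading; None means the threshold is Not Defined."""
    # Assumption: a reading at or above a threshold violates it.
    if critical is not None and reading >= critical:
        return "critical"
    if warning is not None and reading >= warning:
        return "warning"
    return "clear"


# With no thresholds defined, no reading produces an alert.
print(evaluate(41.0))
# Illustrative thresholds chosen for this example only.
print(evaluate(41.0, warning=35.0, critical=45.0))
```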

12.7 Memory Fault

This metric category contains the memory failure alert metric.

12.7.1 Fault Status

This metric provides the memory failure status.

The following table shows how often the metric's value is collected and compared against the default thresholds.

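Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      Not Defined                CRITICAL                    Memory %MemoryName% has a fault.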

12.8 Processor Fault

This metric category contains the CPU failure alert metric.

12.8.1 Fault Status

This metric provides the CPU failure status.

The following table shows how often the metric's value is collected and compared against the default thresholds.

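Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      Not Defined                CRITICAL                    The processor %ProcessorName% has a fault.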

12.9 Sensor Alerts

This metric category contains the sensor alert metrics.

12.9.1 Current Sensor Description

This metric provides a description of the current sensor status.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSensorAlerts script.

User Action

No user action is required.

12.9.2 Current Sensor Status

This metric shows the current status of the ILOM sensor.

The following table shows how often the metric's value is collected and compared against the default thresholds.

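Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      WARNING - *                CRITICAL - *                Current sensor(s) at level - %CurrentStatusDesc%.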

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSensorAlerts script.

User Action

No user action is required.

12.9.3 Fan Sensor Status

This metric shows the status of the sensor for the ILOM fan.

The following table shows how often the metric's value is collected and compared against the default thresholds.

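Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      WARNING - *                CRITICAL - *                Fan sensor(s) at level - %FanStatusDesc%.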

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSensorAlerts script.

User Action

No user action is required.

12.9.4 Fan Sensor Status Description

This metric shows the description of the fan sensor status.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSensorAlerts script.

User Action

No user action is required.

12.9.5 Power Supply Sensor Description

This metric shows the description of the power supply sensor status.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSensorAlerts script.

User Action

No user action is required.

12.9.6 Power Supply Sensor Status

This metric shows the status of the sensor for the ILOM power supply.

The following table shows how often the metric's value is collected and compared against the default thresholds.

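Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      WARNING - *                CRITICAL - *                Power supply sensor(s) at level - %PowerSupplyStatusDesc%.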

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSensorAlerts script.

User Action

No user action is required.

12.9.7 Temperature Sensor Description

This metric shows the description of the temperature sensor status.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSensorAlerts script.

User Action

No user action is required.

12.9.8 Temperature Sensor Status

This metric shows the status of the sensor for the ILOM temperature.

The following table shows how often the metric's value is collected and compared against the default thresholds.

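Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      WARNING - *                CRITICAL - *                Temperature sensor(s) at level - %TemperatureStatusDesc%.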

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSensorAlerts script.

User Action

No user action is required.

12.9.9 Voltage Sensor Description

This metric provides a description of the voltage sensor status.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSensorAlerts script.

User Action

No user action is required.

12.9.10 Voltage Sensor Status

This metric shows the status of the sensor for ILOM voltage.

The following table shows how often the metric's value is collected and compared against the default thresholds.

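Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 5 Minutes                      WARNING - *                CRITICAL - *                Voltage sensor(s) at level - %VoltageStatusDesc%.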

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSensorAlerts script.

User Action

No user action is required.

12.10 Service Processor Information

This metric category contains the service processor information metrics.

12.10.1 Check Physical Presence

This metric provides a flag that indicates whether a user must press the Locator button on the physical system to recover the ILOM administrator password.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSpInformation script.

User Action

No user action is required.

12.10.2 Host Name

This metric provides an ILOM host name as a method of network identification.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSpInformation script.

User Action

No user action is required.

12.10.3 Reset to Defaults

This metric provides a flag that indicates whether the system has been told to reset to defaults.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSpInformation script.

User Action

No user action is required.

12.10.4 System Contact

This metric provides a contact person and method of contact for the ILOM.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSpInformation script.

User Action

No user action is required.

12.10.5 System Description

This metric provides a description of this ILOM.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSpInformation script.

User Action

No user action is required.

12.10.6 System Identifier

This metric provides an ILOM system identifier property, which helps identify the managed device in the payload element of an SNMP trap.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSpInformation script.

User Action

No user action is required.

12.10.7 System Location

This metric shows the physical location of the ILOM.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the GetSpInformation script.

User Action

No user action is required.

12.11 Temperature Sensors

This metric category contains the temperature sensor metrics.

12.11.1 Sensor Reading (degree C)

This metric shows the ILOM temperature, in degrees Celsius.

The following table shows how often the metric's value is collected and compared against the default thresholds.

Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 4 Minutes                      Not Defined                Not Defined                 The temperature sensor %SensorName% operating at %SensorReading%(degree C) has exceeded its threshold.

Data Source

The data for this metric is collected using the OS line token fetchlet by running the TempSensorStatus.pl script.

User Action

No user action is required.

12.11.2 Sensor State

This metric shows the status of the ILOM temperature.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 4 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the TempSensorStatus.pl script.

User Action

No user action is required.

12.12 Voltage Sensors

This metric category contains the voltage sensor metrics.

12.12.1 Sensor Reading (Volts)

This metric reports the ILOM voltage reading, in volts.

The following table shows how often the metric's value is collected and compared against the default thresholds.

Target Version  Evaluation and Collection Frequency  Default Warning Threshold  Default Critical Threshold  Alert Text
--------------  -----------------------------------  -------------------------  --------------------------  ----------
12c             Every 4 Minutes                      Not Defined                Not Defined                 The voltage sensor %SensorName% operating at %SensorReading%(Volts) has exceeded its threshold.

Data Source

The data for this metric is collected using the OS line token fetchlet by running the VoltSensorStatus.pl script.

User Action

No user action is required.

12.12.2 Sensor State

This metric shows the status of the ILOM voltage.

The following table shows how often the metric's value is collected.

Target Version  Collection Frequency
--------------  --------------------
12c             Every 5 Minutes

Data Source

The data for this metric is collected using the OS line token fetchlet by running the VoltSensorStatus.pl script.

User Action

No user action is required.