6 Systems Infrastructure Switch

This chapter provides information about the Systems Infrastructure Switch metrics. For each metric, it provides the following information:
  • Description

  • Metric table

The metric table can include some or all of the following: target version, default collection frequency, default warning threshold, default critical threshold, and alert text.

Access Point Response

The metric in this category provides information about the status of the access point.

There is no collection frequency defined for these metrics because the Access Point Response metrics are invoked by the platform to compute Access Point availability.

Status

This metric provides the status of the access point.

Target Version Default Warning Threshold Default Critical Threshold Alert Text

All versions

Not Defined

0

Access point is down.

Network Ports Ethernet Events

The metrics in this category provide information about the network ports ethernet events.

Port OID Index

This metric provides the index number of the port OID.

Admin State

This metric provides the information about the administrative state of the link.

Operational Status

This metric provides the operational status of the port.

Network Ports InfiniBand Events

The metrics in this category provide information about the network ports InfiniBand events.

Port OID Index

This metric provides the index number of the port OID.

Active Speed

This metric provides the active speed of the link since the last collection.

Active Width

This metric provides the width of the link since the last collection.

Counter Value

This metric provides the counter value.

Error Rate Interval

This metric provides the error rate interval.

Link State

This metric provides the link state.

Description

This metric provides a description of the event.

Node Index

This metric provides the node index.

Node Lid

This metric provides the local identifier (LID).

Error Counter

This metric provides the error counter.

Address

This metric provides the address.

Symbol Error Increase

This metric provides the increase in symbol errors.

Power Supply Status

The metrics in this category provide information about the power supply status.

Power Supply Name

This metric provides the name of the power supply.

Target Version Collection Frequency

All versions

Every 15 Minutes

Sensor Value Units

This metric provides the units for the sensor value.

Target Version Collection Frequency

All versions

Every 15 Minutes

Power Supply Status

This metric provides the status of the power supply.

Target Version Collection Frequency

All versions

Every 15 Minutes

Power Supply Sensor Value

This metric provides the value of the power supply sensor.

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every 15 Minutes

Not Defined

3

State of the power supply %KeyValue% is either warning or critical. Current state code is %value%. (state code values are mapped as 1=normal,2=warning,3=critical,4=shutdown,5=notPresent,6=notFunctioning)

Response

This metric category provides information about the status of the Systems Infrastructure Switch target.

Status

This metric provides the status of the Systems Infrastructure Switch target.

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every Minute

Not Defined

0

%target% is Down.

Sensor Status

The metrics in this category provide information about the status of the sensor.

Sensor Identifier

This metric provides the ID of the sensor.

Target Version Collection Frequency

All versions

Every Hour

Component Identifier

This metric provides the ID of the hardware component.

Target Version Collection Frequency

All versions

Every Hour

Read Value

This metric provides the read value of the sensor.

Target Version Collection Frequency

All versions

Every Hour

Status

This metric provides the status of the sensor.

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every Hour

Not Defined

critical

Fault found in sensor %SensorId%.

Value

This metric provides the value from the sensor.

Target Version Collection Frequency

All versions

Every Hour

Switch Ports Statistics

The metrics in this category provide information about the switch ports statistics.

Incoming error rate

This metric provides the rate of incoming errors.

Target Version Collection Frequency

All versions

Every 15 Minutes

Incoming throughput

This metric provides the incoming throughput.

Target Version Collection Frequency

All versions

Every 15 Minutes

Total number of incoming errors

This metric provides the total number of incoming errors.

Target Version Collection Frequency

All versions

Every 15 Minutes

Total number of incoming octets

This metric provides the total number of incoming octets.

Target Version Collection Frequency

All versions

Every 15 Minutes

Total number of outgoing errors

This metric provides the total number of outgoing errors.

Target Version Collection Frequency

All versions

Every 15 Minutes

Total number of outgoing octets

This metric provides the total number of outgoing octets.

Target Version Collection Frequency

All versions

Every 15 Minutes

Outgoing error rate

This metric provides the rate of outgoing errors.

Target Version Collection Frequency

All versions

Every 15 Minutes

Switch Basic Status

The metrics in this category provide information about the switch basic status.

Booted On

This metric indicates whether the switch is booted.

Target Version Collection Frequency

All versions

Every 15 Minutes

Locator Light On

This metric indicates whether the locator light is on.

Target Version Collection Frequency

All versions

Every 15 Minutes

Powered On

This metric indicates whether the switch is powered on.

Target Version Collection Frequency

All versions

Every 15 Minutes

Status

This metric provides the status of the switch.

Target Version Collection Frequency

All versions

Every 15 Minutes

Switch Aggregated System Status

The metrics in this category provide information about the status of the switch aggregated system.

Cable State

This metric provides the state of the cable connection

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every 15 Minutes

Not Defined

FAULTED

Overall cable state in system %KeyValue% is faulted.

Cable State Change

This metric provides the change in cable connectivity.

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every 15 Minutes

Not Defined

FAULTED

Change in cable connectivity in system %KeyValue% is faulted.

Cooling Redundancy

This metric provides the cooling redundancy state.

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every 15 Minutes

Not Defined

FAULTED

Cooling redundancy state in system %KeyValue% is faulted.

Cooling State

This metric provides the overall cooling state.

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every 15 Minutes

Not Defined

FAULTED

Overall cooling state in system %KeyValue% is faulted.

Health Status

This metric provides the health status.

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every 15 Minutes

Not Defined

FAULTED

Overall state of the system %KeyValue% is faulted.

InfiniBand State

This metric provides the InfiniBand state.

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every 15 Minutes

Not Defined

FAULTED

State of the InfiniBand module in system %KeyValue% is faulted.

Locator Light

This metric provides the locator light.

Target Version Collection Frequency

All versions

Every 15 Minutes

Power Redundancy

This metric provides the power redundancy state.

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every 15 Minutes

Not Defined

FAULTED

Power redundancy state in system %KeyValue% is faulted.

Power State

This metric provides the overall power state.

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every 15 Minutes

Not Defined

FAULTED

Overall power state in system %KeyValue% is faulted.

Temperature State

This metric provides the overall temperature state.

Target Version Evaluation and Collection Frequency Default Warning Threshold Default Critical Threshold Alert Text

All versions

Every 15 Minutes

Not Defined

FAULTED

Overall temperature state in system %KeyValue% is faulted.

All versions

Every 15 Minutes

Not Defined

CRITICAL

Overall temperature state in system %KeyValue% is critical.

Voltage State

This metric provides the voltage state.

Target Version Collection Frequency

All versions

Every 15 Minutes