Documentation, Support, and Training
Understanding Administrative Commands
Monitoring the InfiniBand Fabric
Controlling the InfiniBand Fabric
Understanding Signal Routing Through the Switch
Understanding Switch Startup and Component Addition
Understanding ILOM on the Switch
Switching Between the ILOM Shell and the Linux Shell
Controlling ILOM Targets (CLI)
Upgrading the Switch Firmware Through ILOM (CLI)
Administering ILOM (Web Interface)
Access ILOM From the Web Interface
Monitoring ILOM Targets (Web Interface)
Controlling ILOM Targets (Web Interface)
Upgrading the Switch Firmware Through ILOM (Web Interface)
Understanding InfiniBand Cabling
Understanding Switch Specifications
Understanding the Installation
Installing the InfiniBand Software Stack
Verifying the InfiniBand Fabric
Understanding Service Procedures
Servicing the Fabric Cards and Fabric Card Fillers
Servicing the InfiniBand Cables
Doing Supportive Software Tasks
Understanding Switch-Specific Commands
Validates InfiniBand fabric and report errors. Issued on the Linux InfiniBand host.
ibcheckerrors [-h][-b][-v][-N][topology|-C ca_name -P ca_port -t timeout]
where:
topology is the topology file.
ca_name is the channel adapter name.
ca_port is the channel adapter port.
timeout is the timeout in milliseconds.
This InfiniBand software command is a script that uses the topology file created by the ibnetdiscover command to scan the InfiniBand fabric to validate the connectivity and report errors from the port counters.
The following table describes the options to the ibcheckerrors command and their purposes:
|
The following example shows how to check error counters for all LIDs in the InfiniBand fabric with the ibcheckerrors command.
# ibcheckerrors #warn: counter SymbolErrors = 65532 (threshold 10) lid 85 port 255 Error check on lid 85 (Sun DCS 648 shmm1500 LC slot 6 switch 1) port all: FAILED #warn: counter SymbolErrors = 65535 (threshold 10) lid 1 port 255 Error check on lid 1 (Sun DCS 648 shmm1500 LC slot 6 switch 0) port all: FAILED . . . ## Summary: 55 nodes checked, 0 bad nodes found ## 1296 ports checked, 0 ports have errors beyond threshold #
Note - The output in the example is just a portion of the full output.
ibcheckerrors man page