Exit Print View

Sun Datacenter InfiniBand Switch 72 User’s Guide

Get PDF Book Print View
 

Document Information

Using This Documentation

Related Documentation

Documentation, Support, and Training

Documentation Feedback

Installing the Switch

Understanding Switch Specifications

Routing Service Cables

Understanding InfiniBand Cabling

Understanding the Installation

Shipping Carton Contents

Install the Switch in the Rack

Powering On the Switch

Connecting InfiniBand Cables

Verifying the InfiniBand Fabric

Discover the InfiniBand Fabric Topology

Perform Diagnostics on the InfiniBand Fabric

Validate the InfiniBand Fabric and Report Errors

Administering the Switch

Troubleshooting the Switch

Administrative Command Overview

Monitoring the Hardware

Monitoring the InfiniBand Fabric

Controlling the Hardware

Controlling the InfiniBand Fabric

Servicing the Switch

Understanding Service Procedures

Servicing the Power Supplies

Servicing the Fans

Servicing the InfiniBand Cables

Servicing the Battery

Upgrading the Firmware

Index

Validate the InfiniBand Fabric and Report Errors

The ibcheckerrors command uses the topology file to scan the InfiniBand fabric and validate the connectivity as described in the topology file, and to report errors as indicated by the port counters.

  1. Identify the prerequisite and subsequent installation tasks that you must perform in conjunction with this procedure.

    See Installation Sequence.

  2. On the management controller, type:

    # ibcheckerrors 
    #warn: counter RcvSwRelayErrors = 48342         (threshold 100) lid 25 port 255
    Error check on lid 25 (Sun DCS 72 QDR FC switch o4nm2-72p-2) port all:  FAILED 
    #warn: counter RcvSwRelayErrors = 56839         (threshold 100) lid 25 port 28
    Error check on lid 25 (Sun DCS 72 QDR FC switch o4nm2-72p-2) port 28:  FAILED 
    #warn: counter RcvSwRelayErrors = 56839         (threshold 100) lid 25 port 9
    Error check on lid 25 (Sun DCS 72 QDR FC switch o4nm2-72p-2) port 9:  FAILED 
    #warn: counter SymbolErrors = 65535     (threshold 10) lid 20 port 255
    Error check on lid 20 (Sun DCS 72 QDR switch 1.2(LC)) port all:  FAILED 
    .
    .
    .
    ## Summary: 6 nodes checked, 0 bad nodes found
    ##          144 ports checked, 2 ports have errors beyond threshold
    #

    Note - The output for your InfiniBand fabric will differ from that in the example.


Related Information