Sun Cluster 3.0 12/01 Release Notes Supplement

How to Replace a Failed Controller or Restore an Offline Controller

Use this procedure to replace a StorEdge/Netra st A1000 controller, or to restore an offline controller.

For conceptual information on SCSI reservations and failure fencing, see the Sun Cluster 3.0 12/01 Concepts.

  1. Determine if the array contains a LUN that is configured as a quorum device.


    # scstat -q
    
    • If the array does not contain a quorum device, go to Step 2.

    • If the array contains a LUN that is configured as a quorum device, choose and configure another LUN on a different array to be the new quorum device. Then remove the old quorum device.

  2. Restart the RAID Manager daemon:


    # /etc/init.d/amdemon stop
    # /etc/init.d/amdemon start
    

  3. Do you have a failed controller?

    • If your array is offline, but does not have a failed controller, go to Step 4.

    • If you have a failed controller, replace the failed controller with a new controller, but do not bring the controller online.

      For the procedure on replacing StorEdge/Netra st A1000 controllers, see the Sun StorEdge A3500/A3500FC Controller Module Guide and the Sun StorEdge RAID Manager Installation and Support Guide for additional considerations.

  4. On one node, use the RAID Manager GUI's Recovery application to restore the controller online.


    Note -

    You must use the RAID Manager GUI's Recovery application to bring the controller online.


    For information on the Recovery application, see the Sun StorEdge RAID Manager User's Guide. If you have problems with bringing the controller online, see the Sun StorEdge RAID Manager Installation and Support Guide.

  5. On one node that is connected to the StorEdge/Netra st A1000 array, verify that the controller has the correct SCSI reservation state.

    Run the scdidadm(1M) repair option (-R) on LUN 0 of the controller you want to bring online:


    # scdidadm -R /dev/dsk/cNtXdY