Sun Cluster 2.2 System Administration Guide

14.3.4 How to Replace a Sun StorEdge A5000 Disk (Solstice DiskSuite)

  1. Identify all metadevices or applications using the failing disk.

    If the metadevices are mirrored or RAID5, the disk can be replaced without stopping the metadevices. Otherwise all I/O to the disk must be stopped using the appropriate commands. For example, use the umount(1M) command to unmount a file system on a stripe or concatenation.

  2. Preserve the disk label, if needed.

    For example:

    # prvtoc /dev/rdsk/c1t3d0s2 > /tmp/c1t3d0.vtoc
    
  3. (optional) Use metareplace to replace the disk slices if the disk has not been hot-spared.

    For example:

    # metareplace d1 c1t3d0s2 c1t2d0s2
    d1: device c1t3d0s2 is replaced with c1t2d0s2
    
  4. Use luxadm -F to remove the disk. The -F is required since Solstice DiskSuite does not offline disks.

    Repeat for all hosts, if multihosted. For example:

    # luxadm remove -F /dev/rdsk/c1t3d0s2
    WARNING!!! Please ensure that no filesystems are mounted on these
     device(s).  All data on these devices should have been backed
     up.The list of devices which will be removed is: 
     1: Box Name "macs1" rear slot 1
     Please enter `q' to Quit or <Return> to Continue: stopping:  Drive
     in "macs1" rear  slot 1....Done
     offlining: Drive in "macs1" rear  slot 1....Done
     Hit <Return> after removing the device(s).

    Note -

    The FPM icon for the disk drive to be removed should be blinking. In addition, the amber LED under the disk drive should also be blinking.


  5. Remove the disk drive and enter Return.

    The output should look similar to the following:

    Hit <Return> after removing the device(s). 
     Drive in Box Name "macs1" rear slot 1   
     Removing Logical Nodes: 
     Removing c1t3d0s0 Removing c1t3d0s1 Removing c1t3d0s2 Removing
     c1t3d0s3 Removing c1t3d0s4 Removing c1t3d0s5 Removing c1t3d0s6
     Removing c1t3d0s7 Removing c2t3d0s0 Removing c2t3d0s1 Removing
     c2t3d0s2 Removing c2t3d0s3 Removing c2t3d0s4 Removing c2t3d0s5
     Removing c2t3d0s6 Removing c2t3d0s7# 
  6. Repeat for all nodes if the disk array is in a multi-host configuration.

  7. Use the luxadm insert command to insert the new disk.

    Repeat for all nodes. The output should be similar to the following:

    # luxadm insert macs1,r1
    The list of devices which will be inserted is: 
     1: Box Name "macs1" rear slot 1
     Please enter `q' to Quit or <Return> to Continue: Hit <Return>
     after inserting the device(s).
  8. Insert the disk drive and enter Return.

    The output should be similar to the following:

    Hit <Return> after inserting the device(s).  Drive in Box Name
     "macs1" rear slot 1  Logical Nodes under /dev/dsk and /dev/rdsk :
     c1t3d0s0 c1t3d0s1 c1t3d0s2 c1t3d0s3 c1t3d0s4 c1t3d0s5 c1t3d0s6
     c1t3d0s7 c2t3d0s0 c2t3d0s1 c2t3d0s2 c2t3d0s3 c2t3d0s4 c2t3d0s5
     c2t3d0s6 c2t3d0s7# 

    Note -

    The FPM icon for the disk drive you replaced should be lit. In addition, the green LED under the disk drive should be blinking.


  9. Use scdidadm(1M) to update the DID pseudo device information.

    On all nodes connected to the disk, execute the following command to update new Disk ID information.

    # scdidadm -R DID_instance
    

    where DID_instance is the instance number of the disk that was replaced. Refer to the scdidadm(1M) man page for more information.

  10. Reboot all nodes connected to the new disk.

    To avoid down time, use the haswitch(1M) command to switch ownership of all logical hosts that can be mastered by the node to be rebooted. For example,

    # haswitch phys-hahost2 hahost1 hahost2
    
  11. Label the disk, if needed.

    For example:

    # cat /tmp/c1t3d0.vtoc | fmthard -s - /dev/rdsk/c1t3d0s2
    fmthard:  New volume table of contents now in place.
  12. Replace the metadb, if needed.

    For example:

    # metadb -d c1t3d0s0; metadb -a c1t3d0s0
    
  13. Enable the new disk slices with metareplace -e.

    For example:

    # metareplace -e d0 c1t3d0s0
    d0: device c1t3d0s0 is enabled

    This completes the disk replacement procedure.