Sun Cluster 2.2 System Administration Guide

Chapter 13 Administering Sun StorEdge A3x00s and Sun StorEdge A1000s

This chapter provides instructions for administering the Sun StorEdge A3x00 and Sun StorEdge A1000 expansion units and disks included in your Sun Cluster configuration. Both expansion units support RAID5 hardware. The Sun StorEdge A3x00 has two RAID5 controllers, and the Sun StorEdge A1000 has only one. Most administration procedures are the same for both expansion units.

This chapter includes the following procedures:

Use the service manual for your Sun StorEdge A3x00 or Sun StorEdge A1000 disks, and your volume management software documentation, when you are replacing or repairing disk hardware in the Sun Cluster configuration.

13.1 Power Sources

The Sun StorEdge A3x00 or Sun StorEdge A1000 disk expansion unit includes redundant power sequencers. Each power sequencer supplies power to half of the expansion unit's components, so power loss to one of these power sources does not affect system availability. The controller module has redundant power supplies, so a single power failure in the controller module also will not affect system availability.

There are no special procedures required by Sun Cluster to recover in the event of a failure of the entire disk expansion unit. Follow the procedures in the disk expansion service manual to bring them back into service.

13.2 Servicing

From the perspective of the Sun Cluster software, the multiple disks of the Sun StorEdge A3x00 or the Sun StorEdge A1000 appear to be a single, highly reliable disk. Most service procedures are done using the Sun StorEdge RAID Manager software. However, if you ever need to disconnect the interface card between the node and the SCSI bus (the UDWIS card), use the procedure in this section. For example, use these procedures to change UDWIS cards or to replace the node system board.

13.2.1 How to Disconnect the UDWIS Card

  1. Use the haswitch(1M) command to migrate any logical hosts from the Sun Cluster node to be serviced.

    phys-hahost1# haswitch phys-hahost2 hahost1 hahost2
    
  2. Quiesce activity on the SCSI buses involved with the hardware service.

    If the configuration has both SCSI buses to a Sun StorEdge A3x00 or a Sun StorEdge A1000 on a common board, select one bus to perform this step. If there are multiple expansion units connected to this board, one bus on each expansion unit can be prepared in parallel. If only the UDWIS card needs replacement, other expansion unit connections can remain connected.

    Perform this step using the Maintenance/Tuning application of the RAID Manager software.

  3. Disconnect each quiesced SCSI bus cable at the disk expansion unit end and install the appropriate terminator.

    If the remaining SCSI bus of any expansion unit is also connected to this board and needs to be removed (for example, for system or I/O board replacement), route all I/O to the first SCSI bus (the one just terminated). Disconnect the SCSI cable at the expansion unit end and terminate it.

  4. The system or I/O board is now ready for servicing.

    Disconnect the SCSI cables from the prepared UDWIS cards and perform the required hardware service procedures. When complete, re-establish the original connections by continuing with the following steps. Continue with this node halted or powered off.

  5. If necessary, quiesce one SCSI bus to each expansion unit that needs reconnection.

  6. Remove the terminator on these connections.

    This is the terminator that was installed in Step 3.

  7. Reconnect the SCSI cable between the expansion unit and the UDWIS card.

  8. If another SCSI connection to an expansion unit was affected by the service procedure, route all I/O to the newly reconnected SCSI bus, remove the terminator, and recable the expansion unit to the UDWIS card.

  9. Boot this node.

  10. Restore the logical unit (LUN) distribution back to the standard configuration using the RAID Manager software.

  11. Use the haswitch(1M) command to migrate any logical hosts back to the Sun Cluster node that has been serviced.

    phys-hahost1# haswitch phys-hahost1 hahost1
    

13.3 Adding Sun StorEdge A3x00 or Sun StorEdge A1000 Enclosures

This section describes procedures used to add a Sun StorEdge A3x00 or a Sun StorEdge A1000 expansion unit. These procedures can be done with the Sun Cluster up and running.


Note -

To upgrade the firmware in expansion units, refer to the hardware service manual for your system.


13.3.1 How to Add a Disk Enclosure

  1. Switch over all logical hosts to one of the Sun Cluster nodes in the cluster that will be receiving the new disk enclosure.

    phys-hahost1# haswitch phys-hahost2 hahost1 hahost2
    
  2. Stop Sun Cluster on the node that no longer masters any logical hosts.

    phys-hahost1# scadmin stopnode
    
  3. Add the disk interface card (UDWIS) to the Sun Cluster node.

    Use the instructions that come with the UDWIS interface card.

  4. Cable the new UDWIS card to the disk enclosure.

  5. Perform a reconfiguration reboot on the node with the new UDWIS card.

    phys-hahost1# boot -r
    
  6. Start Sun Cluster on the node.

    phys-hahost1# scadmin startnode
    
  7. Switch over the logical hosts to another node in the cluster.

    phys-hahost1# haswitch phys-hahost1 hahost1 hahost2
    
  8. Perform Step 2 through Step 6 on the node that no longer masters the logical hosts.

  9. Configure the disk subsystems into the cluster.

    Use the expansion unit and SSVM or CVM documentation to set up the disk configuration.

  10. Run the haswitch(1M) command on both nodes to incorporate the new configuration into Sun Cluster.

    # haswitch -r
    

13.4 Administering Sun StorEdge A3x00 or Sun StorEdge A1000 Disks

The procedures for administering Sun StorEdge A3x00 or Sun StorEdge A1000 disks in a Sun Cluster configuration are identical to those used with nodes that are not clustered. Refer to your expansion unit documentation for procedures to add, replace, or repair disks or disk components in your disk expansion unit.

13.4.1 Adding a Sun StorEdge A3x00 or Sun StorEdge A1000 Disk

When adding drives to a Sun StorEdge A3x00 or a Sun StorEdge A1000, add the drives while the system is up and running. Do not reboot the system. Doing so may cause a loss of configuration information on the new drives, and a loss of data and logical unit (LUN) configuration on the existing drives.

If you do see a problem, such as not seeing pre-existing LUNs after adding drives and rebooting the system, then remove the newly-added drives, restart the system, then add them one at a time.

This problem does not occur when the drives are added to an "unused" disk group on a running system, after which it is safe to reboot the system. Because hot-plugging is fully supported on the Sun StorEdge A3x00 and the Sun StorEdge A1000, this is the accepted procedure.

13.4.2 Replacing a Sun StorEdge A3x00 or Sun StorEdge A1000 Disk

The Sun Cluster sees Sun StorEdge A3x00 or Sun StorEdge A1000 disks as logical units (LUNs) and not as physical disks. Because of this:

For additional information regarding logical units and physical disks, refer to your expansion unit documentation.