C H A P T E R 8 |
Sun StorEdge 3310 SCSI Controller FRUs |
This chapter provides instructions for removing and installing SCSI controller modules in Sun StorEdge 3310 SCSI arrays.
Topics covered in this chapter are:
Note - Be sure to follow Section 1.2, Static Electricity Precautions for all procedures. |
SCSI controller modules are hot-swappable. In the event that it is impossible or impractical to halt I/O from hosts to the array, a controller can be replaced while the surviving controller is active and servicing I/O.
However, if your configuration enables you to halt I/O during the controller replacement procedure without disruption, it is a good idea to do so. With
write-back cache enabled under heavy I/O, cache synchronization can take three hours or more to complete. During this time, your controller will display "preparing failback" status and the LED remains amber.
It is also possible that hardware errors or configuration errors may be present but undetected, resulting in an unserviceable condition that will only be revealed when the hot-swap is underway. For instance, it is possible that a hardware module might be improperly seated.
It is good practice before replacing a controller FRU to try to verify your system's health. Do not replace a working controller on a channel that is offline.
When a controller fails in a dual-controller configuration, the remaining controller automatically becomes the primary controller, if it is not already. If you replace a controller that has not failed, force-fail the controller to be replaced via one of the following methods.
For details on the fail, show redundancy, and show events commands, refer to the Sun StorEdge 3000 Family CLI User's Guide.
To provide maximum usability, the Sun StorEdge 3310 SCSI array controller module has controller firmware version 3.66 which is only available in SCSI FRUs and which provides the following functions:
When a controller is replaced in a dual-controller configuration, the controller firmware of the remaining functional controller automatically overwrites the firmware of the new replacement controller to maintain compatibility. This is referred to as cross-loading. Cross-loading uses the NVRAM configuration settings to synchronize the firmware version of the newly installed controller to match the firmware version of the running controller.
To upgrade the 3.66 firmware to 4.1x firmware for a single-controller configuration, refer to the patch README instructions.
To downgrade the 3.66 firmware to 3.25 firmware for a single-controller configuration, refer to Section 8.6.1, Downgrading From Controller Firmware Version 3.66 to 3.25.
When replacing a controller FRU in a single-controller chassis, be sure to follow the instructions in Section 8.6, SCSI Controller Replacement for a Single-Controller Array.
It is a recommended practice to always check the SunSolve Download Center located on http://sunsolve.sun.com for the latest available firmware upgrades.
Note - Be sure to check and update SAF-TE firmware in all configurations to provide the best performance. The latest SAF-TE firmware is included in each controller firmware patch. |
This section includes the following topics:
Before replacing a controller module, save the NVRAM configuration settings to disk. The saved NVRAM file can be used to restore configuration settings only if the replacement controller has the same firmware version as the defective controller.
1. From the RAID firmware Main Menu, choose "system Functions controller maintenance save NVRAM to disks."
A message informs you that NVRAM information has been successfully saved.
To determine the firmware version of your SCSI controller, check the model number. TABLE 8-1 shows the model numbers for SCSI controllers.
To replace a 4.1x SCSI controller module with a 3.25 SCSI controller module, see Section A.2, Replacing a 4.1x Controller with a 3.25 Controller (Sun StorEdge 3310 SCSI Controller FRUs Only).
For all other dual-controller configurations, perform the following procedures.
2. Remove all cables from the controller module.
3. Turn the thumbscrews on the left and right sides of the controller module counterclockwise until the thumbscrews are disengaged from the chassis.
4. Hold the thumbscrews, and pull out the controller module.
Caution - DO NOT POWER OFF the chassis when you replace a controller module. Multiple problems can occur. If you power off the array and replace a controller module in a dual-controller configuration, the replacement controller could become the primary controller and overwrite any configuration settings previously set. Additionally, if the array is powered off incorrectly, data that is written to cache and that has not been completely written to the disks will be lost. If you powered off the array during replacement, see Section 8.3.4, Restoring the Configuration Settings of a Powered-Off Array. |
2. Gently slide the replacement controller module into the array.
Caution - Be sure that the module is properly inserted into the guide rails of the array and that you keep the power on. If you power off and then replace the module, you will have to continue with additional steps. See Section 8.3.4, Restoring the Configuration Settings of a Powered-Off Array. |
a. Insert the controller FRU into the slot, and push forward until you begin to feel resistance as the connecting pins engage.
b. Push the controller FRU until the connecting pins are fully seated and the RAID controller fits flush against the back plate of the RAID array.
Failure to insert the controller carefully as described above can cause one of the following problems:
3. Turn the thumbscrews on the left and right sides of the controller module clockwise until they are finger-tight, to secure the module and to make the module's front panel flush with the chassis.
To ensure that a thumbscrew is finger-tight, tighten it with a screwdriver and then loosen the thumbscrew counterclockwise a quarter turn.
The new controller automatically becomes the secondary controller.
In a redundant controller configuration where a new controller FRU is installed, the controller status LED will remain amber until the controllers complete the redundant controller process, which can take more than 10 minutes. Identical firmware versions on both controllers is required for proper redundant controller operation.
The redundant controller process automatically cross-loads the firmware version of the newly installed controller FRU to match the firmware version of the other running controller. For example, if the running controller has firmware 4.15C and the new controller has 4.15E, the new controller will be cross-loaded with the 4.15C firmware of the running controller.
To monitor this process, see Section 8.3.3, Monitoring the Automatic Firmware Update for a Recently Installed Controller FRU.
4. If you want the most current version of firmware on your controllers, download the latest firmware patch as described in the release notes for your array.
5. Reconnect the original cables to the new controller module.
To monitor the status of the automatic firmware update, use the Sun StorEdge CLI show redundancy command. The Sun StorEdge CLI will display the progression of "Failed," "Scanning," "Detected" and "Enabled" states.
If the array was inadvertently powered off during the controller replacement in a dual-controller configuration, you must perform the following steps to ensure successful operation of your array.
1. In a dual-controller configuration where both controllers have the same version number (such as 4.15C and 4.15E), power on the array and wait a minimum of 10 minutes for firmware cross-load to occur.
2. In a dual-controller configuration where the controller modules have different firmware versions (such as 3.25 and 3.66), perform the following steps:
b. Remove the 3.66 controller module. See Section 8.3.1, Removing a SCSI Controller Module for instructions.
c. Power up the array with the 3.25 controller module still in the array and allow a few minutes to complete the initialization.
d. Confirm that controller version 3.25 is active by entering one of the following commands.
The firmware version is the Revision number.
e. Re-insert the 3.66 controller module. See Section 8.3.2, Installing a SCSI Controller Module for instructions.
f. Wait a minimum of 10 minutes for the firmware cross-load to be completed.
3. Confirm that the secondary controller is active on the array by entering one of the following commands.
The Redundant Controller: Enabled status indicates an initialized secondary redundant controller.
The Redundancy Status: Enabled indicates an initialized secondary redundant controller.
If one controller fails in a dual-controller configuration, you might want to run a single controller for an extended period of time so that the array does not display as degraded.
For instructions on converting a dual-controller configuration to a single-controller configuration, refer to the section titled "To Convert a Dual Controller Array to a Single Controller Array" in the Sun StorEdge 3000 Family Configuration Service User's Guide.
Note - SCSI single-controller arrays ship with a blanking panel covering the secondary controller slot. This must be removed in order to install a secondary controller. |
1. Keep the array powered on and make sure that the connected hosts are inactive.
2. Turn the thumbscrews on the left and right sides of the blanking panel and remove it.
3. Gently slide the new controller module into the array.
Caution - Be sure that the module is properly inserted in the guide rails of the array and that you keep the power on. |
a. Insert the controller FRU into the slot, and push forward until you begin to feel resistance as the connecting pins engage.
b. Push the controller FRU until the connecting pins are fully seated and the RAID controller fits flush against the back plate of the RAID array.
Failure to insert the controller carefully as described above can cause one of the following problems:
4. Turn the thumbscrews on the left and right sides of the controller module clockwise until they are finger-tight to secure the module and to make the module's front panel flush with the chassis.
To ensure that a thumbscrew is finger-tight, tighten it with a screwdriver and then loosen the thumbscrew counterclockwise a quarter turn.
The new controller automatically becomes the secondary controller.
When the new controller FRU is installed, the controller status LED will remain amber until the controllers complete the redundant controller process, which can take more than 10 minutes. The same firmware versions must be installed on both controllers for proper redundant-controller operation.
The redundant-controller process automatically cross-loads the firmware version of the newly installed controller FRU to match the firmware version of the other running controller. For example, if the running controller has firmware 4.12B and the new controller has 4.15, the new controller will be cross-loaded with the 4.12B firmware of the running controller. To monitor this process, see Section 8.3.3, Monitoring the Automatic Firmware Update for a Recently Installed Controller FRU.
5. If you want the most current version of firmware on your controllers, download the latest firmware patch as described in the release notes for your array.
6. Set up the host channels for the new controller module.
For host channel set up information, refer to the "Connecting Your Array" chapter in the Sun StorEdge 3000 Family Installation, Operation, and Service Manual for your array.
Caution - You must set the hosts to the correct host channels on the controller module or your configuration will not work correctly. |
To replace a SCSI controller module in a single-controller configuration, perform the following steps.
1. If possible, make a record of the firmware version and configuration settings before replacing the controller.
a. Use the show configuration CLI command to output the configuration settings to a file. Refer to the Sun StorEdge 3000 Family CLI User's Guide for more information.
b. Save NVRAM configuration settings to disk.
From the RAID firmware Main Menu choose "system Functions Controller maintenance Save nvram to disks," and choose Yes to save the contents of NVRAM to disk.
c. Record the Controller Unique Identifier which combines the serial number and MAC address for each chassis and is used for network connections.
From the RAID firmware Main Menu, choose "view and edit Configuration parameters Controller Parameters Controller Unique Identifier <hex>."
a. Keep the array powered on and make sure that the connected hosts are inactive.
b. Remove all cables from the controller module.
c. Turn the thumbscrews on the left and right sides of the controller module counterclockwise until the thumbscrews are disengaged from the chassis.
d. Hold the thumbscrews, and carefully pull out the controller module.
3. Insert the replacement controller.
b. Insert the controller FRU into the slot, and push forward until you begin to feel resistance as the connecting pins engage.
c. Push the controller FRU until the connecting pins are fully seated and the RAID controller fits flush against the back plate of the RAID array.
Caution - Be sure that the module is properly inserted into the guide rails of the array. |
d. Turn the thumbscrews on the left and right sides of the controller module clockwise until they are finger-tight, to secure the module and to make the module's front panel flush with the chassis.
To ensure that a thumbscrew is finger-tight, tighten it with a screwdriver and then loosen the thumbscrew counterclockwise a quarter turn.
4. Reconnect the original cables to the new controller module.
5. Download the desired firmware version from SunSolve.
a. To downgrade from firmware 3.66 to firmware 3.25, follow the instructions in Section 8.6.1, Downgrading From Controller Firmware Version 3.66 to 3.25.
b. To upgrade from firmware 3.66 to firmware 4.1x, refer to the patch README file supplied with the firmware.
6. Confirm that the "Controller Unique Identifier" parameter is set to the value recorded in Step 1c.
a. From the RAID firmware Main Menu, choose "view and edit Configuration parameters Controller Parameters Controller Unique Identifier <hex>."
b. If the "Controller Unique Identifier" is not set to the value recorded in step 1c, type the value 0 (to automatically read the chassis serial number from the midplane) or type the hex value for the original serial number of the chassis (used when the midplane has been replaced).
The Controller Unique Identifier is used to create Ethernet MAC addresses and worldwide names. The value 0 is immediately replaced with the hex value of the chassis serial number. A nonzero value should be specified only if the chassis has been replaced, but the original chassis serial number must be retained; this feature is especially important in a Sun Cluster environment, to maintain the same disk device names in a cluster.
c. To implement the revised configuration settings, choose "system Functions Reset controller" from the RAID firmware Main Menu, then choose Yes to confirm.
7. If the Sun StorEdge Configuration Service agent was stopped, restart it.
If other software such as StorADE was stopped, restart it following the procedures in the documentation for that software.
If you have a 3.66 SCSI replacement controller module for a single-controller configuration (see TABLE 8-1 for controller part numbers), you must downgrade the controller firmware to the released 3.25 firmware version or upgrade to a released 4.1x firmware version.
The 3.25 firmware works with the latest SAF-TE, PLD, and software versions. There is no need to downgrade any of these components if you downgrade the controller version. For instance, CLI version 2.0 is compatible with controller version 3.27. CLI 1.6.2 can be used to downgrade to 3.25 controller firmware but cannot be used to upgrade to 4.1x controller firmware.
The CLI download controller-firmware command restores factory defaults with downgrades and does not restore:
1. Change your working directory to the directory in which the patch was unpacked and confirm that the SUN325W-3310.bin file is present using the ls command.
2. Invoke the Sun StorEdge CLI.
3. Verify that you are running CLI version 1.6.2 or 2.x with the version command.
4. Verify the product and revision of the array by typing the following command:
Confirm that the correct product name is displayed. Otherwise, this patch does not apply. Select a different device or discontinue installation of this patch. If the firmware version reported is 3.66, continue with Step 5.
5. If feasible, save the configuration in a separate location.
If you are unable to restore the 3.25 configuration, you can reference this file.
where filename.txt is a text file, or
where filename.xml is an xml file.
6. Stop all I/O to the array before beginning the controller firmware downgrade and unmount any filesystems or volumes mounted from the array.
7. At the sccli> prompt, type:
where filename is SUN325W-3310.bin for the Sun StorEdge 3310 SCSI array.
Note - Disregard the CLI message that a script is available to automate the download; the script is only for upgrades. |
The download controller-firmware command will display messages indicating that it is downloading the firmware, programming the controller's flash memory, and "engaging" the new firmware. Wait until the sccli> prompt appears again before proceeding. This might take 10 minutes or more.
Caution - DO NOT POWER OFF the array or remove a controller FRU within 10 minutes of performing a controller firmware upgrade, or the controller may be rendered inoperable. |
8. To re-establish communication with the array, use the serial connection to restore the IP address, netmask, and gateway.
9. To complete the downgrade, access the CLI and enter the following commands:
10. To re-establish communication with the array, use the serial connection to restore the IP address, netmask, and gateway.
11. Verify the firmware revision of the array by typing the following command:
Confirm that the firmware revision is now reported as 325W for the SCSI array.
12. Reconfigure your array to the desired configuration.
a. If you saved a 3.25 configuration file to restore, type:
where filename is the name of the file that contains the configuration.
b. Otherwise, configure the array using the CLI or the firmware application.
Caution - Do not restore a 4.1x configuration on a 3.25 controller. This may cause data loss. |
13. Confirm that the Controller Unique Identifier parameter value is correct.
From the RAID firmware Main Menu, choose "view and edit Configuration parameters Controller Parameters Controller Unique Identifier <hex>."
14. If the "Controller Unique Identifier" is not set correctly, perform the following steps.
a. Type the value 0 (to automatically read the chassis serial number from the midplane) or type the hex value for the original serial number of the chassis (used when the midplane has been replaced).
The Controller Unique Identifier is used to create Ethernet MAC addresses and worldwide names. The value 0 is immediately replaced with the hex value of the chassis serial number. A nonzero value should be specified only if the chassis has been replaced, but the original chassis serial number must be retained; this feature is especially important in a Sun Cluster environment to maintain the same disk device names in a cluster.
b. To implement the revised configuration settings, choose "system Functions Reset controller" from the RAID firmware Main Menu, then choose Yes to confirm.
15. If the Sun StorEdge Configuration Service agent was stopped, restart it.
If other software such as StorADE was stopped, restart it following the procedures in the documentation for that software.
Copyright © 2007, Dot Hill Systems Corporation. All rights reserved.