JavaScript is required to for searching.
Skip Navigation Links
Exit Print View
Sun Storage J4500 Array Service Manual
search filter icon
search icon

Document Information

Preface

1.  Introduction to the Sun Storage J4500 Array

1.1 Features

1.2 Exterior Features, Controls, and Indicators

1.2.1 Front Panel

1.2.2 Back Panel

1.2.3 Sun Storage J4500 Array Internal Components

1.3 Accessory Kit

2.  Configuring and Powering On the Sun Storage J4500 Array

2.1 Configuration and Cabling

2.1.1 Terms and Definitions

2.1.2 Configuration Rules

2.1.3 Cabling the SAS Connectors

2.1.4 Example Configurations

2.2 Powering On and Off the Array

To Power On the Array

To Place the Array Into Standby Power Mode

To Power Off the Array

2.2.1 AC Power Failure Auto-Recovery

3.  Maintaining the Sun Storage J4500 Array

3.1 Options and Replaceable Components

3.2 Tools and Supplies Needed

3.3 Powering Off the Array and Removing It From the Rack

To Power Off the Array

To Remove the Array Enclosure From the Rack

3.4 Removing and Replacing the Hard Disk Drive Access Cover

To Remove the Hard Disk Drive Access Cover

To Replace the Hard Disk Drive Access Cover

3.5 Internal Component Locations

3.6 Replacing Components

To Replace a Fan Module

To Replace the Front Indicator Board

To Replace a Hard Disk Drive

To Replace the Power Distribution Board

To Replace a Power Supply

To Replace the System Controller Module

To Replace the Array Chassis

3.7 Upgrading Enclosure Firmware

3.7.1 Ensure Both SAS Fabrics are Upgraded to the Same Firmware Revision Level

4.  Troubleshooting

4.1 External Status LEDs

4.2 Internal Disk Drive and Fan LEDs

4.3 Diagnostic and Management Tools

4.3.1 SunVTS

4.3.2 Common Array Manager (CAM)

To Access Service Advisor Procedures

To Reserve the Array for Maintenance

To Release the Array After Maintenance

4.3.2.1 Understanding the CAM Event Log

4.4 Troubleshooting Problems with the Array

4.4.1 Initial Start-up

4.4.2 Check the Event and Performance Logs

4.4.2.1 Identifying Disks in the Array Enclosure

4.4.3 Using the Array Management Software to Monitor Enclosure Health

4.4.4 Array Link Problems

4.4.4.1 Switching SAS Cables or Making New Connections

4.4.5 Disk Problems

To Replace a Disk

4.4.5.1 Guidelines for Removal and Replacement of RAID Storage

4.4.5.2 Persistent Affiliation When Changing HBAs

4.4.5.3 If You Do Not See All of the 48 Disks

4.4.5.4 Multipath Problems With Unsupported Drives

4.4.6 Array Environment Problems

4.4.7 Power Problems

4.5 Resetting the Enclosure Hardware

To Reset the Enclosure Hardware Using the Reset Button

4.6 Clearing the Enclosure Zoning Password

To Clear the Enclosure Zoning Password

A.  System Specifications

B.  Connector Pinouts

B.1 Mini-SAS Connectors

B.2 I/O-to-Disk Backplane Connectors

B.2.1 Power Blade Connector

B.2.2 High-Speed Dock Connectors

B.3 Power Supply Connector

B.4 Disk Backplane-to-Front Indicator Connector

B.5 Backplane-to-Disk-Backplane Connector

B.6 Fan Tray Connectors

B.7 Fan Connectors

Index

4.3 Diagnostic and Management Tools

For the most part, you will need to use a combination of HBA and array management tools, log files, and enclosure LEDs to help isolate problems. However, available system level software, such as SunVTS, may contain additional tools for problem identification/resolution.

4.3.1 SunVTS

SunVTS is the Sun Validation Test Suite, which provides a comprehensive diagnostic tool that tests and validates Sun hardware by verifying the connectivity and functionality of most hardware controllers and devices on Sun platforms. SunVTS software can be tailored with modifiable test instances and processor affinity features.

SunVTS 6.2 or later software might be preinstalled on some Sun servers or included as bootable Diagnostics CD. Booting the system with the CD in the server's internal DVD drive starts SunVTS software. Diagnostic tests run and write output to log files that the service technician can use to isolate problems.

4.3.2 Common Array Manager (CAM)

The Sun StorageTek Common Array Manager (CAM) software includes the Service Advisor application, which provides guided wizards with system feedback for hardware replacement of Customer Replaceable Units (CRUs). In addition, Service Advisor provides troubleshooting procedures for alarms.


Note - All Field Replaceable Units (FRUs) are also CRUs in the J4500 array.


Before you can access Service Advisor procedures, you must have already installed the Common Array Manager software, as described in the Sun StorageTek Common Array Manager User Guide for your version of CAM.

Enclosure management (including viewing the event log and upgrading enclosure firmware) and remote command line interface (CLI) functions are performed by the Sun StorageTek Common Array Manager software.

The CRU replacement procedures available through the Sun StorageTek Common Array Manager Service Advisor application include (but are not limited to):

To Access Service Advisor Procedures

To launch Service Advisor and access hardware replacement procedures:

  1. Log on to the Sun Java Web Console on the management software host.

    For example, https://management_host_address:6789

  2. In the Storage section of the Sun Java Web Console page, select Sun StorageTek Common Array Manager.

    The navigation pane and the Storage System Summary page appear.

  3. Select an array under Storage Systems.
  4. At the top right of the Storage System Summary page, click the Service Advisor button.

    The Service Advisor application is displayed in a separate window.

  5. In the left pane, select the type of hardware replacement procedure you want to perform:
    • CRU/FRU Removal/Replacement Procedures

    • Array Utilities


    Note - If you see Service-only procedures listed, these are password protected for access by Sun service personnel only. Contact a Sun service representative for further information and assistance with service only procedures.


  6. To view a procedure, in the right pane either select it or expand its category, and select the hardware component that corresponds to the procedure.

To Reserve the Array for Maintenance

Do the following to reserve the array for maintenance. This action will alert other users that a service action is in progress when they login.

  1. From the Service Advisor, click the link to reserve the array for maintenance.
  2. Enter a description of the service action.
  3. Select the estimated duration of the service action in hours from the pull-down.
  4. Select the Reserve button.
  5. Use the back arrow to return to the procedure.

To Release the Array After Maintenance

Once the required maintenance has been performed, release the array for normal operation.

4.3.2.1 Understanding the CAM Event Log

This section provides a listing of possible J4500 array events, descriptions, and where applicable, Service action recommendations.

Refer to the Sun StorageTek Common Array Manager User Guide for your version of CAM for information on viewing system events and configuring automatic notifications.

The severity of an event in CAM is includes one of the following designations:


Note - When Auto Service Request (ASR) is enabled, it monitors the array system health and performance and automatically notifies the Sun Technical Support Center when critical events occur. Critical alarms generate an Auto Service Request case. The notifications enable SunService to respond faster and more accurately to critical on-site issues.


Table 4-3 CAM Events for the Sun Storage J4500 Array

Code
Event Name
Severity
Description
xx.5.13
ValueChangeEvent-.disk
Major/Critical
The Disk has changed state from OK to something else. Action: A disk may have been removed, or failed. Check the alarm log for additional events.
xx.5.19
ValueChangeEvent-.fan
Major/Critical
A fan has changed state from OK to something else. Action: Check fan LEDs to locate the fault and replace the faulty fan to ensure nominal system operating temperature.
xx.5.227
ValueChangeEvent-.ps
Major/Critical
A power supply has changed state from OK to something else. Action: check the event log and chassis fault LEDs to find the trouble. Replace the faulty power supply.
xx.5.586
ValueChangeEvent-.chassis
Major/Critical
Chassis has had a negative state change. Action: Look for other events that can help identify the problem, check chassis fault LEDs. Replace any failed components.
xx.5.590
ValueChangeEvent-.overTemperatureFailure
Major
The system has detected a critical over-temperature. Action: This event should have shut down the array. Look for other events that can help identify the problem. Check the array's cooling vents and environment. You will need to press the array's power button to re-apply main power to the array, Check chassis fault LEDs and replace any failed components.
xx.5.591
ValueChangeEvent-.overTemperatureWarning
Major
The system has detected a warning temperature. Action: Look for other events that can help identify the problem. Check the array's cooling vents and environment. Check chassis fault LEDs and replace any failed components.
xx.11.21
CommunicationEstablishedEvent.ib
Minor
Indicates that communication has been re-gained to the storage array via the in-band path.
xx.12.21
CommunicationLostEvent.ib
Major/Critical
Indicates that communication has been lost to the array, and that the last path successfully used was the in-band communication path.
xx.12.31
CommunicationLostEvent.oob
Major/Critical
Indicates that communication has been lost to the proxy host connected to the storage array.
xx.14.16
DiscoveryEvent
Minor
Indicates that the discovery of an array or proxy host containing one or more arrays has occurred.
xx.41.13
ComponentRemoveEvent.disk
Major/Critical
A disk has changed state from OK to a removed state. Action: Check the alarm log to determine whether the disk has failed or has been removed for maintenance.
xx.41.19
ComponentRemoveEvent.fan
Minor
A fan has changed state from OK to a removed state. Action: Check the alarm log to determine whether the fan has failed or has been removed for maintenance.
xx.41.227
ComponentRemoveEvent.ps
Minor
A power supply has changed state from OK to a removed state. Action: Check the alarm log to determine whether the power supply has failed or has been removed for maintenance.
xx.75.42
RevisionDeltaEvent.revision
Minor
The firmware revision of the enclosure is not at baseline. Action: upgrade firmware to baseline.