C H A P T E R  8

Initial Inspection of the Server

This chapter includes the following topics:


Service Visit Troubleshooting Flowchart

Use the following flowchart as a guideline for using this guide to troubleshoot the Sun Fire Sun Fire X4500/X4540 Servers server.

FIGURE 8-1 Troubleshooting Flowchart


Graphic showing suggested steps for troubleshooting problems during a service visit, using the sections in this book.


Gathering Service Visit Information

Use the following general guidelines when you begin troubleshooting.

1. Collect initial service visit information, from the service-call paperwork or onsite personnel, about the following items:

2. Document the exisiting server settings before you make any changes.

Record the BIOS version, software version and server serial numbers. Check the product notes to view issues associated with the server hardware and software.

3. Adjust the exisiting server settings to correct the problem.

If possible, make one change at a time in order to isolate potential problems. Use this method to maintain a controlled environment and reduce troubleshooting.

4. Note the changes made and results of any change you make.

Include any errors or informational messages.

5. Check for potential device conflicts before you add a new device.

6. Check for version dependencies, especially with third-party software.

7. If the problem is not evident, continue with the next section, Troubleshooting Power Problems.


Troubleshooting Power Problems

Do one of the following.

1. Check that AC power cords are attached firmly to the server’s power supplies and to the AC sources.

Use of the cable clamps will ensure that the AC power cords are attached to the server’s power supplies. FIGURE 8-3 shows AC power cords on the rear panel.

2. Check that the server covers, including hard disk drive access cover, system controller cover, and fan access cover are firmly in place.

Refer to the cover labels. An intrusion switch on the system controller shuts the server down when the hard disk drive access cover is removed.

3. Investigate the conditions that can trigger an automatic shutdown sequence:

A power-off sequence is initiated by a request from either of the following items:

or



Note - Any power supply that is out of spec causes a reset, but only power supplies that remain out of spec for more than 100 mS cause a shutdown.



External Inspection of the Server

Improperly set controls and loose or improperly connected cables are common causes of problems with hardware components.

To perform a visual inspection of the external system:

1. Inspect the front panel LEDs for indications of component malfunction.

FIGURE 8-2 shows the front panel controls and indicators. TABLE 8-1 describes the controls and indicators.

FIGURE 8-2 Sun Fire X4540 Server Front Panel LEDs


Graphic showing the X4540 server front panel with the status indicator LEDs called out on the upper left and on the front of the hard disk drives.


TABLE 8-1 Front Panel Controls and Indicators

#

Name

Color

Description

1

Locate button/LED

White

Operators can turn this LED On remotely to help then locate the server in a crowded server room. Press to turn off.

Pressing the Locate LED/Switch for five seconds turns all indicators ON for 15 seconds.

2

System Fault

White

On - When service action is required.

3

Power/Operation

Green

Steady - Power is On.
Blink - Standby power is On but main power is Off.
Off - Power is Off.

4

System power button

Grey

To power on main power for all the server components.

5

Top failure LED

Amber

On - HDD or fan fault.

6

Rear failure LED

Amber

On - Power supply, or system controller fault (service is required).

7

Over Temperature LED

Amber

On - When system is over temperature.


2. Inspect the back panel LEDs for indications of component malfunction.

FIGURE 8-3 shows the rear panel features. TABLE 8-2 describes each feature.

FIGURE 8-3 Sun Fire X4540 Server Rear Panel LEDs


 [ D ]


TABLE 8-2 Rear Panel Features

#

Name

Description

1

AC power connectors

Verify that the PS LEDs are green. Each power supply has its own AC connector with a clip to secure its power cable.

2

Locate button/LED

White Operators can turn this LED On remotely to help then locate the server in a crowded server room. Press to turn off.

3

Fault LED

Amber - When on, service action required.

Steady - Power is On.
Off - Power is Off.

4

OK LED

Green - Service action allowed.

When On, service action is required.

Blink - Standby power is On but main power is Off.

5

System controller status LEDs

Blue - Ready to remove.

Amber - Fault, service action required.

Green - Operational, no action required.


For additional LED locations and descriptions, see Identifying Status and Fault LEDs.

3. Verify that nothing in the server environment is blocking air flow or making a contact that could short out power.

4. If the problem is not evident, continue with the next section, Internal Inspection of the Server.


Internal Inspection of the Server

To perform a visual inspection inside the server:

1. Shut down the server, from main power to standby power mode.

Choose one of the following methods, using a non-conducting ballpoint pen or stylus. See FIGURE 8-4.

After main power is off, the Power/OK LED on the front panel blinks once every three seconds, indicating that the server is in standby power mode.



caution icon Caution - You must disconnect the AC power cords from the back panel of the server, to completely power off the server. When you use the Power button to enter standby power mode, power is still applied to the graphics-redirect and service processor (GRASP) board and power supply fans, indicated when the Power/OK LED is blinking.


 

FIGURE 8-4 Sun Fire X4540 Server Front Panel


Graphic showing the X4540 server front panel with the status indicator LEDs called out on the upper left and on the front of the hard disk drives.


Figure Legend

1

Power Button

2

Power/OK LED


2. Remove the component covers, including hard disk drive cover, system controller cover, and fan cover, as required.

FIGURE 8-5 shows the server internal components. For instructions on removing the component covers, refer to the Sun Firetrademark X4540 Server Service Manual, 819-4359.

FIGURE 8-5 Sun Fire X4540 Server Internal Components


 [ D ]

3. Inspect the internal status indicator LEDs, which can indicate component malfunction.

For LED locations and descriptions, see Internal Status Indicator LEDs and DIMM Fault LEDs.



Note - You can hold down the Locate button on the server back panel or front panel for 5 seconds to initiate a “push-to-test” mode that illuminates all other LEDs both inside and outside of the chassis for 15 seconds.


4. Verify that there are no loose or improperly seated components.

5. Verify that all cable connectors inside the system are firmly and correctly attached to their appropriate connectors.

6. Verify that any after-factory components are qualified and supported.

For a list of supported PCI cards and DIMMs, refer to the Sun Fire X4540 Server Service Manual, 819-4359.

7. Check that the installed DIMMs comply with the supported DIMM population rules and configurations, as described in Chapter 10, Troubleshooting DIMM Problems.

8. Replace the component covers.

9. To restore main power mode to the server (all components powered on), use a non-conducting ballpoint pen or stylus to press and release the Power button on the server front panel. See FIGURE 8-4.

When main power is applied to the full server, the Power/OK LED next to the Power button lights and remains lit.

10. If the problem with the server is not evident, you can try viewing the power-on self test (POST) messages and BIOS event logs during system startup. Continue with Viewing Event Logs.