C H A P T E R  1

Initial Inspection of the Server

This chapter includes the following topics:


Service Troubleshooting Flowchart

Use the following table as a guideline for using the subjects in this book to troubleshoot the server.


TABLE 1-1 Troubleshooting Sections by Task

To perform this task

Refer to this section

Gather initial service information.

Gathering Service Information

Investigate any powering-on problems.

Troubleshooting Power Problems

Perform external visual inspection and internal visual inspection.

Externally Inspecting the Server

Internally Inspecting the Server

Chapter 3

View BIOS event logs and POST messages.

Viewing Event Logs

Power-On Self-Test (POST)

View service processor logs and sensor information...

Using the ILOM Service Processor GUI to View System Information

...or view service processor logs and sensor information.

Using IPMItool to View System Information

Run SunVTS diagnostics.

Diagnosing Server Problems With the Bootable Diagnostics CD



Gathering Service Information

The first step in determining the cause of a problem with the server is to gather information from the service-call paperwork or the onsite personnel. Use the following general guideline steps when you begin troubleshooting.

To gather service information:

1. Collect information about the following items:

2. Document the server settings before you make any changes.

If possible, make one change at a time in order to isolate potential problems. In this way, you can maintain a controlled environment and reduce the scope of troubleshooting.

3. Take note of the results of any change that you make. Include any errors or informational messages.

4. Check for potential device conflicts before you add a new device.

5. Check for version dependencies, especially with third-party software.


System Inspection

Controls that have been improperly set and cables that are loose or improperly connected are common causes of problems with hardware components.

Troubleshooting Power Problems

1. Check that AC power cords are attached firmly to the server’s power supplies and to the AC sources.

2. Check that the main cover is firmly in place.

There is an intrusion switch on the motherboard that automatically shuts down the server power to standby mode when the cover is removed.

Externally Inspecting the Server

To perform a visual inspection of the external system:

1. Inspect the external status indicator LEDs, which can indicate component malfunction.

For the LED locations and descriptions of their behavior, see External Status Indicator LEDs.

2. Verify that nothing in the server environment is blocking air flow or making a contact that could short out power.

3. If the problem is not evident, continue with the next section, Internally Inspecting the Server.

Internally Inspecting the Server

To perform a visual inspection of the internal system:

1. Choose a method for shutting down the server from main power mode to standby power mode. See FIGURE 1-1 and FIGURE 1-2.



caution icon Caution - Performing an emergency shutdown can cause open files to become corrupt. Use an emergency shutdown only when necessary.


When main power is off, the Power/OK LED on the front panel will begin flashing, indicating that the server is in standby power mode.



caution icon Caution - When you use the Power button to enter standby power mode, power is still directed to service processor and power supply fans, indicated when the Power/OK LED is flashing. To completely power off the server, you must disconnect the AC power cords from the back panel of the server.


FIGURE 1-1 X4140 Server Front Panel


Graphic showing the Sun Fire X4140 front panel.


Figure Legend

1

Locate button/LED

2

Power button


FIGURE 1-2 X4240 and X4440 Server Front Panel


Graphic showing the Sun Fire X4240 and X4440 front panel.


Figure Legend

1

Locate button/LED

2

Power button


2. Remove the server cover.

For instructions on removing the server cover, refer to your server’s service manual.

3. Inspect the internal status indicator LEDs. These can indicate component malfunction.

For the LED locations and descriptions of their behavior, see Internal Status Indicator LEDs.



Note - The server must be in standby power mode for viewing the internal LEDs.


You can hold down the Locate button on the server back panel or front panel for 5 seconds to initiate a “push-to-test” mode that illuminates all other LEDs both inside and outside of the chassis for 15 seconds.

4. Verify that there are no loose or improperly seated components.

5. Verify that all cable connectors inside the system are firmly and correctly attached to their appropriate connectors.

6. Verify that any after-factory components are qualified and supported.

For a list of supported PCI cards and DIMMs, refer to your server’s service manual.

7. Check that the installed DIMMs comply with the supported DIMM population rules and configurations, as described in DIMM Population Rules.

8. Replace the server cover.

9. To restore the server to main power mode (all components powered on), use a ballpoint pen or other stylus to press and release the Power button on the server front panel. See FIGURE 1-1 and FIGURE 1-2.

When main power is applied to the full server, the Power/OK LED next to the Power button lights and remains lit.

10. If the problem with the server is not evident, you can obtain additional information by viewing the power-on self test (POST) messages and BIOS event logs during system startup. Continue with Viewing Event Logs.