2 Diagnostics Overview





This chapter describes some of the applicable diagnostic software used to test the Desktop Storage Module.

The diagnostic software available includes the following:

When to Use Diagnostics

You should use the appropriate diagnostic tool to solve specific problems. Table 2-1 summarizes the available diagnostic tools, and indicates the correct use of each one.

    Table 2-1 Summary of Available Diagnostic Tools

Diagnostic Tool When or why to use On-Board Diagnostics These diagnostics reside in the system's boot PROM. Tests
such as the Ethernet test and the probe-scsi test are
available. You must be in the Forth Toolkit, ok prompt, to
run these diagnostics. Described later in this chapter.
SunDiag System Exerciser These diagnostics are runs under SunOS. It displays real-time
use of the system resources and peripherals. The SunDiag
System Exerciser tells you if your system and attached devices
are functioning correctly. See the SunDiag User's Guide for
more information.
Sundiagnostic Executive Runs extensive, configurable subsystem tests independent of
SunOS. Run the Sundiagnostic Executive if all tests pass when
you run SunDiag. Running the Sundiagnostic Executive allows
you to troubleshoot which field replaceable unit needs to be
replaced. See the latest version of Sundiagnostic Executive
User's Guide for Desktop SPARCstations for more information.

On-Board Diagnostics

You have access to a number of tests in your system called On-board diagnostics. To invoke these tests, you must enter the Forth Toolkit.

Caution -

In order to run On-board diagnostics, you must halt the system correctly. When the operating system or any other stand-alone program has already booted, do not use the L1-A keys to halt the system. Abruptly aborting program execution may cause damage to data files.

To run On-board diagnostics:
    1. Save all your work and quit all applications.
    2. As root, halt the system by entering /usr/etc/halt.

    You are presented with either the prompt or the ok prompt. The prompt is the default prompt. If you want to change the default prompt to the ok prompt, see the Introduction to Open Boot PROM 2.0.

    If you see the prompt, go to the next step. If you see the ok prompt, go to Step 4.

    3. Enter n to enter the Forth Toolkit.

    The ok prompt shows that you are in the Forth Toolkit.

    4. Enter help diag to list on-board diagnostic tests.

The following figure summarizes the steps you need to take to halt the system, enter the Forth Toolkit, and list the diagnostic tests.

    Figure 2-1 Halting the System and Displaying On-Board Diagnostics

These on-board tests allow you to test the network controller, the diskette drive system, memory, the SCSI bus connections, and the system clock. See "Diagnostic Routines" in the Introduction to Open Boot PROM 2.0 for a detailed description and step-by-step instructions of the available on-board diagnostic tests.

To return to the Monitor, prompt, enter the following:

SunDiag System Exerciser

The SunDiag System Exerciser, which runs under SunOS, displays real-time use of system resources and peripherals such as Desktop Storage Packs and Desktop Storage Modules. To verify that the system is functioning properly run the SunDiag System Exerciser.

This section is divided into two parts. The first half provides a brief overview of the SunDiag diagnostic program and explains how to start and run tests. The second part provides specific information on running each test for Dinnerbox. For a detailed description of the program and the individual tests, see the latest version of the SunDiag User's Guide, shipped with SunOS documentation.

An Overview of SunDiag

The exerciser is shipped with SunOS and is preinstalled on SPARCstations equipped with at least one internal hard disk drive. If the SunDiag System Exerciser is not found on the system, you can load it from tape or compact disc. It can be run at any time.

How to Run SunDiag

The primary way to run SunDiag test is from the windowing system (see Figure 2-2). This is the easiest way to run SunDiag tests, and the only one described in detail in this section.

SunDiag also has a TTY interface (non-window system) to run tests from a terminal, and individual tests can be run from the command line of a SunOS shell. See the latest version of the SunDiag User's Guide for details.

The SunDiag Window

When you start SunDiag, in SunView, the SunDiag window appears on your screen (see Figure 2-2). The window is divided into four sections:

    Figure 2-2 The SunDiag OPEN LOOK Window

Starting the SunDiag SunView Window

Follow these steps to run the SunDiag SunView user interface.

    1. Become superuser.
    2. If you are running Open Windows, exit the window mode.
    3. Open a shell and type the following command to start the SunDiag program and bring up the window interface:

    This command starts the basic SunDiag program. There are a wide variety of options to this command; see the SunDiag User's Guide shipped with SunOS documentation, for details.

    4. Check the control panel on the right side of the SunDiag Window (see Figure 2-3).

    SunDiag automatically probes for available devices. A test is enabled if the frame around looks "pushed in" (or bold for black and white monitors). You can toggle the tests from enable to disable and back by clicking the left mouse button. Move the pointer to the test and click left once.

    If a device is missing from the control panel, make sure that the device is properly installed.

    Figure 2-3 Typical SunDiag Control Panel

Note - SunDiag enables all available tests when it is invoked. Starting all available tests slows down your system considerably. You can avoid most problems by disabling the virtual memory (vmem) test.

    5. Click the Options window button for a device you want to test. An option menu will appear (See Figure 2-4).

    Figure 2-4 Typical Test Option Menu

    6. If the default settings on the option menu are acceptable, click the Apply button and the menu disappears. Then go to Step 8.

    7. If you need to change any of the option settings, click left to toggle through the choices. After setting the options, click the Done button. The changes are saved and the window disappears. To restore the default option settings, click the Default button.

Note - You only need to change the options settings once. SunDiag continues to use your selections each time you run SunDiag.

    8. Repeat Steps 5 and 6 for each test that you want to run.
    9. Press the Start button at the top left of the control panel to start testing. Once testing starts, the Start button becomes the Stop button. Click Stop to halt testing.

Note - Sundiag enables all available tests when it is invoked. Starting all available tests slows down your system considerably. You can avoid most problems by disabling the virtual memory (vmem) test. Use the mouse to toggle the checkmark to "unselected".

SunDiag Tests for Dinnerbox

The following SunDiag tests are used to test the Dinnerbox system configuration.

These tests are described in detail below.

Tape Drive Test (tapetest)

This test writes a pattern to a specified number of blocks (or, for a SCSI tape, writes to the end of the tape). It then rewinds the tape and reads/compares the information just written. The drive can also do a multi-file test called filetest.

Note - If you have a tape drive in your system, load a blank writable tape before you start SunDiag. If you fail to do so, SunDiag will show drive type:unknown on the option menu for the tape test.

SunDiag supports 8 mm, quarter-inch cartridge, and front-load tape drive testing. Examples of the option menus for those devices are shown below.

When you click on the tape test Option button, this menu appears:

    Figure 2-5 tapetest Option Menu (Exabyte 8 mm)

Note - This test may take a while to stop after being disabled

The example above depicts the option menu for an 8 mm tape drive. This menu differs from other tape drive option menus in that it has no format or reconnect option choices.

tapetest Options

Format:

QIC-11 and QIC-24 are tape formats that this test uses when it writes to the scratch tape you have inserted. QIC-11 uses a 1-byte block ID while QIC-24 uses 4 bytes, meaning that each block on a QIC-24 tape is uniquely identifiable.

Use a standard scratch tape for this test.

    QIC-11 format is the default testing format. If you cycle through the options, you may choose QIC-24 only, or both of QIC-11 and QIC-24 formats. If you choose both, the test will first write one pass to the tape in QIC-11 format, and then write a second pass over it in QIC-24 format.

Density

The following settings are available for 8 mm Tape Drives:

EXB8200 Writes 2.3 Gbyte of data to the tape. EXB8500 Increases the density and writes 5 GB of date to the tape. Both Writes and reads the 2.3 GbyteB format, and writes and reads the 5 GB format to the tape.

For half-inch tape drives the available settings are 800, 1600, and 6250 BPIs and ALL. For 60 mb quarter-inch tape drives the available settings are QIC-11 and QIC-24. For 150 mb quarter-inch tape drives the available settings are QIC-150.

Mode

If you toggle to Write/Read, the test first writes to the tape and then reads it back to compare.

If you toggle to Readonly, the test assumes the tape has been properly written and merely reads and compares, which is useful to check head alignment.

Length

The amount of the tape to be tested. The choices are:

EOT The default; tests to the entire tape. Long The SCSI tape tests 70,000 blocks (35 MB) of the tape. Short Only the first 1000 blocks are tested. Do not use this for 8 mm tape drives (the tape life will be extremely low). Specified You must enter the number of blocks to be tested in the
# of blocks item.
# of Blocks

If you selected specified under the Length option, you must enter the number of blocks you wish to test.

File Test

The tape file test writes three files, rewinds and then reads part of the first file and forward spaces to the start of the second file, reads the second file, forward spaces to the start of the third file, and tries to read to the end of that file. For SCSI tapes only, the test then tries to back space to the start of the second file and read it. This tests the ability of the drive to handle multiple files (dump sessions) on one tape.

Streaming

When streaming is disnabled, the test runs very slowly to insure the tape drive does not stream. When streaming is enabled the drive may stream except for large multiple tape drive systems or heavily loaded systems.

Reconnect

When you enable Reconnect: from the option menu, SunDiag attempts to check whether or not disconnect/reconnect is working, on a Sun-3/50 or on a system with a SCSI3 board and both SCSI disk and tape devices. The test forks a child process to test the SCSI disk, which sets up a signal handler to catch a signal from the parent process. It then sleeps for three seconds, reads in two blocks from the disk, sleeps again, and expects to have received a signal from the parent process in the interim. The parent process retensions the tape and then notifies the child. All tape drive operations use Reconnect on the current Sun platforms.

Clean Head
If you want to enable head cleaning, cycle the arrows to the Enable setting.
#of passes

If you have enabled the head cleaning option, you must enter the number of test passes SunDiag should execute before suspending testing to provide time to clean the tape drive head.

tapetest Command Line Syntax

/usr/diag/sundiag/tapetest D=/dev/device_name b=block_count sq nr ro ns ft rc standard_arguments Arguments D=/dev/device_name
device_name
is the name of tape device to be tested. b=block_count
This value specifies the number of blocks to be tested. sq Tells the test to switch the tape format nr Specifies no retention. ro Specifies a read-only test. ns Tells the test no tape sleep. ft Enables a file test rc Enables a reconnect test.

tapetest Quick Test Description

If the -q (quick test) standard_arguments option is used when tapetest is started from a command line, an abbreviated version of this test is run. The abbreviated version tests three large blocks (512x126 bytes) and two small blocks (512 bytes) of memory, and will not sleep for 60 seconds. Otherwise the test procedure is unchanged.

Disk Tests (rawtest and fstest)

rawtest Test Description

This test performs read-only and read-write tests on local disks, using generic SunOS routines.

This test supports all types of Sun disk drives, such as SCSI disks, native or SCSI floppy, Xylogics 7053 disk drive, and so on. The type of drive under test is named at the top of the option menu. For example, when an Intelligent Peripheral Interface is installed on the system under test, the Disk Option Menu header will name the IPI drive (for example id0).

Caution -

If a power failure occurs while rawtest is being run in read-write mode, disk data will be destroyed

If rawtest and filetest are run concurrently, rawtest must be set to read- only mode to avoid data loss.

Option Menu

If you click on a control panel disk Option button, this menu appears:

    Figure 2-6 rawtest and filetest Option Menu

Subtests

This test is unique in that the two Sub-tests, Rawtest and Filetest are run independently from command lines as described below.

Rawtest

See the Options section below.

Filetest

This test exercises the disk controller and drive. It writes two .5 MB files with an incrementing data pattern in /tmp, then reads and compares them. If the system has no /tmp directory, SunDiag creates a /tmp-disktop-sd directory to fill with data, and removes it when you exit. SunDiag selects the drive by issuing the SunOS command df and uses each local drive listed. All disks to be tested should be mounted prior to starting SunDiag.

Note - If there is a power loss or system crash while running fstest, that test will leave temporary files in the system /tmp directory.

Options

The following options can run on either sub-test.

Rawtest Mode

    You may choose read-only testing, read-write testing, or both. If you choose read-only testing, rawtest is executed in read-only mode. If you choose read-write testing, rawtest test is executed in read-write mode.

Rawtest Partition

    Specify the device partition to test.

Rawtest Size

    This option is for stress testing the device. Reducing the test size automatically creates additional processes to exercise the device. Specify the size in Megabytes.

rawtest Command Line Syntax

/usr/diag/sundiag/rawtest D=device_name W P=partition S=size standard_arguments Arguments D=device_name
device_name is the device to be tested. W This option causes rawtest to be run in read-write mode; the default (without this option) is read-only. P=partition
partition is the partition to be tested. The default is partition 2. S=size size is the size of the partition (100MB-1200MB). Default is the entire
partition. (See Rawtest Size above).

fstest Command Line Syntax

/usr/diag/sundiag/fstest /dev/device_name pdata_pattern standard_arguments Arguments /dev/devicename
device_name
specifies the name of the device to be tested. pdata_pattern
data_pattern
is the data pattern specified by one of the following arguments:
s sequential,
0 all zeros
1 all ones
a all a's,
5 all fives
r random.

Sundiagnostic Executive

The Sundiagnostic Executive is an independent operating system. It runs exhaustive subsystem tests independent of SunOS. The Sundiagnostic Executive, which provides you with thorough diagnostics, is described in the Sundiagnostic Executive User's Guide for SPARCstations.