C H A P T E R  2

Software Information and Issues

This chapter contains information related to software. To update your server to the latest available software, go to:

To determine operating systems certified or supported for the servers, go to:

This chapter contains the following sections:


2.1 Linux Operating System Issues

This section lists issues for Sun Fire V20z and Sun Fire V40z Servers running supported Linux operating systems. For all items, check the product web sites for future enhancements.

2.1.1 Create LSI IM Volumes Before Installing the OS

To prevent data corruption on Linux operating systems, create IM volumes before you install the OS. To create IM volumes, use the LSI Configuration Utility.

2.1.2 RHEL 4 U1 64-bit and NSV 2.4.0.2a

If you run RHEL 4 U1 (64-bit) with NSV 2.4.0.2a, it is necessary to modify the Makefile to point to the correct directory for the kernel source. The source code for the kernel in RHEL4 U1 (64 bit) is not located in the common area (/usr/src/linux) that usually points to the correct version.

To create a soft link to the correct version of the kernel source code, enter the following command:

ln -s full path to kernel source /usr/src/linuxi

2.1.3 Linux and AMD PowerNow! Capable Processors

If HPET is disabled on a system with AMD PowerNow! processors and a Linux operating system, follow the instructions below to configure the OS appropriately.

2.1.3.1 RHEL 4 (AMD64) / RHEL 4 U1 (AMD64)

Red Hat Enterprise Linux 4 does not support AMD PowerNow! at this time. AMD PowerNow! must be disabled on all systems.

To disable AMD PowerNow!:

1. Enter the following:

chkconfig --level 12345 cpuspeed off

2. Reboot the server.

2.1.3.2 RHEL 4 (x86) / RHEL 4 U1 (x86)

RHEL 4 does not support AMD PowerNow! at this time. AMD PowerNow! does not default to enable on Red Hat Enterprise Linux 4 (x86), so requires no action.

2.1.3.3 SLES 9 SP2 or Newer (AMD64)

AMD PowerNow! defaults to enable on all systems, so requires no action to enable AMD PowerNow! functionality.

2.1.3.4 SLES 9 SP2 (x86)

To enable and use PowerNow!:

1. Add the boot parameter, clock=pmtmr to the file /boot/grub/menu.lst.

2. Reboot the system.

2.1.3.5 SLES 9 SP1 (AMD64)

AMD PowerNow! must be disabled on all systems.

To disable AMD PowerNow!:

1. In the /etc/sysconfig/powersave/common file, replace

POWERSAVE_CPUFREQD_MODULE=””

with

POWERSAVE_CPUFREQD_MODULE=”off

2. Reboot the system.

2.1.3.6 SLES 9 SP1 (x86)

Servers with dual-core processors:

This OS does not support AMD PowerNow! for dual-core processors.

To disable AMD PowerNow!:

1. In the /etc/sysconfig/powersave/common file, replace:

POWERSAVE_CPUFREQD_MODULE=””

with

POWERSAVE_CPUFREQD_MODULE=”off

2. Reboot the system.

 

Servers with single-core processors:

To enable and use PowerNow!:

1. Add the boot parameter clock=pmtmr to the file /boot/grub/menu.lst.

2. Reboot the system.

2.1.4 OpenSSH Version - Modification

Trusted hosts might not work on some implementations without a modification to the OpenSSH script:

/etc/ssh/ssh_config

Currently, the machine uses this version:

OpenSSH_3.6.1p2, SSH protocols 1.5/2.0, OpenSSL 0x0090701f

To modify it, add this to the script:

EnableSSHKeysign yes

2.1.5 PCI-X Hot-plug Support on Linux

PCI-X hot-plug support on Linux requires kernel support as well as a driver to enable the hot-plug capabilities of the AMD-8131 PCI-X chipset element. Linux support of PCI-X hot-plug is an evolving area and a tight interdependency exists between kernel versions and hot-plug drivers. Therefore, be careful to use the correct configuration for your systems hardware/software combination. To download the AMD 8131 PCI-X Tunnel Standard Hot Plug Controller (SHPC) driver, go to:

http://www.amd.com/us-en/Processors/TechnicalResources/0,,30_182_871_9034^9504,00.html

Within the kernel source of your distribution, read the file named pci.txt for additional information about the PCI support layer in Linux.

2.1.6 PRS in This Release

The PRS version included in this software release impacts new products only, because PRS is not field update-able.

2.1.7 Service Processor and Platform Clocks

To ensure that the Service Processor and the platform OS time zone data are in sync, set the platform RTC in Greenwich Mean Time (GMT), and use an OS mechanism to adjust platform time to the local time zone.

2.1.8 SNMP Platform OS Agent, Recommended, Reverse Proxy Configuration

Installation of certain platform OS SNMP agents might cause a review of SP-GROUP-MIB.mib (1.3.6.1.4.1.9237.2.1.1.7) to time out, in the reverse proxy configuration. To avoid this problem, install platform OS SNMP agent net-snmp v5.1.2, only.

2.1.9 Subcommand savedump Causes Watchdog Timeout Error

With more than 4 Gbyte of memory, if you use the savedump subcommand of lcrash, the 4300 and the 2100 hang, and a watchdog timeout error occurs. To prevent this problem, do not use the savedump subcommand. Instead, use the lkcd dump command.

2.1.10 SUSE Memory Bandwidth Performance on Dual Core

In order to obtain optimal performance for dual-core systems, customers should ensure that they use the latest kernel patch. The patch is included in SP1 or later.

If you prefer to install the patch, rather than upgrade to SP1 or later, follow the instructions below.

1. Log on to the SUSE Portal at: http://www.novell.com/linux/suse/portal/

2. Select the Patch Support Database link in the right column of the SUSE Linux Portal page.

3. Select By Product on the SUSE Patch Support Database page.

4. Select SUSE Linux Enterprise Server 9 for AMD64 (x86_64) on the Distributions page.

5. Select the most recent kernel patch.

6. Follow the instructions to download and install the packages for SUSE CORE 9 for AMD64 and Intel EM64T (x86_64).

2.1.11 Trident Video Driver with RHEL 3 Might Cause a System Lockup

The Trident server video driver, shipped with Red Hat Enterprise Linux (RHEL) 3.0, might cause a system lockup under certain conditions. These conditions are described in Red Hat’s Bugzilla database at:

https://bugzilla.redhat.com/bugzilla/show_bug.cgi?id=113533

To avoid this issue, use the VESA X server video driver bundled with RHEL 3.0.

To select the VESA driver during installation, proceed normally until the Graphical Interface (X) Configuration screen appears. Then expand the Other drivers menu and select “VESA driver (generic).”

To select the VESA driver after installation, switch from using a Trident driver to a VESA driver. For detailed steps, see the RHEL documentation.

2.1.12 The 32-bit version of RHEL 3.0 Does Not Recognize More Than 4 Gbytes of Physical Memory

The 32-bit version of RHEL 3.0 does not recognize more than 4 Gbyte of physical memory, even when more than 4 Gbyte is installed. This is a limitation of the default kernel.

Some other 32-bit versions of Linux have trouble recognizing more than 4 Gbyte of memory because of limitations of their default kernels. If your OS demonstrates this issue, contact your OS vendor for instructions to get the correct kernel to support your memory configuration.

2.1.13 Linux Server Management Requires a Customized OpenIPMI Linux Kernel Driver

See the Sun Fire V20z and Sun Fire V40z Servers-Server Management Guide for detailed instructions on installing this customized OpenIPMI driver.

2.1.14 Operating Systems Distributed on DVD Rather than on CD-ROM.

For the Sun Fire V20z Server, it is recommended that you obtain the optional CD/DVD drive (X9260A) to install software from DVD media.



Note - The Sun Fire V40z Servers have only the DVD/diskette drive as an option.


2.1.15 SUSE Linux Enterprise Server 8 Install From CD media using the X-windows-based Installation Utility

If SUSE Linux Enterprise Server 8 (SLES 8) is installed from the CD media using the X-windows-based installation utility, a problem might occur during the post-installation steps. The X-windows-based installation utility might revert control to the primary console and display an error message about the ps command. If this happens, you can return control to the X-windows-based installation utility by pressing Ctrl-Alt-F7 simultaneously on your keyboard. At this point you can proceed with normal post-installation setup with SLES 8.

2.1.16 SCSI HD Installation

If you have a single SCSI hard drive, the drive can be inserted in any slot. If you have two or more hard disk drives, install the drive with the OS boot sector in the lowest-numbered slot among the populated slots.

2.1.17 SUSE Memory Bandwidth Performance on Dual Core Systems

To obtain the best performance for dual-core systems running SLES 9, install SP1 or a later service pack. If this is not an option, install patch-9962, released 21 March 2005. The patch is available on the Novell web site at:

http://support.novell.com/techcenter/search/search.do?cmd=displayKC&externalId=2558830537429cdedb543926fd6344a8html



Note - Patch-9962 is not required for systems running SLES9 SP1 or later.


2.1.18 Translation Look-Aside Buffer (TLB) Reload Causes Errors With Certain Linux Software



Note - If you use Red Hat Enterprise Linux (RHEL) 3.0, install the most recent OS update on the server to minimize this issue.


In the BIOS Advanced menu, the “No Spec. TLB Reload” option is disabled by default. This setting allows the Translation Look-Aside Buffer (TLB) to reload.

With the default setting, errors similar to the following were observed on systems running any 64-bit version of Red Hat Linux and also SUSE Linux with
Service Pack 1.


Northbridge status a60000010005001b
GART error 11
Lost an northbridge error
NB status: unrecoverable
NB error address 0000000037ff07f8
Error uncorrected

To avoid these errors, disable TLB reloading:

1. Reboot the server and press the F2 key to enter BIOS setup.

2. Navigate to the Advanced > Chipset Configuration BIOS menu.

3. Use the arrow keys to scroll down to the option No Spec. TLB reload and change its setting from Disabled to Enabled.

This disables TLB reloading and eliminates the error message.

2.1.19 SLES 9 (64-bit) in Failsafe Mode Does Not Install on Sun Fire V40z Dual-Core

SUSE Linux Enterprise Server 9 64-bit currently does not work with the boot option of maxcpus=0, the default option for failsafe mode for the Sun Fire V40z dual-core.

Workaround

After the BIOS finishes booting, a graphical boot screen appears with three options: Linux, Floppy, or Failsafe.

1. Select “Failsafe”.

2. Click in the small text edit box below the options.

3. Scroll to the end of the line.

4. Edit the text: change from “maxcpus=0 3” to “maxcpus=3”.

Do not remove the number “3”.

2.1.20 Installing RHEL4 FCS (32-bit) on Sun Fire V40z (Chassis 380-1010) With NSV 2.4.0.6 Requires Disabling HPET

To install Red Hat Enterprise Linux 4 FCS (32-bit) on Sun Fire V40z (chassis 380-1010) with NSV 2.4.0.6, the High Precision Event Timer (HPET) must be disabled in the BIOS. This modification is not necessary with RHEL4 Update 1.

Workaround

To disable the HPET in the BIOS:

1. Press the F2 key to enter the BIOS Setup Utility.

2. In the Advanced menu, select the HPET Timer option.

3. Change the value to Disabled.

4. Press the F10 key to save your changes.


2.2 Solaris Operating System Issues

This section lists issues and considerations regarding Sun Fire V20z and Sun Fire V40z servers using the Solaris operating system. For all items, check the product web sites for future enhancements.

The first compatible version of the Solaris OS is Solaris 9 OS 4/04 or later for the Sun Fire V20z Server and Solaris 9 OS HW 4/04 for the Sun Fire V40z Server. However, certain functionality might be phased in after the initial product release of the server:



Note - The Sun Fire V40z servers have only the DVD/diskette drive as an option.


http://www.sun.com/servers/entry/v20z/downloads.html
http://www.sun.com/servers/entry/v40z/downloads.html

2.2.1 Create LSI IM Volumes Before Installing the OS

To prevent data corruption on the Solaris ZFS operating systems, create IM volumes before you install the OS. To create IM volumes, use the LSI Configuration Utility.

2.2.2 Solaris 10 Telnet Vulnerability Patch

Solaris 10 6/06 and 11/06 security vulnerability in the in.telnetd(1M) daemon might allow unauthorized remote users to gain access to a Solaris Host. Patch# 120069-02 addresses this issue. Apply the patch manually or run the install.sh script on all Solaris 10 6/06 and 11/06 distributions. For more information, please refer to Sun Security Alert 102802. This patch will be incorporated into the Solaris preinstall image at a later date.

2.2.3 GUI-Based Solaris 10 1/06 OS Installation Fails on Systems Fully Populated With 4 Gbyte DIMMS

Solaris 10 1/06 OS GUI-based installation fails on Sun Fire V40z servers with 64 Gbyte of memory installed (fully populated with 4 Gbyte DIMMs).

Workaround

The installation does not fail if you use the console text-mode installation.

2.2.4 Solaris 9 OS Limited to 32 Gbyte Memory

Solaris 9 OS is a 32-bit OS and is limited to 32 Gbyte of memory. However, addressing 32 Gbyte of physical memory requires a large portion of the 32-bit address space. Applications might not have access to sufficient physical memory.

Workaround

Sun recommends running the Solaris 10 OS for applications requiring large amounts of physical memory.

2.2.5 Preinstalled Solaris 10 OS

Some versions of the Sun Fire V20z and Sun Fire V40z Servers ship with a preinstalled version of the Solaris 10 OS.

If you want to remove the preinstalled version of the Solaris 10 OS from your server, you can simply overwrite it by installing a version of the Linux OS. During the Linux installation process, a warning message that begins as follows might display:

Warning. Unable to align partition properly.

Incorrect partition labels from the preinstalled Solaris 10 OS cause this message, but you can safely ignore it. The error is corrected after a Linux installer changes the partition table.


2.3 Sun Installation Assistant Issues

An operating system can be installed on the server without configuring the service processor or the network share volume (NSV) software. However, if you choose to skip configuration of the service processor and the NSV software, you will not be able to use the remote management capabilities of the system or the diagnostics.

2.3.1 Sun Installation Assistant CD-ROM

The Sun Installation Assistant CD-ROM helps you to install a supported Linux operating system (OS). It provides a set of Sun-supported drivers that are tested for quality assurance. By using the Sun Installation Assistant CD, you can install the operating system, the appropriate drivers, and additional software on your system. The Assistant eliminates the need to create a Drivers Update diskette.

The Sun Installation Assistant for Sun Fire V20z and Sun Fire V40z Servers CD-ROM might be included in your accessory kit.

You can download the .iso image for this CD-ROM from the product web sites at:

http://www.sun.com/servers/entry/v20z/downloads.jsp

http://www.sun.com/servers/entry/v40z/downloads.jsp



Note - Sun encourages customers to visit the Downloads web site. When the Sun Installation Assistant is updated to support additional versions of the Linux OS, an updated .iso image will be posted to this site.


Information about using the Sun Installation Assistant CD-ROM can be found in Chapter 2 of the current version of the Sun Fire V20z and Sun Fire V40z Servers--Linux Operating System Installation Guide.

2.3.2 Linux Versions Supported by the Sun Installation Assistant (SIA)

At the time of this printing, the following versions of the Linux OS are supported by the Sun Installation Assistant:


TABLE 2-1 Linux OS Versions Supported by the Sun Installation Assistant

Linux Distribution

Version Supported

32-bit

64-bit

Single-Core Systems

Red Hat Enterprise Linux 3, WS/ES/AS

Updates 1 - 4

32-bit

Red Hat Enterprise Linux 3, WS/AS

Updates 1 - 4

64-bit

Red Hat Enterprise Linux 4, WS/ES/AS

 

32-bit

Red Hat Enterprise Linux 4, WS/ES/AS

Update 1

32-bit, 64-bit

SUSE Linux Enterprise Server 9 (SLES9)

SP 1, SP2

64-bit

SUSE Linux Enterprise Server 8

SP 3

64-bit

Dual-Core Systems

Red Hat Enterprise Linux 4, AS/ES/WS

Update 1

32-bit, 64-bit

SUSE Linux Enterprise Server 9 (SLES9)

SP 1, SP2

64-bit




Note - Sun Installation Assistant does not support Red Hat Enterprise Linux 3 Update 8 and SUSE Linux Enterprise 10. These operating systems can be installed directly without using SIA.


2.3.3 Using Logical Volume Manager With the Sun Installation Assistant

The Sun Installation Assistant does not support using Logical Volume Manager (LVM) with Red Hat Enterprise Linux 3 and updates or SUSE Linux Enterprise Server 8 or 9 and service packs. The current release available from the download site supports LVM with Red Hat Enterprise Linux 4. Future releases will support LVM with SUSE Linux Enterprise Server 9 and service packs.

2.3.4 Platform Drivers

If you do not install the platform drivers and a correctable ECC error occurs, duplicate error messages of the most recent ECC failure are reported indefinitely.

To avoid this issue, ensure that you install the appropriate level of platform drivers on your server. For more information, refer to the Sun Fire V20z and Sun Fire V40z Servers--Installation Guide.

2.3.5 LSI Driver



Note - This issue is resolved in the BIOS update within NSV 2.2.0.6h and newer releases.


If you installed the SUSE Linux Professional 9.0 or SUSE Linux Enterprise Server 8 (SLES8) OS on your server, and are running the LSI driver version 2.05.11 and firmware version 1.03.15, you might encounter performance issues on the internal hard disk drives (HDDs).

Sun recommends updating to LSI driver version 2.05.16 and firmware version 1.03.23. You can use these versions of the driver and firmware for all supported operating systems.

2.3.6 OpenIPMI Linux Kernel Driver Conflict During Diskette Access



Note - This issue is resolved in NSV versions 2.2.0.6h and higher.


If you are using in-band IPMI functionality with your server, you must unload the OpenIPMI Linux kernel driver before accessing a diskette (floppy disk). If you do not unload the OpenIPMI Linux kernel driver before you access a diskette, diskette writes and management data that is handled by the OpenIPMI Linux kernel driver will be corrupted.

1. To unload the OpenIPMI Linux kernel driver, authenticate as root, then enter the following commands:

$ rmmod ipmi_kcs_drv
$ rmmod ipmi_devintf
$ rmmod ipmi_msghandler

2. After diskette access is complete, restore the in-band IPMI functionality by entering the following commands:

$ modprobe ipmi_devintf
$ modprobe ipmi_kcs_drv


2.4 Diagnostics Issues

The most current version of diags contains multiple bug fixes and is available at the following URLs:

http://www.sun.com/downloads

http://www.sun.com/servers/entry/v20z/support.jsp

http://www.sun.com/servers/entry/v40z/support.xml

2.4.1 Empty README and RESCUECD Files

In previous versions of the Diagnostics CD, the README and RESCUECD files served two purposes:

In current and future versions of CD-based diagnostics, these files no longer need to contain user-related information. However, the diagnostics kernel still requires the files to verify the match between the kernel and diagnostics application. As a result the files remain intact, but have no content.

2.4.2 CD-Based Diagnostics and Remote Access

Remote access requires the prior creation of a manager-level user on the platform. See "Create Trusted Host Relationships," in the Systems Management Guide for instructions. To establish a remote command-line interface for CD-based diagnostics tests, use SSH network access:

1. SSH to the platform IP address as the user, setup. If you created a manager-level user on the Service Processor, you are prompted for a username and password to create a new account. You can use any username except: diagUser, setup, or root. When your username and password validate, you are logged off of the system.

2. Now use your username and password to SSH to the platform.

3. To enable only platform diagnostics tests without loading the Service Processor tests, execute the command:

diags start -n



Note - For Service Processor-based diagnostics, the -n argument specifies: Do not boot the platform with diagnostics.


-or-

To enable both Service Processor and platform diagnostic tests, execute the command:

diags start

This command reboots the platform into diagnostics mode. Wait at least two or three minutes before you attempt to run the tests.

-or-

Implement one of the following in shell or Perl:

diags start
sleep 240
rc = diags get state
if (rc ==0)
then
# run desired tests using diags run tests command
else
echo "Diagnostics not loaded in expected time. rc = $rc"
fi

-or-

rc = diags get state
timer = 0
while (rc == 25 (device error)) and (timer < MAX_WAIT)
do

sleep SLEEP_TIME
timer=time+SLEEP_TIME
rc = diags get state
done

if (timer < MAX_WAIT)
then
# run desired tests using diags run tests command
else

echo "Error loading platform diagnostics. rc = $rc"
fi

4. To determine if the diagnostics tests are available to run, you can execute the command:

diags get state

The command returns either a success message:

  The Service Processor and the platform diagnostics systems are   available to receive test requests.

Or an error message:

  The platform diagnostics system is not available.

end
if re == 0
diags run tests -a

2.4.3 Diagnostic Tests Fail to Load

In some releases, after you’ve entered the command to execute diagnostics (diags start) some diagnostic tests might fail to load. You can check on the status of the diagnostics by entering the following command:

diags get state

If the diagnostics have loaded, the above command will return the following message:

The SP and Platform Diagnostics systems are available to receive test requests.

If the diagnostics haven not loaded, the above command will return the following message:

The Platform Diagnostics system is not available.

To resolve this issue, install NSV 2.4.0.6a.

2.4.4 Diagnostic Fan Tests are Platform Power Dependent

Platform state changes that are made after diags has been started night not be detected by fan tests, which are dependent on platform power. If you use service processor-based, non-platform mode “diags start -n,” set the desired platform state before you load diags.

2.4.5 Fan 10 Falsely Reports During Reboot Cycle

The sensor for Fan 10 reports 0 RPM during the reboot cycle, but then quickly reports a return to normal operation.

2.4.6 Packet Corruption Might Cause Failure of Diagnostic Download

Packet corruption might happen during diagnostics download. If this happens, platform-side diags never comes up. To resolve the issue, follow this procedure:

1. Stop all diagnostics by entering the command:

$ diags terminate

2. Verify that the server power is off by entering the command:

$ platform get power state

3. Start diagnostics by entering the command:

$ diags start

4. Repeatedly check the status of diags by entering:

$ diags get state

If the problem persists, call Sun Service for further assistance.

2.4.7 diags Fan Tests

Any platform state changes made after the diags command has been started might not be detected by fan tests, which are dependent on platform power. If you use Service-Processor-based, no-platform mode diags start -n, set the desired platform state before you load diagnostics.

2.4.8 diags SSH Connection and nic Test

Users might lose SSH connection with the platform when the retention.allDimms test runs. Diagnostics continues to run, but the user cannot SSH into the platform after the connection is lost. To avoid this problem, if you use an SSH connection, do not run the nic test.

2.4.9 Firmware Downgrades

Generally, downgrading firmware to a version lower than the version shipped with the machine is not supported.

2.4.10 sp snmp community Commands

NSV releases 2.4.0.6 and later support the following SP commands:

2.4.10.1 sp set snmp community

This command enables you to set the community name to be used by the service processor (SP), itself, as opposed to the proxy community string that is used between the SP and the platform.

There are no restrictions on the length of the community string. Typical names are “private” and “public.” The factory default name of the community string is “public,” so if you run the command sp get snmp community before you set a value, “public” is returned. Set the value to any string without spaces.
For example, for:

$ sp set snmp community COMMUNITY_STRING

Successful output is:

localhost# sp set snmp community private

2.4.10.2 sp get snmp community

This command returns the community name that is currently being used by the service processor. For example, with command:

$ sp get snmp community

Successful output is:

localhost# sp get snmp community public

2.4.11 Incorrect Install Date Returned With inventory get software Command

In rare cases, when the user enters the inventory get software command, the wrong date is displayed for the latest revision of the server diagnostics software. This does not affect functionality or performance of the product and can be safely ignored.

For example:

$ inventory get software
Name				Revision			Install Date							Description
Diagnostics				V2.3.0.9			Thu Feb 17 19:17:16 1938							Server Diagnostics
BIOS-V20z				V1.33.5.2			Mon Jun 06 14:00:11 2005							Platform BIOS for V20z servers
Operator Panel 				V1.0.1.2			Mon Jun 06 13:43:06 2005							Operator Panel Firmware
SP Value-Add				V2.3.0.11			Mon May 16 14:36:32 2005							SP Value-Add Software
SP Base				V2.3.0.11			Mon May 16 14:36:32 2005							SP Base Software
$
Workaround

If you need to access the correct install date for the server diagnostics software, perform the following command sequence:

1. Revert to the default settings of the SP by entering.

$ sp reset to default-settings -a

2. Wait for the SP to reboot, then create a manager account by connecting to the IP address of the SP.

$ ssh -l setup IP-ADDRESS

IP-ADDRESS The IP address of the SP when it comes back online.

3. Follow the prompts to create a manager account.

4. Log in to the manager account you just created.

5. Mount the SP containing the Server Diagnostics software.

6. Verify your access to the latest revision dates of the installed software.

$ inventory get software

The correct installation dates for the latest software revisions are now displayed.

For more information about creating a manager account or mounting the SP, see the Sun Fire V20z and Sun Fire V40z Servers--Installation Guide.

2.4.12 sp get tdulog and sp ssh1 Commands

The sp get tdulog command has been enhanced and the sp ssh1 commands have been added. For details, see the Sun Fire V20z and Sun Fire V40z Servers Server Management Guide.

2.4.13 Do Not Access the SP While Diagnostics Are Loaded



Note - This issue appears in NSV versions 2.1.0.16 and earlier. It is resolved in NSV 2.2.0.6 and later versions.


While running diagnostics on your server, do not interact with the service processor (SP) through the command-line interface or IPMI.

The sensor commands cannot be used reliably while the diagnostics are running. Issuing sensor commands while diagnostics are loaded might result in “false” or erroneous critical events being logged in the events log. The values returned by the sensors are not reliable in this case.

2.4.14 diags terminate Command Generates “False” Critical Events



Note - This issue appears in NSV versions 2.1.0.16 and earlier. It is resolved in NSV 2.2.0.6 and later versions.


Terminating the diagnostics generates critical events on the sensors and system errors. After the diagnostics are terminated and the platform is powered off (the diags terminate command does this automatically), Sun recommends that you clear these events from the event log so that you do not mistake them for actual critical events.

The diagnostics provided with the Sun Fire V20z or Sun Fire V40z Servers are designed for a user who is watching the screen, or for the output to be saved to a file.

TABLE 2-2 and TABLE 2-3 (generated by the SP command sp get events) show the events generated when you run the command diags terminate on your server. You can ignore all of the “critical” errors.


TABLE 2-2 Sample Event Log for a Sun Fire V20z Server
ID
Last Update
Component
Severity
Message
50
11/04/2004 16:49
Platform
informational
The platform has been powered off
51
11/04/2004 16:49
planar
critical
Sensor CPU VDDA voltage reports 0.00 Volts but should be between 2.25 and 2.75 Volts
52
11/04/2004 16:49
ps
critical
Sensor Bulk 12v supply voltage (cpu0) reports 1.33 Volts but should be between 10.20 and 13.80 Volts
53
11/04/2004 16:49
cpu0.vrm
informational
Sensor CPU 0 core voltage reports 0.76 Volts and has returned to normal
54
11/04/2004 16:49
cpu0.memvrm
critical
Sensor CPU 0 VDDIO voltage reports 0.00 Volts but should be between 2.25 and 2.75 Volts
55
11/04/2004 16:49
cpu1.vrm
informational
Sensor CPU 1 core voltage reports 0.76 Volts and has returned to normal
56
11/04/2004 16:49
cpu1.memvrm
critical
Sensor CPU 1 VDDIO voltage reports 0.00 Volts but should be between 2.25 and 2.75 Volts
<Log file continues...>
 
 
 


TABLE 2-3 Sample Event Log for a Sun Fire V40z Server
ID
Last Update
Component
Severity
Message
1
11/04/2004 14:31
ps1
informational
Power supply 1 has resumed normal operation 
2
11/04/2004 14:31
ps2
informational
Power supply 2 has resumed normal operation 
3
11/04/2004 15:03
planar
critical
Sensor Bulk 1.8V S0 voltage reports 0.00 Volts but should be between 1.62 and 1.98 Volts 
4
11/04/2004 15:03
cpuplanar
critical
Sensor CPU 2 VDDA (2.5V) S0 voltage reports 0.00 Volts but should be between 2.25 and 2.75 Volts 
5
11/04/2004 15:03
ps1
critical
Sensor Bulk 12V S0 voltage at CPU 0 reports 0.01 Volts but should be between 10.20 and 13.80 Volts 
6
11/04/2004 15:03
ps2
critical
Sensor Bulk 12V S0 voltage at CPU 0 reports 0.01 Volts but should be between 10.20 and 13.80 Volts 
7
11/04/2004 15:03
fan5
critical
Sensor Fan 5 measured speed reports 60 RPM but should be greater than 2000 RPM 
8
11/04/2004 15:03
fan6
critical
Sensor Fan 6 measured speed reports 60 RPM but should be greater than 2000 RPM 
<Log file continues...>
 
 
 

Workaround

The following steps provide a workaround to clear the false critical events from the event log.



Note - For a complete list of the SP commands, refer to the Sun Fire V20z and Sun Fire V40z Servers--Server Management Guide.


1. Before running the diagnostics, clear the SP events log.

2. Start the diagnostics.

Wait for the diagnostics to come up.

3. Run any or all of the diagnostics tests.

4. Check the SP event log for any errors.

The event log can be stored away for future investigation.

5. Terminate the diagnostics with the command diags terminate.

6. Clear the SP event log.

This step eliminates all of the false critical events that were generated in the previous step.



Note - For more information about clearing the event log, enter the command:
sp delete event --help


2.4.15 Other Diagnostics Issues

mount : Mounting /dev/fd0 on /mnt/floppy failed. No such device.

You can safely ignore this error message.



Note - The floppy-disk drive issue is resolved in the diagnostics included with NSV 2.2.0.6h and later releases.