CHAPTER 3

Software Notes and Issues

This chapter describes software issues related to the Sun Blade X6220 server module. It includes the following subjects: General Software Issues, Solaris Issues, Linux Issues, Windows Issues, and VMware Issues.


General Software Issues

This section lists issues that are not specific to any operating system or apply to more than one operating system.

Some Arguments in suncfg Tool Cause System Hang (6653448)

If you use the following arguments with the suncfg tool, the system will hang:


-get_all
-get_ilom_network_config 
-set_ilom_password
-set_ilom_network_config

The suncfg tool will not be included in the SW 1.2 Tools and Drivers CD.

 


Solaris Issues

This section lists issues that are specific to the Solaris operating system.

Hot Swap Function for Some PEMs Does Not Work in Solaris 10 10/08 (6832875, 6809171, 6810312, 6810949)

During a hot-swap, if you insert any of these PEMs:

SG-(X)PCIE2FCGBE-E-Z, SG-(X)PCIE2FCGBE-Q-Z, or X7284A-Z

you might encounter configuration failure errors.

In the Solaris message file, you might see the following error message:


pcie: [ID 323744 kern.warning] WARNING: PCIE init err info failed BDF 0x8100

If you manually configure the PEM with the cfgadm command, it will return the following error message:


cfgadm: Hardware specific failure: configure failed

Workaround

Reboot Solaris 10 10/08 with the PEM installed.

Checksum Error in UDP Application on Solaris 10 10/08 With X7287A-Z and X1028A-Z PCI EMs (6832874, 6771690)

The UDP application may show an RCR L4_CSUM_ERROR in the Solaris message file.

An incorrect UDP checksum causes this error on Solaris 10 10/08 with X7287A-Z and X1028A-Z PCI EMs.

Workaround

This issue will be solved in the next Solaris release. For now, install patch 139570-05 or 138899-07 to fix this problem.

NIC Path Names Change if the X7284A-Z PCI EM Is Inserted Into Slot 1 (6705317)

After you hot-plug an X7284A-Z PCI EM into Slot 1 and reboot Solaris 10 5/08, the NIC path name of the second on-board NGE interface changes from nge1 to nge2.

If an X7287A-Z PCI EM is also in Slot 0, the NIC path names of the nxge interfaces might change from the original values of nxge0, nxge1, nxge2, and nxge3 to nxge4, nxge5, nxge6, and nxge7.

Workarounds

Or

FMA Error With an SG-(X)PCIE8SAS-EB-Z PCI EM (6717153)

After you hot-plug an SG-(X)PCIE8SAS-EB-Z PCI ExpressModule (PCI EM) into a Sun Blade X6220 server module with Solaris 10 5/08 installed, a message similar to the following is displayed:

 


Jun 23 14:07:12 nsgsh-dhcp-217 fmd: [ID 441519 daemon.error] SUNW-MSG-ID: SUNOS-8000-1L, TYPE: Defect, VER: 1, SEVERITY: Minor
Jun 23 14:07:12 nsgsh-dhcp-217 EVENT-TIME: Mon Jun 23 14:07:12 CST 2008
Jun 23 14:07:12 nsgsh-dhcp-217 PLATFORM: Sun Blade X6220 Server Module, CSN: 0111APO-0749BZ055C , HOSTNAME: nsgsh-dhcp-217
Jun 23 14:07:12 nsgsh-dhcp-217 SOURCE: eft, REV: 1.16
Jun 23 14:07:12 nsgsh-dhcp-217 EVENT-ID: 2e500640-3f7a-c7cc-d33e-d560f3b08735
Jun 23 14:07:12 nsgsh-dhcp-217 DESC: The EFT Diagnosis Engine encountered telemetry for which it is unable to produce a diagnosis. Refer to http://sun.com/msg/SUNOS-8000-1L for more information.
Jun 23 14:07:12 nsgsh-dhcp-217 AUTO-RESPONSE: Error reports from the component will be logged for examination by Sun.
Jun 23 14:07:12 nsgsh-dhcp-217 IMPACT: Automated diagnosis and response for these events will not occur.
Jun 23 14:07:12 nsgsh-dhcp-217 REC-ACTION: Run pkgchk -n SUNWfmd to ensure that fault management software is installed properly. Contact Sun for support. 

This message can be safely ignored. Later Solaris releases will eliminate this message.

Upgrade to BIOS 30 (or Higher) From BIOS 29 (or Lower) Causes Solaris NIC Path Change (6648545)

When you upgrade the Gemini BIOS from version 29 (or lower) to version 30 (or higher), the PCI bus number of the second nge interface changes from 7b to 7c in Solaris. This changes the instance number from nge1 to nge2 in /etc/path_to_inst.

After the BIOS upgrade, you must use nge2, rather than nge1, to refer to the second nge interface.
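To confirm which instance numbers Solaris has assigned after the upgrade, you can list the nge entries in /etc/path_to_inst. The following is a minimal sketch, assuming the usual three-field layout of that file (quoted physical path, instance number, quoted driver name). The function name nge_instances is illustrative only and is not part of Solaris.

```shell
# List the nge instance names recorded in a path_to_inst-style file.
# Assumes each entry has the form: "<physical path>" <instance> "<driver>"
nge_instances() {
    # $1: file to read (on a live system, /etc/path_to_inst)
    awk '$3 == "\"nge\"" { print "nge" $2 }' "$1"
}

# Example: nge_instances /etc/path_to_inst
```

If the output lists nge2 but not nge1, update any network configuration that still refers to nge1.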

Solaris OS Does Not Support Hot-insertion or Hot-removal of an Infiniband PCI EM (6424701)

When the Solaris OS is running, Infiniband PCI ExpressModules (PCI EMs) cannot be removed or installed.

Workaround

Shut down the Solaris OS gracefully before removing or inserting the Infiniband PCI EM.

PCI SERR is Logged When Solaris 10 Boots (6603801)

Every time the Solaris operating system boots, a PCI system error signal (SERR) is logged regarding the SAS3081E-S PCI-E-to-SAS adapter.

The error messages can be ignored.

Certificate Hostname Mismatch Causes Web Interface Services to Stop (6587000)

On systems running Solaris 10 11/06 (or later), the Web interface console services stop after the system is reconfigured.



Note - The Web interface consists of the Sun Java Web Console, and the Sun Java Web User Interface Components. The Console provides a common point for Sun web-based system management applications to be registered and accessed.


Starting with Solaris 10 11/06, the Web interface starts automatically as an smf service when the OS boots. When a Web interface instance is first created, a self-signed x.509 certificate is generated based on the machine hostname. The hostname is stored in the CN Relative Distinguished Name (RDN) of the Distinguished Name (DN) of the certificate.

For SSL exchanges to succeed, the hostname in the CN RDN of the DN of the certificate must be the same as the hostname of the system. If you run sys-unconfig on a system and change the hostname, the Web interface x.509 self-signed certificate remains associated with the previous hostname. Any SSL exchange between a client (for example, wcadmin) and the web server then fails because of the hostname mismatch in the x.509 certificate.

Workaround

Run the following command to remove the entire instance of the web interface from the OS.

/usr/share/webconsole/private/bin/wcremove -i console

Removing the Web interface console instance also deletes the x.509 certificate. The next time the Web interface console is started, a new certificate is generated based on the current hostname.
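The mismatch described above comes down to comparing the CN value in the certificate's DN with the current hostname. The following is a minimal sketch of that comparison, assuming you have already obtained the DN as a text string by whatever certificate inspection tool you use. The function names and the sample DN are illustrative and are not part of the Web interface software.

```shell
# Print the value of the CN RDN from a distinguished name such as
# "CN=blade1, OU=Example, O=Example" (illustrative format).
cn_of_dn() {
    printf '%s\n' "$1" | sed -n 's/^.*CN=\([^,]*\).*$/\1/p'
}

# Succeed only if the CN in the DN ($1) equals the host name ($2).
cn_matches_host() {
    [ "$(cn_of_dn "$1")" = "$2" ]
}

# Example:
#   cn_matches_host "CN=oldname, OU=Example" "$(hostname)" || echo "hostname mismatch"
```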

SunVTS 6.3 Might Encounter an Issue in Solaris 10 Update 3

Systest in SunVTS 6.3 might run into a known issue in the libmtsk.so library bundled with Solaris 10 Update 3.

Workaround

Install patch 120754-05 or later before running SunVTS 6.3 with Solaris 10 Update 3.

Opening a Kernel Debugging Session on the Default Console Causes a System Hang (6506791)

Invoking the Solaris kernel debugger with the command mdb -K -F might cause the system to hang at high IOPL if the console is set to text, which is the default setting.

Workaround

Set the console to ttya. This causes the system to transfer control of the debugger to the serial console port.

AMD Erratum 131 Warning Message During Solaris Startup (6447850)

Solaris AMD x64 support includes a boot-time check for the presence of a BIOS workaround for the AMD Opteron Erratum 131. If Solaris detects that the workaround for Erratum 131 is needed but it is not yet implemented, Solaris logs and displays the following warning message:


WARNING: BIOS microcode patch for AMD Athlon(tm) 64/Opteron(tm) processor erratum 131 was not detected; updating your system’s BIOS to a version containing this microcode patch is HIGHLY recommended or erroneous system operation may occur.

Workaround

The Sun Blade X6220 server module BIOS implements a superset workaround that includes the workaround required for Erratum 131. This warning message can be safely ignored.



Note - This issue was resolved in Solaris 10 5/08.



Linux Issues

This section lists issues that are specific to the Linux operating system.

nxge Gigabit Ethernet Drivers For X1028A-Z and X7287A-Z PCI EMs on RHEL5.3 (6832823)

The following RPM packages are provided in the given directories for RedHat and SuSE respectively:

RedHat

/usr/src/redhat/RPMS/x86_64/sun-nxge-1.0-1.x86_64.rpm

SuSE

/usr/src/packages/RPMS/x86_64/sun-nxge-1.0-1.x86_64.rpm

To load the nxge drivers for X1028A-Z and X7287A-Z PCI EMs:

1. From http://www.sun.com/download/products.xml?id=45a593ce, extract the nxge driver from the Sun_10_Gigabit_Ethernet_driver_update_12.zip file.

2. To install the nxge binaries, run the following command:

rpm -ivh sun-nxge-1.0-1.x86_64.rpm

The driver binary is installed as:

/lib/modules/<KERNEL VERSION>/kernel/drivers/net/nxge.ko

The config tool binary is installed as:

/usr/local/bin/nxge_config

The man page is installed as:

/usr/share/man/man7/nxge.7.gz

3. Load the module using modprobe:

# modprobe nxge

4. Add the nxge interfaces to the /etc/modprobe.conf file for loading at boot time.

alias <if_name> nxge

5. Use the ethtool command to check the properties of each interface:

ethtool -i <if_name>

6. Assign an IP address to the interface by entering the following:

ifconfig <if_name> <IP_address>

7. Verify that the interface works. Enter the following, where <IP_address> is the IP address for another machine on the same subnet as the interface that is being tested:

ping <IP_address>

Drivers Not Available for X1028A-Z and X7287A-Z PCI EMs for SLES 9 SP4 64-bit (6706014)

Drivers are not currently available for the X1028A-Z and X7287A-Z PCI ExpressModules (PCI EMs) if you are running SLES 9 SP4 64-bit.

Hot Swap of PCI EMs Might Not Work With RHEL 4 Update 6 64-bit (6721600)

When you hot-plug a PCI EM while running RHEL 4 Update 6 (both 32- and 64-bit), an "acpiphp_glue: _HPP evaluation failed" message is displayed. After several instances of this message, the hot swap can fail.

Workaround

If hot swap issues occur within this configuration, reboot the operating system to correct the problem.

Hot Plugging of a X7284A-Z PCI EM Does Not Work With SLES 9 SP4 64-bit (6706585)

If you are running SLES 9 SP4 64-bit, hot-plugging of the X7284A-Z PCI EM is not supported. You will need to power down the Sun Blade X6220 before installing the PCI EM.

Swap File Not Mounted for SLES 10 SP1 Under Xen FV (6655098)

When SUSE Linux Enterprise Server 10 SP1 is installed as a fully virtualized guest under Xen, the swap file might not be mounted automatically. This can cause application failure due to unavailable swap space.

Workaround

Modify the file system configuration as follows.

1. Edit /etc/fstab on the guest.

2. Change the following line:

/dev/disk/by-id/ata-QEMU_HARDDISK_QM00001-part1

to

/dev/hda1

3. Change the following line:

/dev/disk/by-id/ata-QEMU_HARDDISK_QM00001-part2

to

/dev/hda2

4. Save the file.

5. Reboot the guest operating system.

These changes should cause the swap file to be mounted automatically.
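The two substitutions in steps 2 and 3 can also be made with sed. The following is a minimal sketch, assuming the device names appear in /etc/fstab exactly as shown above. The function name rewrite_qemu_fstab is illustrative only.

```shell
# Rewrite the by-id device names from steps 2 and 3 to /dev/hda1 and /dev/hda2.
# Reads the fstab-style file named in $1 and writes the result to stdout,
# so the original file is left untouched until you choose to replace it.
rewrite_qemu_fstab() {
    sed -e 's|/dev/disk/by-id/ata-QEMU_HARDDISK_QM00001-part1|/dev/hda1|' \
        -e 's|/dev/disk/by-id/ata-QEMU_HARDDISK_QM00001-part2|/dev/hda2|' "$1"
}

# Example (run on the guest as root, after reviewing the output):
#   cp /etc/fstab /etc/fstab.bak
#   rewrite_qemu_fstab /etc/fstab.bak > /etc/fstab
```

Writing to stdout and working from a backup copy keeps the original file available if the result needs to be reverted.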

Onboard NIC Network Throughput is Less Than Expected With a Linux OS (6647098)

You might encounter less than the expected throughput of 125 MB/second with the onboard NVIDIA Gigabit Ethernet NIC on all supported Linux OSes.

Workaround

For Red Hat Linux:

1. In a text editor, open /etc/modprobe.conf

2. Add the line: options forcedeth max_interrupt_work=100



Note - This workaround only works on RHEL 5 U1. The workaround does not work for RHEL 4 U6. See CR6668885 for more information.


For SUSE Linux:

1. In a text editor, open: /etc/modprobe.conf.local

2. Add the line: options forcedeth max_interrupt_work=100
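Instead of editing the file by hand, you can append the option line from a shell. The following is a minimal sketch that adds the line only if it is not already present, so running it twice does not create a duplicate. The function name add_forcedeth_option is illustrative only.

```shell
# Append the forcedeth option line to a modprobe configuration file,
# unless an identical line is already present.
add_forcedeth_option() {
    # $1: /etc/modprobe.conf (Red Hat) or /etc/modprobe.conf.local (SUSE)
    line='options forcedeth max_interrupt_work=100'
    grep -qxF "$line" "$1" 2>/dev/null || printf '%s\n' "$line" >> "$1"
}

# Example (as root):
#   add_forcedeth_option /etc/modprobe.conf         # Red Hat
#   add_forcedeth_option /etc/modprobe.conf.local   # SUSE
```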

CF Access and Booting Not Supported for RHEL 5.0 (6639144)

RHEL 5.0 has bugs that might cause file system corruption during heavy I/O to the compact flash (CF). Do not use RHEL 5.0 to access the CF for I/O-intensive applications.

Workaround

Update your OS to RHEL 5.1.

NEM NIC Fails When Network Stress is Heavy With SLES10 SP1 and SLES9 SP3 (6610532)

If you are running an application on SLES10 SP1 or SLES9 SP3 that requires heavy network activity, the application might fail on the NVIDIA Gigabit Ethernet NEM NIC.

Workaround

Use a PCI EM Intel NIC instead of the NEM NIC. Part numbers for these PCI EMs are X7282A-Z or X7283A-Z.

Full Red Hat Enterprise Linux 4U4 Installation Might not Fit on the Compact Flash (6564755)

If you select the “Everything” option during Red Hat Enterprise Linux 4 (RHEL 4) U4 installation, the installation might require more space than is available on the compact flash (8GB).

Workaround

Deselect some packages during the installation so that the installation fits within the available storage.

RHEL 4U4, 4U5 and RHEL 5 Do Not Work With Root Partition Installed on LVM (6566104)

RHEL 4U4, 4U5, and RHEL 5 installation is not supported with a root partition on Logical Volume Manager (LVM) for a compact flash-based installation.

This installation will cause a kernel panic on bootup.

Workaround

Configure RHEL with root on a non-LVM partition.

During installation, choose manual partitioning using Disk Druid. Delete any existing partitions and create two new partitions: a 100 MB partition mounted as /boot, and a partition using the rest of the disk mounted as root (/).

Red Hat Enterprise Linux 5 Guest OS Might Hang Using Default Memory Allocation (6536456)

Some fully virtualized Red Hat Enterprise Linux 5 guests will hang when they are given 500 or 1000MB of memory.

Workaround

When setting up a fully virtualized RHEL 5 guest OS, allocate 512 MB or 1024 MB of memory rather than 500 MB or 1000 MB.

PCI Express Hotplug Requirement (6525667)

PCI Express hotplugging may not work on RHEL4 U4 and SLES10 operating systems.

Workaround

Execute the following command before hotplugging PCI ExpressModules:


# modprobe acpiphp

Linux Installation From CD Fails (6495764)

During the installation of a Linux or Windows operating system using a CD/DVD drive connected via USB, the following message might appear:


Insert CD 1 to drive is shown

Once the CD is inserted, the installation program might not recognize it.

Workaround

To avoid this issue, you must enable memory hole remapping in the BIOS setup as follows:

1. Power on or reboot the Sun Blade X6220 server module.

2. Press F2 when prompted to enter the BIOS Setup Utility.

3. Navigate to the Chipset menu.

4. Make the following selections in order:

a. NorthBridge Configuration

b. Memory Configuration

c. Memory Hole Remapping

5. Enable Memory Hole Remapping by pressing the + key until the value is set to Enabled.

6. Press F10 to save and exit the BIOS.

RHEL4 U4 Reboot Hangs Intermittently (6492128)

When a Sun Blade X6220 server module running the RHEL4 U4 operating system is rebooted, it could hang intermittently at different stages of reboot.

Workaround

Disable the ACPI2.0 objects in the BIOS by performing the following steps:

1. Select Advanced, CPU Configuration, ACPI2.0 objects, and then select the disable option.

2. Reboot the system and prevent kudzu from running by using the following command:


# chkconfig --levels 345 kudzu off

RHEL4 U4 x64 Cannot Find CD Media Right Before Mediacheck Option (6497631)

If you are installing RHEL4 U4 x64 version from CD media using an external USB CD-ROM drive, the operating system might report that it cannot find the CD media right before the media check dialog.

Workaround

Enable memory hole remapping in the BIOS Setup Utility before installing the operating system.

Bootup Warning Message While Initializing TSC (6507275)

When the Linux OS starts, the following console message might appear for every CPU/Core in the system. You can safely ignore this message.


BIOS BUG: CPU#0 improperly initialized, has 2 usecs TSC skew! FIXED

SLES 10 Installation Might Cause a Blank Screen (6507278)

When installing SLES 10 with the Web interface installation selected, you might receive a blank screen. This occurs because the monitor or LCD screen cannot handle the high refresh rates that the installer chooses by default.

Workaround

Choose one of the following workarounds:

Ethernet Device Reorders When Ethernet PCI EMs are Inserted (6507284)

When Ethernet PCI EMs are inserted and configured, Linux automatically reconfigures the device numbers. For example, eth0 would be renumbered to eth5 or eth4.

Workaround

You can avoid this issue by binding the device number to the Ethernet device MAC address. To do so, perform the following steps.

1. Type the following command to find the Ethernet device MAC address:


# ifconfig ethx 

Replace x with the corresponding numeral (for example, eth0).

2. Record the Ethernet device MAC address.

3. Edit the ifcfg file /etc/sysconfig/network-scripts/ifcfg-ethx as follows:

a. Open the file in a text editor. For example:


# vi /etc/sysconfig/network-scripts/ifcfg-eth0

b. Add the previously recorded MAC address.

HWADDR=XX:XX:XX:XX:XX:XX

For example, HWADDR=00:09:3D:00:23:8D

4. Save the file to make the modifications permanent.
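Steps 1 through 3 can be combined by extracting the MAC address from the ifconfig output directly. The following is a minimal sketch, assuming the classic ifconfig output format in which the first line for an interface contains "HWaddr" followed by the MAC address. The function name hwaddr_line is illustrative only.

```shell
# Read ifconfig output on stdin and print the matching HWADDR= line.
# Assumes the classic net-tools format, where the first line of each
# interface block contains "HWaddr XX:XX:XX:XX:XX:XX".
hwaddr_line() {
    sed -n 's/.*HWaddr \([0-9A-Fa-f:]\{17\}\).*/HWADDR=\1/p'
}

# Example (as root, for eth0):
#   ifconfig eth0 | hwaddr_line >> /etc/sysconfig/network-scripts/ifcfg-eth0
```

Before appending, check that the ifcfg file does not already contain an HWADDR entry, because this sketch does not remove or replace an existing one.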

Error Message Displayed in /var/log/messages (6514173, 6624282)

The following error message might display several times in /var/log/messages:


drivers/usb/input/hid-core.c: input irq status -71

The following message might also display in /var/log/messages after updating to the mptlinux-4.00.13.00-1 driver:


usr/src/packages/BUILD/mptlinux-4.00.13.00/drivers/message/fusion/mptctl.c::mptctl_ioctl() @636 - ioc1 not found!

These messages can be safely ignored.

Bootup Warning Message (6518362)

When the Sun Blade X6220 server module boots, the following message might appear on the screen.


Freeing initrd memory: 2470k freed not found

Workaround

This message can be safely ignored.

RHEL4 U4 Inaccessible Via the Service Processor (6492134)

RHEL4 U4 might not be accessible via the service processor (SP). This happens when the ILOM service processor does not display serial output from the Linux OS.

Workaround

You can avoid this issue by ensuring that the following conditions are met:

OFED1.1 Driver Does Not Compile on RHEL4 U4 and SLES 10 Operating Systems (6553612, 6529180, 6528151, 6528155)

The OFED1.1 Infiniband driver will not compile on RHEL4 U4 and SLES10 operating systems. Therefore, the Infiniband PEM (X1288A-Z) is not supported when using these operating systems. You might receive the following error messages when trying to compile the Infiniband driver:


ERROR: Failed to execute: make -C /lib/modules/2.6.16.21-0.8-smp/build SUBDIRS=/var/tmp/IBGD//tmp/openib/infiniband CONFIG_INFINIBAND=m CONFIG_INFINIBAND_MELLANOX_HCA=m CONFIG_INFINIBAND_IPOIB=m CONFIG_INFINIBAND_USER_CM=n CONFIG_INFINIBAND_SDP=n CONFIG_INFINIBAND_DAPL=n CONFIG_INFINIBAND_DAPL_SRV=n CONFIG_DAT=n CONFIG_INFINIBAND_KDAPL=n CONFIG_INFINIBAND_KDAPLTEST=n CONFIG_INFINIBAND_SRP=n CONFIG_INFINIBAND_SRP_TARGET=n KERNELRELEASE=2.6.16.21-0.8-smp EXTRAVERSION=.21-0.8-smp V=1 modules
See /var/tmp/IBGD//tmp/openib/build_kernel_modules.log for more details

Warning Message During RHEL4 U5 Bootup (6603550)

When booting a server module running RHEL4 U5, the following error message might appear a few times:


Uhhuh. NMI received for unknown reason 2c.
Dazed and confused, but trying to continue
Do you have a strange power saving mode enabled?

This message can be safely ignored.


Windows Issues

This section lists issues that are specific to the Windows operating system.

Preinstalled Windows Server 2003 R2

The Windows Server 2003 R2 operating system might be preinstalled on your system. For more information, see Windows Server 2003 R2 Operating System.

Cannot Restore Video When Booted Without Dongle/VGA Cable (6534613)

If Windows 2003 Server is booted with the dongle unplugged, or with the dongle plugged in but the VGA cable unplugged, video can only be restored by rebooting Windows. Windows requires that both the dongle and the VGA monitor cable be plugged in before it boots.

Workaround

Before booting Windows, make sure the dongle and the VGA monitor cable are connected to the server module (blade).


VMware Issues

This section lists issues that are specific to the VMware operating system.

VMware ESX 3.0.1 Does Not Support Ethernet Ports (6521550)

VMware ESX 3.0.1 does not support the onboard Ethernet interfaces because a device driver for them is not available.

Workaround

You must install the supported PCI ExpressModule for network interfaces to use or install ESX.

VMware ESX 3.0.1 Reverses Network Interface Numbers (6521559)

VMware ESX 3.0.1 numbers network interfaces differently than other operating systems. When a network PCI ExpressModule is installed, the system specifies interface 1 as vmnic0 and interface 0 as vmnic1.

For ESX to operate correctly, the system console must have network connectivity. By default, ESX assigns vmnic0 to the system console.

Workaround

You should ensure that network interface 1 is the top network interface, and that it is connected and operational.