C H A P T E R  2

LSI Firmware, Service Processor, and BIOS Issues

This chapter describes the LSI firmware, Sun Integrated Lights Out Manager (ILOM) Service Processor, and BIOS issues related to the Sun Fire X4100 and Sun Fire X4200 servers. It includes these topics:


LSI Firmware Issues

Resolved Issues

RAID Volume Disk Space Requirement for Metadata (6312581)

To create a RAID volume, the firmware and BIOS must write metadata at the end of the volume. Therefore, allow at least 64 MB of unpartitioned disk space for the metadata. If you are using the preinstalled Solaris operating system, more than 64 MB of space is unpartitioned for metadata.

HDD Resynchronization Completion is Indicated By Optimal Status in LSI Firmware Version 1.08 (6389986)

If you are using LSI firmware Version 1.08 or later, the resynchronization progress indicator might stay at 0%, even though the resynchronization is happening. The resynchronization is complete when Optimal is displayed as the status.

Workaround

None. This is expected behavior in LSI firmware Version 1.08 or later.


ILOM and Service Processor (SP) Issues

Current Issues

Updating to Current LSI Firmware and BIOS Requires ILOM 2.x or Higher (6947881)

If you do not have ILOM 2.x or higher, you will not be able to update LSI firmware and BIOS.

Workaround

First upgrade firmware to ILOM 2.x and then upgrade the LSI firmware.

Incorrect Power Supply Events After Service Processor Reset (6499282)

After the service processor is reset or the system power is cycled, incorrect power supply events may appear in the System Event Log. Here are some examples:

6500 | 11/10/2006 | 16:21:21 | Power Supply ps1.vinok | State Asserted
6600 | 11/10/2006 | 16:21:22 | Entity Presence ps1.prsnt | Device Present
6700 | 11/10/2006 | 16:21:23 | Power Supply ps0.pwrok | State Asserted
6800 | 11/10/2006 | 16:21:24 | Power Supply ps0.vinok | State Asserted
6900 | 11/10/2006 | 16:21:26 | Entity Presence ps0.prsnt | Device Present
6a00 | 11/10/2006 | 16:21:26 | Power Supply ps1.pwrok | State Asserted
Workaround

Power supply events that occur immediately after a Service Processor reset can be safely ignored.

Service Processor Fails to Log Event or Provide Visual Alert After HDD is Removed (6306536)

If an HDD is removed from a system, the service processor neither logs an event nor provides a visual alert. This is expected behavior because the service processor does not receive events from the LSI SAS controller when an HDD is hot-plugged and visual alerts do not occur.

BIOS Labels Not Shown in FRU List for Some Interfaces

The server labels and reports NIC information (MAC address, interface label, Chip type) via the Windows OS, BIOS and Service Processor as shown below. Note that some of the BIOS labels read “Not shown in SP.” This occurs even when the NICs are present.

 
Physical MAC     Windows OS   Chip      BIOS Label   BIOS MAC   Service Processor
Net 0 38         LAC4         Nvidia    0            38         mb.net0.fru
Net 1 39         LAC3         Nvidia    1            39         Not Shown in SP
Net 2 3A         LAC          Intel MT  2            3A         mb.net1.fru
Net 3 3B         LAC2         Intel MT  3            3B         Not Shown in SP
Workaround

Ignore the “Not shown...” message.

SP Does Not Handle SNMP Traps (6396525)

The SP does not support SNMP traps for the SUN-PLATFORM-MIB.

Workaround

Use IPMI PET traps, a form of SNMP trap, to trap errors. You can implement them by configuring alert rules in the service processor, or by configuring PEF rules directly in IPMI. See the User’s Guide for the ILOM version on your server for more information.

Resolved Issues

Resetting SP Without Rebooting OS Disables Console (6673674)

(Fixed in Software 2.0.)

If you reset the service processor without rebooting the operating system, you are unable to open a system console with the command start /SP/console.

Powering On With A Single Power Supply Lights Failure LED (6645564)

(Fixed in Software 1.4.)

Powering on the system with only one power supply in place causes a power supply failure error to be asserted. The front panel Power Supply Failure LED is lit, and the sensor sys.psfail is set.

Workaround

None.

ILOM 1.1.1 Console Speed, 115200 baud, Not Working (6499189)

(Fixed in Software 1.3.)

ILOM now supports configurable baud rates other than 9600. However, the 115200 setting does not work.

Workaround

Use a baud rate other than 115200.

Sun Fire X4200 M2: Nonexistent Fans Shown in CLI (6433119)

(Fixed in Software 1.3)

The ILOM service processor may show extra F1 entries in the CLI that do not really exist.

Workaround

Ignore these entries.

Fan Speed is Nonzero if Fan is Removed or Stopped (6493168)

(Fixed in Software 1.3.)

The server contains no mechanism for detecting the removal or full stop of a fan, and therefore reports a low RPM in these situations.

Workaround

None.



caution icon Caution - If you see any indication that a fan is not performing properly, investigate the problem thoroughly to prevent system damage.


Incorrect Fan Speeds Recorded for ft0.fm1.f0 and ft0.fm2.f0 (6542869)

(Fixed in Software 1.3.)

CLI and WebGUI report incorrect fan speeds for ft0.fm1.f0 and ft0.fm2.f0. All other reported fan speeds are correct.

Workaround

Use IPMItool to check the fan speeds.

bp.locate.btn and fp.locate.btn Are Asserted When Locate LEDs Are Not Blinking (6482084)

(Fixed in Software 1.2.)

IPMI service processor may report incorrect locate button state in the SDR.

Workaround

Please use the IPMI sys.locate entry to determine the state of the locate button.

Sun N1 System Manager Does Not Recognize System (6522517)

(Fixed in Software 1.1.)

Servers running ILOM 1.1.1 firmware, or later, cannot be discovered by N1 System Manager (N1SM) as managed nodes.

Workaround


Note - This workaround only applies to N1SM 1.3.2. Earlier versions of N1SM must be upgraded to version 1.3.2 first.


1. Acquire root privileges.

2. Create a subdirectory called resources at the following location:

/opt/sun/n1gc/drvrs/

3. Within resources, create a properties file called galaxy.properties

4. Add the following property to the file:

modelNamePattern=product.+name\\s+=\\s+(SUN\\s+FIRE\\s+(X4\\d+00\\s*M2|X4100|X4200|X4500|X4600))|(GALAXY\\s+\\d+|THUMPER) 

5. Restart N1SM using the following commands:

# /opt/sun/n1gc/etc/n1sminit stop
# /opt/sun/n1gc/etc/n1sminit start

CLI fault_state Properties Do Not Reflect Actual Fault States (6479882)

(Fixed in Software 1.3.)

New fault_state properties added to the ILOM CLI are not fully functional. Avoid using them.

Workaround

Use the IPMI System Event Log (SEL) and detailed ILOM CLI properties to determine the fault state of the system or components.

ILOM 1.1.1 JavaRConsole Will Not Launch (6494290)

(Fixed in Software 1.3.)

Sometimes, the JavaRConsole fails to launch when the user clicks on the “Launch Redirection” link in the Web GUI.

Workarounds

Three workarounds are available:

Downloading the Java Web Start File

1. Right-click the “Launch Redirection” link and choose “Save link as...”

A file save dialog appears, offering to save a file with a .jnlp extension.

2. Save the file, being careful to note its name and location.

3. Execute the .jnlp file.

4. Select one of the following options to execute the file:

C:\path\javaws.exe javarconsole.jnlp

JavaRConsole Restart or Start Redirection Fails After AC Power Removed (6434210)

(Fixed in Software 1.1.)

If you are using JavaRConsole and the AC power is cut off from the system, the console fails to restart when you choose the Restart or Start redirection options.

Workaround

Wait until a Lost Connection popup window is displayed (indicating that the console has been disconnected), then choose the Restart or Start redirection option to restart JavaRConsole.

Serial Console Access Over SSH Freezes on start -script Command (6337909)

(Fixed in Software 1.0.)

While connecting to the SP CLI mode via SSH, the serial console connection might intermittently lock up when the start -script command is used to log in to the SP console.

Workaround

Use the SP start console command to connect to the SP console, rather than start -script command.

If the serial console connection locks up, exit the SSH session then try again using the SP start console command.

Serial Port Speed Setting Reverts to 9600 BPS After Exiting CLI Session (6298521)

(Fixed in Software 1.1.)

When you exit a CLI session, the serial port speed is reset to 9600 bps. This might cause the serial port to fail after you exit a CLI session if the speed was set to a value other than 9600 bps.

Network Port Does Not Operate at 10 Mbyte/sec (6302923)

(Fixed in Software 1.1.)

The service processor Ethernet port operates only at 100 Mbyte/sec. It does not operate at 10 Mbyte/sec.

ILOM Web GUI Displays Incorrect Thresholds (6316706)

(Fixed in Software 1.1.)

The ILOM web GUI might display incorrect threshold values for the temperature sensors, for example. ILOM might also display random values for non-threshold sensors.

SP and USB Interactions Interrupt the OS (6277725)

(Fixed in Software 1.1.)

There are several methods you can use to reset the service processor, for example:

Any of these methods could interrupt the system or cause it to hang because of the USB plug/unplug events that are initiated between the SP and the system.

Workaround

To ensure that the service processor is reset and a USB event does not hang the system, configure the OS with minimal or no USB support.

Do not reboot the system while the service processor is resetting itself or the system could hang. Instead, change the mouse mode to the desired state before booting.

Scheduling service processor resets to occur only when the system is off or in reset will also prevent any service processor interaction with the system.

Kernel Error During Reset Hangs Some System Components (6295154)

(Fixed in Software 1.1.)

During service processor reset, such as during flashing or mouse mode changes, an SP kernel error might occur that leaves the service processor reachable, but which hangs serial login, SSH, the web GUI, and CLI services.

Workaround

Reset the system using the remaining working service or by removing the system power.

JavaRConsole Takes Too Long to Notify User of Disconnection (6487885)

(Fixed in Software 2.1.)

The JavaRConsole fails to notify the user that it has been disconnected in a timely manner.

Workaround

After changing the ILOM network configuration or resetting the SP, exit from any browser or client applications, then manually restart the SP connections.

Break Key Does Not Work in Secure Shell (SSH) Session or from JavaRConsole (6306610)

Breaks are transmitted to the system serial port only from the serial management port. The Break key does not work when you enter ssh ~B in a secure shell (SSH) session, or in the JavaRConsole.

Workaround

None.

External Storage Redirection Error Messages Might Be Wrong

You can redirect remote storage devices to these servers by starting JavaRConsole from the ILOM web GUI (Remote Console -> Redirection). Because the ILOM has this capability, the operating system might display redirected USB storage devices as always being attached.

If redirection is disabled, however, you might see one of the following messages if you attempt to access those devices: Drive not ready or No media found. Starting and stopping storage redirection does not add or remove the virtual devices themselves, but affects only the media in those virtual devices.


SNMP Issues

Current Issues

SP Does Not Support SNMP Traps (6396525)

The service processor does not support SNMP traps for the SUN-PLATFORM-MIB.

Workaround

Use IPMI PET traps, a form of SNMP trap, to trap errors. You can implement them by configuring alert rules in the service processor, or by configuring PEF rules directly in IPMI. See the Integrated Lights Out Manager Administration Guide, 819-1160, for details.

Resolved Issues

Unimplemented SNMP Traps (6300437)

(Fixed in Software 1.1.)

The following traps are currently not supported:

Error Messages Are Unclear When Deleting an SNMP User (6284706)

(Fixed in Software 1.0.)

For example, you might see the message Target cannot be deleted when a user is being deleted soon after another activity.

Workaround

These error messages can be safely ignored. However, scripted commands might not succeed.

sysUpTime Values Might be Incorrect (6295609)

(Fixed in Software 1.1.)

For example, a system that has been assembled only a few days might show an uptime of 51 days.

Workaround

You can safely ignore these values.

SNMP Agent Does Not Support SETs (6255301)

The SNMP agent does not support SETs for the entity and Sun platform MIBs. This causes compliance tests involving SETs to fail.

Workaround

Use other services to perform the needed tasks.


BIOS Issues

Current Issues

NICs Are Removed from Boot-Priority List when Using Ctrl or Alt Commands to Enable the Option BIOS for Sun Cards (6462303)

This applies to the following PCI cards:

When you use the Ctrl-Q or Alt-E commands to enable the option BIOS for these PCI cards, the system removes the onboard Intel NICs from the boot-up priority list of the BIOS setup menu. In addition, if you use the Intel X4446A-Z NIC cards to access the installer, you see the following error message:

A disk read error occurred
Press ctrl + Alt + del to restart
Workaround

None.

HDD Order Changes in BIOS Settings After Installing or Removing HBA Card (6462303, 6450677, 6630185)

After removing or installing a supported host bus adapter (HBA) card, the HDD order in the BIOS might be changed. Therefore, you might not be able to boot the system off of an OS level from an internal HDD if the HDDs that are attached to an HBA are scanned first. The BIOS scans devices in ascending order (from low-PCI address to high-PCI address). This is a list of the devices:

1. PCIE Slot 0

2. PCIE Slot 1

3. PCIE Slot 3

4. PCIE Slot 4

5. PCIX Slot 2

6. On-board NIC

7. On-board SAS controller

8. USB

Refer to FIGURE 2-1 for the locations of the PCI slots.

FIGURE 2-1 Designation and Speeds of PCI Slots


Diagram showing the locations, designations, and speeds of the PCI slots on the motherboard.

Workaround

This is expected behavior. After installing or removing any supported HBA card that is connected to an external storage device, there are two possible workarounds to ensure that you can boot from your devices as desired.

Option 1: Disable option ROM scanning on all devices that do not need to PXE boot. This will allow the device you want to boot from to appear in the list. Use the following procedure.

1. Enter the BIOS Setup utility by pressing the F2 key while the system is booting up and performing POST.

2. On the BIOS Main Menu screen, select the PCIPnP tab to open the PCI/PnP Settings screen.

3. Change the fields to Disabled for those PCI cards or NICs that will not be PXE booted.

4. Press and release the right arrow key until the Exit menu screen is displayed.

5. Follow the instructions on the Exit menu screen to save your changes and exit the Setup utility.

 

Option 2: Manually set the BIOS boot order so that the devices that you want to PXE boot from are early enough in the boot order to be scanned before the option ROM space is exhausted. Use the following procedure:

1. Enter the BIOS Setup utility by pressing the F2 key while the system is booting and performing POST.

2. On the BIOS Main Menu screen, select the Boot tab to open the Boot menu main screen.

3. Select Hard Disk Drives from the list.

4. Change the selections for the boot devices to set the order of boot devices that you require.

5. Press and release the right arrow key until the Exit menu screen is displayed.

6. Follow the instructions on the Exit menu screen to save your changes and exit the Setup utility.

Option ROM Space for PXE Booting Exhausted Before All Devices are Scanned (6453144, 6403173, 6272514, 6393809, 6439856, 6462303)

The BIOS Option ROM is 128 KB. Of these 128 KB, approximately 80 KB are used by the VGA controller, the LSI controller, and the NIC. Approximately 48 KB remain for the option ROM.

Depending on the number of PCI devices that are installed and being used for PXE booting, the option ROM space can be exhausted before the on-board NICs, the SAS controller, or the PCI cards installed late in the boot order can be scanned by the BIOS. This is expected behavior.

Possible Effects:

The BIOS scans devices in ascending order (from low PCI address to high PCI address). For scanning priority, see HDD Order Changes in BIOS Settings After Installing or Removing HBA Card (6462303, 6450677, 6630185).

Refer to FIGURE 2-1 for the locations of the PCI slots.

Workarounds

There are two possible workarounds to ensure that you have enough option ROM space to PXE boot from your devices as desired.

Option 1: Disable option ROM scanning on all devices that do not need to PXE boot. This will preserve the option ROM space for the devices that you do want to PXE boot. Use the following procedure.

1. Enter the BIOS Setup utility by pressing the F2 key while the system is booting up and performing POST.

2. On the BIOS Main Menu screen, select the PCIPnP tab to open the PCI/PnP Settings screen.

3. Change the fields to Disabled for those PCI cards or NICs that will not be PXE booted.

4. Press and release the right arrow key until the Exit menu screen is displayed.

5. Follow the instructions on the Exit menu screen to save your changes and exit the Setup utility.

 

Option 2: Manually set the BIOS boot order so that the devices that you want to PXE boot from are early enough in the boot order to be scanned before the option ROM space is exhausted. Use the following procedure:

1. Enter the BIOS Setup utility by pressing the F2 key while the system is booting and performing POST.

2. On the BIOS Main Menu screen, select the Boot tab to open the Boot menu main screen.

3. Select Boot Device Priority, or select Hard Disk Drives from the list to change HDDs.

4. Change the selections for the boot devices or HDDs drives to set the required device order.

5. Press and release the right arrow key until the Exit menu screen is displayed.

6. Follow the instructions on the Exit menu screen to save your changes and exit the Setup utility.

Sixteen-Boot-Device Limitation Exceeded Before All Devices are Scanned (6268877, 6439856)

The server BIOS can scan up to 16 devices, but depending on what is installed in the server, this limit can be exceeded before all devices in the server can be scanned. For example, if you install an HBA with an attached disk array into a PCI slot that is scanned before the on-board LSI SAS controller, the internal HDDs might not be scanned if the HBA and its attached HDDs exceed the 16-device limit.

This is expected behavior. Devices in excess of the 16-device limit will be seen at the OS level, but are not scanned by the BIOS during POST.

Possible Effects:

The BIOS scans devices in ascending order (from low PCI address to high PCI address). For scanning priority, see HDD Order Changes in BIOS Settings After Installing or Removing HBA Card (6462303, 6450677, 6630185).

Refer to FIGURE 2-1 for the locations of the PCI slots.

Workarounds

Perform Option 1 so that the device appears in the list, then perform Option 2 to change the scanning order.

Option 1: Disable option ROM scanning on all devices that do not need to PXE boot. This will preserve the option ROM space for the devices that you do want to PXE boot. Use the following procedure.

1. Enter the BIOS Setup utility by pressing the F2 key while the system is booting up and performing POST.

2. On the BIOS Main Menu screen, select the PCIPnP tab to open the PCI/PnP Settings screen.

3. Change the fields to Disabled for those PCI cards or NICs that will not be PXE booted.

4. Press and release the right arrow key until the Exit menu screen is displayed.

5. Follow the instructions on the Exit menu screen to save your changes and exit the Setup utility.

Option 2: Manually set the BIOS boot order so that the devices that you want to PXE boot from are early enough in the boot order to be scanned before the option ROM space is exhausted. Use the following procedure:

1. Enter the BIOS Setup utility by pressing the F2 key while the system is booting and performing POST.

2. On the BIOS Main Menu screen, select the Boot tab to open the Boot menu main screen.

3. Select Boot Device Priority, or select Hard Disk Drives from the list to change HDDs.

4. Change the selections for the boot devices or HDDs to set the required device order.

5. Press and release the right arrow key until the Exit menu screen is displayed.

6. Follow the instructions on the Exit menu screen to save your changes and exit the Setup utility.

When Downgrading BIOS From Software 2.2 to 2.1, BIOS 81, Jumper is Needed to Clear RTC Settings If System Freezes (6768214)

If the system freezes when downgrading from BIOS software 2.2 to BIOS software 2.1 (0ABJX081), clearing the CMOS connector (J9) does not power on the system.

Workaround: You must use the jumper to reset the RTC settings in the case of a system freeze.

1. Shut down the server.

2. Disconnect the power cords from the server.

3. Remove the main cover of the server.

4. Locate the RTC reset connector(J31) on the motherboard.

5. Install the jumper to the RTC reset connector(J31) pins.

6. Wait 10 seconds, then remove the jumper.

7. Reinstall the main cover to the server.

8. Reconnect power cords to the server.

A Disk Swap in RAID1/RAID1E Causes an Error Message (6766054, 6765998)

RAID1/RAID1E was created on a system. Then RHEL 5.2/SLES 10 SP2 and the latest MPT drivers were installed. After a reboot and pulling out and reinserting an HDD, error information results in system messages. The system message and OS log shows the error:

mptsas: ioc0: ERROR - mptsas_test_unit_ready: fw_channel=0 fw_id=1: tur failed due to no device.

This does not affect RAID function.

Resolved Issues

POST Message Can be Safely Ignored: Bank Interleave Requested But Not Enabled (6464704)

This message appears during POST on a system that contains only one pair of single-rank DIMMs on one CPU. This type of configuration does not support bank interleaving.

Workaround

This message is for information purposes only and can be safely ignored.

BIOS Programming Of ProcODT/ODC Field Is Incorrect (6684970)

Processor ODT (termination) is incorrectly being programmed to 3 which is Reserved. This field [29:28] is found in the Output Driver Compensation Control Register and must be programmed as follows:

One DIMM per channel -- 75 ohms = 10b

Two or more DIMMs per channel -- 150 ohms = 01b

During Upgrade To Quad Core, System Does Not Display Console and Cannot Start SYS (6733354)

When upgrading the system from dual core to quad core, system does not display console and cannot start SYS during upgrade.


Accessibility Issues

Accessibility means removing barriers that can prevent people with disabilities from participating in substantial life activities, including the use of services, products, and information. Not only does providing access offer benefits for a wide range of users, but it is also a requirement in all current federal contracts under Section 508 of the Federal Rehabilitation Act. In the commercial sector, the Americans with Disabilities Act (ADA) calls for similar considerations when reasonably accommodating current and prospective employees.

The Sun Fire X4100/X4200 servers meet Section 508 accessibility requirements. However, the following accessibility issues have been noted in the SP-firmware GUI: