C H A P T E R  4

Dynamic Reconfiguration on Sun Fire Midrange Systems

This chapter describes major issues related to dynamic reconfiguration (DR) on Sun Fire midrange (E6900/E4900/6800/4810/4800/3800) systems running Solaris 9 9/05 software. It includes the following topics:


General Information

This section contains general information about DR on Sun Fire midrange systems.

Minimum SC Firmware

TABLE 4-1 shows acceptable combinations of Solaris 9 software and SC firmware for each Sun Fire midrange system to run DR. If the platform listed in the first column is running the Solaris release shown in the second column, the minimum SC firmware release is on that same line in the third column.



Note - To take advantage of the latest firmware features and bug fixes, run the most recent SC firmware on your Sun Fire midrange system. For the latest patch information, see http://sunsolve.sun.com




TABLE 4-1 Minimum SC Firmware for Each Platform/Solaris Release

Platform

Solaris Release

Minimum SC Firmware

E6900/E4900

Support begins with Solaris 9 4/04

5.16.0

6800/4810/4800/3800

Solaris 9 4/04

5.14.4

6800/4810/4800/3800

Solaris 9

5.12.6


System Board Classes

The cfgadm command shows Sun Fire midrange server system boards as class "sbd" and CompactPCI (cPCI) cards as class "pci."

To view the classes that are associated with attachment points, run the following command as superuser:

# cfgadm -s "cols=ap_id:class"

To also list the dynamic attachment points and their classes, add the cfgadm command's -a option as an argument to the preceding command.

Sun Management Center

For information about using Sun Management Center (Sun MC) with your Sun Fire midrange system, refer to the Sun Management Center Supplement for Sun Fire Midrange Systems.

Upgrading System Firmware

You can upgrade the system firmware for your Sun Fire midrange system through connection with an FTP or HTTP server where the firmware images are stored. For more information, refer to the README and Install.info files included in the firmware release(s) running on your domains. You can download Sun patches from http://sunsolve.sun.com.


Known DR Software Bugs

This section lists important DR bugs.

cfgadm_sbd Plug-In Signal Handling Is Broken (BugID 4498600)

Description: Sending a catchable signal, such as SIGINT sent by CTRL-C, to one or more cfgadm instances can cause those instances to hang. The problem is more likely to occur when multiple cfgadm processes are running, and can affect cfgadm instances on system boards, processors, I/O boards, and PCI slot attachment points. The problem has not been observed with a SIGKILL, and does not affect cfgadm status commands.

Workaround: None. To avoid this bug, do not send a catchable signal to a cfgadm process invoked to change the state of a component; for example, one executed with its -c or -x option.

SBM Sometimes Causes System Panic During DR Operations (Bug ID 4506562)

Description: A panic might occur when a system board that contains CPUs is removed from the system while Solaris Bandwidth Manager (SBM) is in use.

Workaround: Do not install SBM on systems that will be used for DR, and do not perform CPU system board DR operations on systems with SBM installed.

DR Hangs During Configure Operation with IB Board with vxdmpadm policy=check_all (Bug ID 4509462)

Description: A DR configure operation hangs with an IBx (I/O) board after a few successful iterations. This occurs when the DR operation is executed concurrently with the DMP daemon that is implementing the policy check_all with a time interval.

Workaround: Install VM 3.2 Patch 01.

Cannot DR Out cPCI IB with P0 Disabled (BugID 4798990)

Description: On Sun Fire midrange systems, a compact PCI (cPCI) I/O board cannot be unconfigured when Port 0 (P0) on that board is disabled. This problem exists only on systems running Solaris 9 or Solaris 8 software with PatchID 108528-23. It occurs only during DR operations that involve cPCI boards, and displays an error message similar to the following:

# cfgadm -c unconfigure NO.IB7
cfgadm: Hardware specific failure: unconfigure N0.IB7: Device
 busy:/ssm@0,0/pci@1b,700000/pci@1

where NO.IB7 is a compact PCI I/O board with P0 disabled.

Workaround: If you do not need to disable P0 itself, disable its slots, instead.

panic: mp_cpu_quiesce: cpu_thread != cpu_idle_thread (BugID 4873353)

Description: If a processor is transitioned from the powered-off to the off-line state with psradm(1M), a subsequent DR unconfigure operation on this processor can result in a system panic.

Workaround: Do not use psradm(1M) to offline a processor that is in the powered-off state.