Information About Software

This section describes specific software and firmware issues and workarounds. To obtain patches and to check for availability of new patches that fix these issues, go to:

http://sunsolve.sun.com


XCP Issues and Workarounds

TABLE 1 lists XCP issues and possible workarounds.


TABLE 1 XCP Issues and Workarounds

ID

Description

Workaround

6723305

The XSCF Web does not show the correct daylight saving time.

There is no workaround. Use the showlogs(8) command to determine the time during daylight saving time.

6741770

SNMP trap host configuration changes do not take effect until you run setsnmp disable and then setsnmp enable.

After modifying the SNMP settings, disable and then re-enable SNMP:

XSCF> setsnmp disable
XSCF> setsnmp enable

6756052

The panic log on the XSCF Web might not be displayed from the top of the message.

If the output is incomplete, execute the showlogs panic command on the XSCF shell.

6757614

The network configuration on the XSCF Web does not provide a function equivalent to the setnetwork -r option. In addition, if you specify localhost or localdomain as the host name or the domain name, the error message “SessionID has expired” appears.

Use the setnetwork -r command on the XSCF shell.

6761674

The first usage of the OBP command probe-scsi-all might not show all devices connected via the external SAS port.

Retry the probe-scsi-all command.

6765466

Information about the defined LSBs cannot be displayed on the XSCF Web “Domain Configuration” page with an account that has a single domain privilege larger than DID15. In addition, operations on such LSBs with the “XSB Config...” button cannot be performed.

Use the showdcl(8), addboard(8), deleteboard(8), and moveboard(8) commands on the XSCF shell.

6765468

When a time zone abbreviation that is not three characters long has been set, the error logs cannot be displayed on the XSCF Web “Error Log” page. In addition, the XSCF Web “Panic Log” and “IPL Message Log” pages display the date in the table as “---”.

Use the showlogs(8) command on the XSCF shell.

6767612

The monitor message log might not be registered when a PCI slot error is detected.

There is no workaround. Use the showlogs error command or the fmdump command to check the fault information of the PCI slot.

6789066

The showlogs command results in a segmentation error when the abbreviated name of the configured time zone is longer than 8 characters.

No workaround is available.

6778132

When using the replacefru(8) command to replace the XSCF unit, the replacement might fail with a DB synchronization timeout error in the log.

Turn off all mainline switches and then turn on again.

Alternatively, reset the XSCF with the rebootxscf(8) command and, immediately afterward, execute the replacefru(8) command, then extract and reinsert the XSCF unit. If too much time passes after the XSCF reset, the replacefru(8) command might fail again. In that case, re-execute the rebootxscf(8) command and repeat the subsequent steps.



Solaris OS Issues and Workarounds

This section contains information about Solaris OS issues. TABLE 2, TABLE 3, TABLE 4, and TABLE 5 list issues you might encounter, depending upon which Solaris OS release you are using.

Solaris Issues for All Supported Releases

TABLE 2 lists Solaris OS issues that you might encounter in any supported release of Solaris OS.


TABLE 2 Solaris OS Issues and Workarounds for All Supported Releases

CR ID

Description

Workaround

6440061

The domain console may display this message:

ipsec_check_inbound_policy: Policy Failure for the incoming packet (not secure)

This message can be safely ignored.

6449315

The Solaris OS cfgadm(1M) command does not unconfigure a DVD drive from a domain on an M8000/M9000 server.

Disable the Volume Management Daemon (vold) before unconfiguring a DVD drive with the cfgadm(1M) command. To disable vold, stop the daemon by issuing the command /etc/init.d/volmgt stop. After the device has been removed or inserted, restart the daemon by issuing the command /etc/init.d/volmgt start.

6459540

The DAT72 internal tape drive might time out during tape operations.

The device might also be identified by the system as a QIC drive.

Add the following definition to /kernel/drv/st.conf:

 

tape-config-list=
"SEAGATE DAT    DAT72-000",
"SEAGATE_DAT____DAT72-000",
"SEAGATE_DAT____DAT72-000";
SEAGATE_DAT____DAT72-000=1,0x34,0,0x9639,4,0x00,0x8c,0x8c,
0x8c,3;

 

There are four spaces between SEAGATE DAT and DAT72-000.
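Because the inquiry string depends on exactly four spaces, it is easy to mistype. The following is a minimal sketch, not part of the original notes, of a check that could be run after editing the file. The function name has_dat72_entry is hypothetical, and the check only looks for the quoted string; it does not validate the rest of the definition.

```shell
# has_dat72_entry FILE  (hypothetical helper name)
# Succeeds only if FILE contains the quoted inquiry string with exactly
# four spaces between "SEAGATE DAT" and "DAT72-000".
# The pattern has no regular-expression metacharacters, so plain grep
# treats it as a literal string.
has_dat72_entry() {
    grep '"SEAGATE DAT    DAT72-000"' "$1" >/dev/null 2>&1
}
```

For example, `has_dat72_entry /kernel/drv/st.conf || echo "DAT72 entry missing or misspaced"` would report a file in which the entry is absent or has the wrong number of spaces.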

6522017

Domains using the ZFS file system cannot use DR.

Reduce the maximum size of the ZFS ARC. For detailed assistance, contact your authorized service representative.

6531036

The error message network initialization failed appears repeatedly after a boot net installation.

There is no workaround.

6531668

The system hangs when executing a parallel hot-plug operation with SP DR in the suspend phase.

There is no workaround.

6532215

volfs or dscp services might fail when a domain is booted.

Restart the service. To avoid the problem, issue the following commands.

# svccfg -s dscp setprop start/timeout_seconds=count: 300
# svccfg -s volfs setprop start/timeout_seconds=count: 300
# svcadm refresh dscp
# svcadm refresh volfs

6535018

In Solaris domains that include SPARC64 VII processors, workloads that make heavy use of the Solaris kernel might not scale as expected when you increase the thread count to a value greater than 256.

This has been fixed in patch 137111-01.

 

For Solaris domains that include SPARC64 VII processors, limit domains to a maximum of 256 threads.

6565553,

6674266

DR deleteboard(8) and moveboard(8) operations might fail.

Example of messages on the domain:

drmach: WARNING: Device driver failure: /pci
dcs: <xxxx> config_change_state: Hardware specific failure: unconfigure SB1: Device driver failure: /pci

There is no workaround. Try the DR operation again.

6572827

The prtdiag -v command reports PCI bus types incorrectly. It reports “PCI” for PCI-X leaf devices and “UNKN” for legacy PCI devices.

There is no workaround.

6575970

DR and XSCF failover are not compatible.

 

6588555

Resetting the XSCF during a DR operation on permanent memory might cause domain panic.

Do not start an XSCF reset while a DR operation is underway. Wait for the DR operation to complete before starting the reset.

6588650

On occasion, the system is unable to perform DR after an XSCF failover to or from the backup XSCF.

There is no workaround.

6589644

When XSCF switchover happens after the SB has been added using the addboard command, the console is no longer available.

The console can be recovered by pressing Ctrl-q (the “Ctrl” key and the “q” key).

6592302

Unsuccessful DR operation leaves memory partially configured.

It might be possible to recover by adding the board back to the domain with an addboard -d command.

6619224

For Solaris domains that include SPARC64 VII processors, a single domain of 256 threads or more might hang for an extended period of time under certain unusual situations. Upon recovery, the uptime command will show extremely high load averages.

This has been fixed in patch 137111-03.

For Solaris domains that include SPARC64 VII processors, do not exceed a domain size of 256 virtual processors in a single Solaris domain. This means a maximum of 32 CPUs in a single domain configuration (maximum configuration for an M8000 server).
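The relationship between 256 virtual processors and 32 CPUs follows from the processor layout. As a rough sketch, assuming four cores per SPARC64 VII chip and two threads per core (figures not stated in these notes), the count can be worked out as follows. The function name vcpu_count is hypothetical, and the arithmetic syntax requires a POSIX shell such as ksh.

```shell
# vcpu_count CPUS  (hypothetical helper name)
# Assumption: each SPARC64 VII chip has 4 cores with 2 threads per core,
# so every CPU chip contributes 8 virtual processors.
vcpu_count() {
    cores_per_cpu=4
    threads_per_core=2
    echo $(( $1 * cores_per_cpu * threads_per_core ))
}
```

Under that assumption, `vcpu_count 32` gives the 256-virtual-processor limit described above. On a running domain, counting the lines printed by psrinfo should give the actual number of virtual processors.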

6623226

The Solaris command lockstat(1M) or the dtrace lockstat provider might cause a system panic.

This has been fixed in patch 140336-01.

[Workaround]

Do not use the Solaris lockstat(1M) command or the dtrace lockstat provider.

6625734

Systems with a large number of processors in a single domain environment may have suboptimal performance with certain workloads.

Use processor sets to bind application processes or LWPs to groups of processors. Refer to the psrset(1M) man page for more information.

6660168

If a ubc.piowbeue-cpu error occurs on a domain, the Solaris Fault Management cpumem-diagnosis module might fail, causing an interruption in FMA service.

If this happens, you will see output similar to the following sample in the console log:

SUNW-MSG-ID: FMD-8000-2K, TYPE: Defect, VER: 1, SEVERITY: Minor
EVENT-TIME: Fri Apr  4 21:41:57 PDT 2008
PLATFORM: SUNW,SPARC-Enterprise, CSN: 2020642002, HOSTNAME: <hostname>
SOURCE: fmd-self-diagnosis, REV: 1.0
EVENT-ID: 6b2e15d7-aa65-6bcc-bcb1-cb03a7dd77e3
DESC: A Solaris Fault Manager component has experienced an error that required the module to be disabled.  Refer to http://sun.com/msg/FMD-8000-2K for more information.
AUTO-RESPONSE: The module has been disabled.  Events destined for the module will be saved for manual diagnosis.
IMPACT: Automated diagnosis and response for subsequent events associated with this module will not occur.
REC-ACTION: Use fmdump -v -u <EVENT-ID> to locate the module. Use fmadm reset <module> to reset the module.

If fmd service fails, issue the following command on the domain to recover:

# svcadm clear fmd

Then restart cpumem-diagnosis:

# fmadm restart cpumem-diagnosis

6668237

After DIMMs are replaced, the corresponding DIMM faults are not cleared on the domain.

Use the command fmadm repair fmri|uuid to record the repair. Then you can use the command fmadm rotate to clear out any leftover events.

6679370

The following message might appear on the console during system boot, when an External I/O Expansion Unit is added by hot-plug, or during an FMEMA operation by DR.

SUNW-MSG-ID: SUN4-8000-75, TYPE: Fault, VER: 1, SEVERITY: Critical
...
DESC:
A problem was detected in the PCIExpress subsystem.
Refer to http://sun.com/msg/SUN4-8000-75 for more information.
...

Add the following to /etc/system, then reboot the domain.

set pcie_expected_ce_mask = 0x2001
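Several workarounds in these tables add a line to /etc/system. As a minimal sketch, assuming you want to avoid adding the same setting twice, a small helper can append a line only when it is not already present. The function name ensure_line is hypothetical and is not a Solaris utility; back up the file before changing it.

```shell
# ensure_line FILE LINE  (hypothetical helper name)
# Appends LINE to FILE unless an identical line is already present.
ensure_line() {
    file=$1
    line=$2
    # Compare whole lines literally so "0x2001" never matches "0x20010".
    while IFS= read -r existing; do
        [ "$existing" = "$line" ] && return 0
    done < "$file"
    printf '%s\n' "$line" >> "$file"
}
```

For example, `ensure_line /etc/system 'set pcie_expected_ce_mask = 0x2001'`, run as root, would add the setting once; the domain still has to be rebooted for it to take effect.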

6680733

Sun Quad-port Gigabit Ethernet Adapter UTP (QGC) and Sun Dual 10 GigE Fiber XFP Low Profile Adapter (XGF) NICs might panic under high load conditions.

This has been fixed in patch 139570-01.

6689757

A Sun Dual 10 GigE Fiber XFP Low Profile Adapter (XGF) with a single XFP optical transceiver, or with improperly installed XFP optical transceivers, might cause the following error to show on the console:

The XFP optical transceiver is broken or missing.

This has been fixed in patch 139570-01.

[Workaround]

Check and make sure that both XFP optical transceivers are firmly seated in the housing.

Do not mix Intel and Sun XFP optical transceivers in the same adapter.

Do NOT plumb a port with the ifconfig command if the port does not contain an XFP optical transceiver or it contains one but the transceiver is not in use.

6707628

Scheduler decisions on Mx000 systems are occasionally unbalanced.

Sometimes two threads will be on one core (causing both to run at about half speed) while another core is idle. For many OpenMP and similar parallel applications, the application performance is limited by the speed of the slowest thread.

Uneven scheduling is not common, perhaps 1 in 50 or 1 in 100 decisions. But if there are 128 threads running, then the application might have at least one uneven schedule event.

Use processor sets to prevent uneven thread-to-core assignment.
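A rough estimate shows why even a 1-in-100 rate matters for this issue. Assuming, purely for illustration, that each of the 128 threads corresponds to one independent scheduling decision that is uneven with probability p, the chance of at least one uneven placement is:

```latex
P(\text{at least one uneven}) = 1 - (1 - p)^{128}
\quad\Rightarrow\quad
\begin{cases}
p = 1/100: & 1 - 0.99^{128} \approx 0.72 \\
p = 1/50:  & 1 - 0.98^{128} \approx 0.92
\end{cases}
```

The independence assumption is a simplification, but it suggests that a 128-thread application is more likely than not to see at least one slow thread.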

6723202

The raidctl command cannot be used to create a hardware RAID using the onboard SAS/LSI controller on the M3000 server.

The raidctl command can be used to view disk/controller status, and can be used on any PCI Host Bus Adapter (HBA) installed in the system.

No workaround is available. This issue will not be fixed.

6745410

The boot program ignores the kadb option, which causes the system not to boot.

Use kmdb instead of kadb.

6794630

An attempt to use the GUI to install Solaris in a domain larger than 2TB might fail.

Use the Command Line Interface to install Solaris.


Solaris Issues Fixed in Solaris 10 10/08

TABLE 3 lists issues that have been fixed in Solaris 10 10/08 OS. You might encounter them in supported releases earlier than Solaris 10 10/08.


TABLE 3 Solaris OS Issues and Workarounds Fixed in Solaris 10 10/08

CR ID

Description

Workaround

6511374

Memory translation warning messages might appear during boot if memory banks were disabled due to excessive errors.

This has been fixed in patch 137137-09.

[Workaround]

After the system is rebooted, the fmadm repair command can be used to prevent a recurrence of the problem on the next boot.

6533686

When XSCF is low on system resources, DR deleteboard or moveboard operations that relocate permanent memory might fail with one or more of these errors:

SCF busy 
DR parallel copy timeout

This applies only to Quad-XSB configured System Boards hosting multiple domains.

This has been fixed in patch 138397-01.

 

Retry the DR operation at a later time.

6535018

In Solaris domains that include SPARC64 VII processors, workloads that make heavy use of the Solaris kernel might not scale as expected when you increase the thread count to a value greater than 256.

This has been fixed in patch 137111-01.

[Workaround]

For Solaris domains that include SPARC64 VII processors, limit domains to a maximum of 256 threads.

6556742

The system panics when DiskSuite cannot read the metadb during DR. This bug affects the following cards:

  • SG-XPCIE2FC-QF4, 4-Gigabit PCI-e Dual-Port Fiber Channel HBA
  • SG-XPCIE1FC-QF4, 4-Gigabit PCI-e Single-Port Fiber Channel HBA
  • SG-XPCI2FC-QF4, 4-Gigabit PCI-X Dual-Port Fiber Channel HBA
  • SG-XPCI1FC-QF4, 4-Gigabit PCI-X Single-Port Fiber Channel HBA

The panic can be avoided when a duplicate copy of the metadb is accessible through another Host Bus Adapter.

6589833

The DR addboard command might cause a system hang if you are adding a Sun StorageTek Enterprise Class 4-Gigabit Dual-Port Fiber Channel PCI-E HBA card (SG-XPCIE2FC-QF4) at the same time that an SAP process is attempting to access storage devices attached to this card. The chance of a system hang is increased if the following cards are used for heavy network traffic:

  • X4447A-Z, PCI-e Quad-port Gigabit Ethernet Adapter UTP
  • X1027A-Z1, PCI-e Dual 10 Gigabit Ethernet Fiber XFP Low profile Adapter

There is no workaround.

6614737

The DR deleteboard(8) and moveboard(8) operations might hang if any of the following conditions exist:

A DIMM has been degraded.

The domain contains system boards with different memory size.

For Solaris 10 5/08 or earlier, this has been fixed in patch 137111-01.

 

Avoid performing DR operations if any of the following conditions exist:

  • Degraded memory - To determine whether the system contains degraded memory, use the XSCF command showstatus. For sample output see Identifying System Memory.
  • Differing memory sizes - To determine whether the domain contains system boards with different memory sizes, display the list of memory sizes using the XSCF command showdevices or the prtdiag command on the domain. For sample output, see Identifying System Memory.

If a DR command hangs, reboot the domain to recover.

6632549

fmd service on domain might fail to go into maintenance mode after DR operations.

This has been fixed in patch 138050-01.

 

Issue the following command on the domain:

# svcadm clear fmd

6660197

DR might cause the domain to hang if either of the following conditions exists:

  • A domain contains 256 or more CPUs.
  • Memory error occurred and the DIMM has been degraded.

This has been fixed in patch 138397-01.

 

  1. Set the following parameter in the system specification file (/etc/system):
set drmach:drmach_disable_mcopy = 1
  2. Reboot the domain.

6720261

If your domain is running Solaris 10 5/08 OS, the system might panic/trap during normal operation.

This has been fixed in patch 137137-09.

[Workaround]

Set the following parameter in the system specification file (/etc/system):

set heaplp_use_stlb=0

Then reboot the domain.


 

Solaris Issues Fixed in Solaris 10 5/08

TABLE 4 lists issues that have been fixed in Solaris 10 5/08 OS. You might encounter them in supported releases earlier than Solaris 10 5/08.


TABLE 4 Solaris OS Issues and Workarounds Fixed in Solaris 10 5/08

CR ID

Description

Workaround

5076574

A PCIe error can lead to an invalid fault diagnosis on a large M9000/M8000 domain.

Create a file /etc/fm/fmd/fmd.conf containing the following lines:

setprop client.buflim 40m
setprop client.memlim 40m

6348554

Using the cfgadm -c disconnect command on the following cards might hang the command:

  • SG-XPCIE2FC-QF4, Sun StorageTek Enterprise Class 4-Gigabit Dual-Port Fiber Channel PCI-E HBA
  • SG-XPCIE1FC-QF4, Sun StorageTek Enterprise Class 4-Gigabit Single-Port Fiber Channel PCI-E HBA
  • SG-XPCI2FC-QF4, Sun StorageTek Enterprise Class 4-Gigabit Dual-Port Fiber Channel PCI-X HBA
  • SG-XPCI1FC-QF4, Sun StorageTek Enterprise Class 4-Gigabit Single-Port Fiber Channel PCI-X HBA

Do not perform the cfgadm -c disconnect operation on the affected cards.

6472153

If you create a Solaris Flash archive on a non-Sun SPARC Enterprise M4000/M5000 sun4u server and install it on a Sun SPARC Enterprise M4000/M5000 sun4u server, the console’s TTY flags will not be set correctly. This can cause the console to lose characters during stress.

For Solaris 10 8/07 or earlier, this has been fixed in patch 137046-01.

 

Just after installing Solaris OS from a Solaris Flash archive, telnet into the Sun SPARC Enterprise M4000/M5000 server to reset the console’s TTY flags as follows:

# sttydefs -r console
# sttydefs -a console -i "9600 hupcl opost onlcr crtscts" -f "9600"

 

This procedure is required only once.

6505921

Correctable error on the system PCIe bus controller generates an invalid fault.

For Solaris 10 8/07 or earlier, this has been fixed in patches 120011-14 and 125369-05.

 

Create a file /etc/fm/fmd/fmd.conf containing the following lines:

setprop client.buflim 40m
setprop client.memlim 40m

6522433

The incorrect motherboard might be identified by fmdump for cpu faults after reboot.

For Solaris 10 8/07 or earlier, this has been fixed in patch 127127-11.

Check system status on XSCF.

6527811

The showhardconf(8) command on the XSCF cannot display PCI card information that is installed in the External I/O Expansion Unit, if the External I/O Expansion Unit is configured using PCI hot-plug.

For Solaris 10 8/07 or earlier, this has been fixed in patch 128346-01.

 

There is no workaround. When each PCI card in the External I/O Expansion Unit is configured using PCI hot-plug, the PCI card information is displayed correctly.

6536564

The showlogs(8) and showstatus(8) commands might report the wrong I/O component.

For Solaris 10 8/07 or earlier, this has been fixed in patch 125369-05.

 

To avoid this problem, issue the following commands on the domain.

 

# cd /usr/platform/SUNW,SPARCEnterprise/lib/fm/topo/plugins
# mv ioboard.so ioboard.so.orig
# svcadm restart fmd
 

Contact a service engineer if the following messages are displayed:

 

SUNW-MSG-ID: SUNOS-8000-1L, TYPE: Defect, VER: 1, SEVERITY: Minor
EVENT-TIME: Sun May 6 18:22:24 PDT 2007
PLATFORM: SUNW,SPARC-Enterprise, CSN: BE80601007, HOSTNAME: sparc

6545143

There is a low probability that a system panic can occur during trap processing of a TLB miss for a user stack address. The problem can occur if the user stack is unmapped concurrently with the user process executing a flush windows trap (ta 3). The panic message will contain the following string:

bad kernel MMU trap at TL 2

For Solaris 10 8/07 or earlier, this has been fixed in patch 127111-08.

 

There is no workaround.

6545685

If the system has detected Correctable Memory Errors (CE) at power-on self-test (POST), the domains might incorrectly degrade 4 or 8 DIMMs.

For Solaris 10 8/07 or earlier, this has been fixed in patch 127111-08.

 

Increase the memory patrol timeout values used via the following setting in /etc/system and reboot the system:

set mc-opl:mc_max_rewrite_loop = 20000

6546188

The system panics when running hot-plug (cfgadm) and DR operations (addboard and deleteboard) on the following cards:

  • X4447A-Z, PCI-e Quad-port Gigabit Ethernet Adapter UTP
  • X1027A-Z1, PCI-e Dual 10 Gigabit Ethernet Fiber XFP Low profile Adapter

For Solaris 10 8/07, this has been fixed in patch 127741-01.

 

There is no workaround.

6551356

The system panics when running hot-plug (cfgadm) to configure a previously unconfigured card. The message “WARNING: PCI Expansion ROM is not accessible” will be seen on the console shortly before the system panic. The following cards are affected by this defect:

  • X4447A-Z, PCI-e Quad-port Gigabit Ethernet Adapter UTP
  • X1027A-Z1, PCI-e Dual 10 Gigabit Ethernet Fiber XFP Low profile Adapter

For Solaris 10 8/07, this has been fixed in patch 127741-01.

Note - Do not use cfgadm -c unconfigure to disconnect the I/O card.

Use cfgadm -c disconnect to completely remove the card. After waiting at least 10 seconds, the card might be configured back into the domain using the cfgadm -c configure command.
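The disconnect, wait, and configure sequence described above can be sketched as a small shell function. This is an illustrative outline only: the function name reseat_card, the CFGADM override variable, and the Ap_Id used in the example are all hypothetical. The override exists so that the sequence can be rehearsed with echo before it is run against real hardware.

```shell
# reseat_card AP_ID [WAIT_SECONDS]  (hypothetical helper name)
# CFGADM is an illustrative override; when unset, the real cfgadm is used.
reseat_card() {
    ap_id=$1
    wait_seconds=${2:-10}
    # Fully disconnect the card rather than only unconfiguring it.
    ${CFGADM:-cfgadm} -c disconnect "$ap_id" || return 1
    # Wait at least 10 seconds before configuring the card again.
    sleep "$wait_seconds"
    ${CFGADM:-cfgadm} -c configure "$ap_id"
}
```

Setting CFGADM=echo and calling `reseat_card pcie1` prints the two cfgadm argument lists without touching the slot, which makes it possible to check the Ap_Id first.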

6559504

Messages of the form nxge: NOTICE: nxge_ipp_eccue_valid_check: rd_ptr = nnn wr_ptr = nnn will be observed on the console with the following cards:

  • X4447A-Z, PCI-e Quad-port Gigabit Ethernet Adapter UTP
  • X1027A-Z1, PCI-e Dual 10 Gigabit Ethernet Fiber XFP Low profile Adapter

These messages can be safely ignored.

6563785

Hot-plug operation with the following cards might fail if a card is disconnected and then immediately reconnected:

  • SG-XPCIE2SCSIU320Z Sun StorageTek PCI-E Dual-Port Ultra320 SCSI HBA
  • SGXPCI2SCSILM320-Z Sun StorageTek PCI Dual-Port Ultra320 SCSI HBA

After disconnecting a card, wait for a few seconds before re-connecting.

6564934

Performing a DR deleteboard operation on a board which includes Permanent Memory when using the following network cards results in broken connections:

  • X4447A-Z, PCI-e Quad-port Gigabit Ethernet Adapter UTP
  • X1027A-Z1, PCI-e Dual 10 Gigabit Ethernet Fiber XFP Low profile Adapter

For Solaris 10 8/07, this has been fixed in patch 127741-01.

 

Reconfigure the affected network interfaces after the completion of the DR operation. For basic network configuration procedures, refer to the ifconfig man page.

6568417

After a successful CPU DR deleteboard operation, the system panics when the following network interfaces are in use:

  • X4447A-Z, PCI-e Quad-port Gigabit Ethernet Adapter UTP
  • X1027A-Z1, PCI-e Dual 10 Gigabit Ethernet Fiber XFP Low profile Adapter

For Solaris 10 8/07 or earlier, this has been fixed in patch 127111-02.

 

Add the following line to /etc/system and reboot the system:

set ip:ip_soft_rings_cnt=0 

6571370

Use of the following cards has been observed to cause data corruption in stress tests under laboratory conditions:

  • X4447A-Z, PCI-e Quad-port Gigabit Ethernet Adapter UTP
  • X1027A-Z1, PCI-e Dual 10 Gigabit Ethernet Fiber XFP Low profile Adapter

For Solaris 10 8/07, this has been fixed in patch 127741-01.

 

Add the following line in /etc/system and reboot the system:

set nxge:nxge_rx_threshold_hi=0 

6584984

The busstat(1M) command with -w option might cause domains to reboot.

There is no workaround. Do not use the busstat(1M) command with the -w option on pcmu_p.

6589546

prtdiag does not show all IO devices of the following cards:

  • SG-XPCIE2FC-EM4 Sun StorageTek Enterprise Class 4-Gigabit Dual-Port Fiber Channel PCI-E HBA
  • SG-XPCIE1FC-EM4 Sun StorageTek Enterprise Class 4-Gigabit Single-Port Fiber Channel PCI-E HBA

For Solaris 10 8/07 or earlier, this has been fixed in patch 127127-11.

 

Use prtdiag -v for full output.


Solaris Issues Fixed in Solaris 10 8/07

TABLE 5 lists issues that have been fixed in Solaris 10 8/07 OS. You might encounter them in Solaris 10 11/06.



Caution - CR ID #6534471: Improper handling of large pages in kernel memory might cause random panics. Implement the workaround for CR ID #6534471 or check for the availability of a patch and install it immediately. This bug has been fixed by patch 125100-06 and in Solaris 10 8/07.


 
TABLE 5 Solaris OS Issues and Workarounds Fixed in Solaris 10 8/07

CR ID

Description

Workaround

6303418

M9000 server with a single domain and 11 or more fully populated system boards might hang under heavy stress.

Do not exceed 170 CPU threads.

 

Limit the number of CPU threads to one per CPU core by using the Solaris psradm command to disable the excess CPU threads. For example, disable all odd-numbered CPU threads.
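The example of disabling all odd-numbered CPU threads can be sketched with a small filter. This is a hedged illustration, not a procedure from the original notes: it assumes that each psrinfo output line begins with the virtual processor ID, and the function name odd_cpu_ids is hypothetical.

```shell
# odd_cpu_ids  (hypothetical helper name)
# Reads psrinfo-style lines on standard input, where the first field is the
# virtual processor ID, and prints the odd-numbered IDs on a single line.
odd_cpu_ids() {
    awk '$1 % 2 == 1 { printf "%s%s", sep, $1; sep = " " }
         END { if (sep != "") print "" }'
}
```

The resulting list could then be passed to psradm, for example `psradm -f $(psrinfo | odd_cpu_ids)`, to take those virtual processors offline. Review the list before running psradm on a production domain.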

6416224

System performance can degrade using a single NIC card with more than 5,000 connections.

For Solaris 10 11/06, this has been fixed in patch 120011-08.

 

Use multiple NIC cards to split network connections.

6441349

I/O error can hang the system.

For Solaris 10 11/06, this has been fixed in patch 120011-07.

 

There is no workaround.

6485555

On-board Gigabit Ethernet NVRAM corruption could occur due to a race condition. The window of opportunity for this race condition is very small.

For Solaris 10 11/06, this has been fixed in patch 120011-08.

 

There is no workaround.

6496337

The “cpumem-diagnosis” module may fail to load after an uncorrectable error (UE) panic. Systems will function correctly, but events that FMA normally diagnoses automatically using this module will require manual diagnosis.

Example:

SUNW-MSG-ID: FMD-8000-2K, TYPE: Defect, VER: 1, SEVERITY: Minor
EVENT-TIME: Thu Feb 15 15:46:57 JST 2007
PLATFORM: SUNW,SPARC-Enterprise, CSN: BE80601007, HOSTNAME: col2-ffem7-d0

For Solaris 10 11/06, this has been fixed in patch 125369-05.

 

If the problem has already occurred, use this workaround:

  1. Remove the cpumemdiagnosis file:
# rm /var/fm/fmd/ckpt/cpumemdiagnosis/cpumem-diagnosis
  2. Restart fmd service:
# svcadm restart fmd

To avoid this problem in advance, add “rm -f /var/fm/fmd/ckpt/cpumemdiagnosis/cpumem-diagnosis” in the /lib/svc/method/svc-dumpadm file as below.

#
savedev=none
rm -f /var/fm/fmd/ckpt/cpumemdiagnosis/cpumem-diagnosis
#
#

6495303

The use of a PCIe Dual-Port Ultra320 SCSI controller card (SG-(X)PCIE2SCSIU320Z) in IOU Slot 1 on a Sun SPARC Enterprise M4000/M5000 server might result in a system panic.

Do not use this card in IOU Slot 1.

6498283

Using the DR deleteboard command while psradm operations are running on a domain might cause a system panic.

There is no workaround.

6499304

An unexpected message is displayed on the console and the CPU is not offlined when numerous correctable errors (CE) occur.

Example:

SUNW-MSG-ID: FMD-8000-11, TYPE: Defect, VER: 1, SEVERITY: Minor
EVENT-TIME: Fri Feb 2 18:31:07 JST 2007
PLATFORM: SPARC-Enterprise, CSN: BE80601035, HOSTNAME: FF2-35-0

For Solaris 10 11/06, this has been fixed in patch 125369-05.

 

Check CPU status on XSCF.

6502204

Unexpected error messages may be displayed on the console when booting after a CPU UE panic.

Example:

SUNW-MSG-ID: FMD-8000-11, TYPE: Defect, VER: 1, SEVERITY: Minor
EVENT-TIME: Tue Jan 9 20:45:08 JST 2007
PLATFORM: SUNW,SPARC-Enterprise, CSN: 2030636002, HOSTNAME: P2-DC1-16-d0

For Solaris 10 11/06, this has been fixed in patch 125369-05.

 

If you see unexpected messages, use the showdomainstatus(8) command to check system status on XSCF.

6502750

Inserting or removing a hot-plugged PCI card might not produce a notification message.

For Solaris 10 11/06, this has been fixed in patch 120011-08.

 

There is no workaround.

6508432

A large number of spurious PCIe correctable errors can be recorded in the FMA error log.

 

For Solaris 10 11/06, this has been fixed in patch 120011-08.

 

To mask these errors, add the following entry to /etc/system and reboot the system:

set pcie:pcie_aer_ce_mask = 0x2001

6508434

The domain may panic when an additional PCI-X card is installed or a PCI-X card is replaced using PCI hot-plug.

For Solaris 10 11/06, this has been fixed in patch 120011-08.

 

Do not insert a different type of PCI-X card on the same PCI slot by using PCI hot-plug.

6510861

When using the PCIe Dual-Port Ultra320 SCSI controller card (SG-(X)PCIE2SCSIU320Z), a PCIe correctable error causes a Solaris panic.

For Solaris 10 11/06, this has been fixed in patch 120011-08.

 

Add the following entry to /etc/system to prevent the problem:

set pcie:pcie_aer_ce_mask = 0x31c1

6520990

When a domain reboots, SCF might not be able to service other domains that share the same physical board. DR operation can exceed the default timeout period and panic can occur.

Increase the DR timeout period by adding the following statement to /etc/system, then reboot your system:

set drmach:fmem_timeout = 30

6527781

The cfgadm command fails while moving the DVD/DAT drive between two domains.

There is no workaround. To reconfigure the DVD/tape drive, execute reboot -r from the domain exhibiting the problem.

6530178

DR addboard command can hang. Once the problem is observed, further DR operations are blocked. Recovery requires reboot of the domain.

For Solaris 10 11/06, this has been fixed in patch 120011-07.

 

There is no workaround.

6530288

The cfgadm(1M) command may not correctly show the Ap_Id format.

For Solaris 10 11/06, this has been fixed in patch 120011-07.

 

There is no workaround.

6534471

Systems might panic/trap during normal operation.

This bug has been fixed in Solaris 10 8/07. Check for the availability of a patch for this bug.

If a patch is not available, disable the kernel large page sTLB programming. In the file /etc/system, change the heaplp_use_stlb variable to 0:

set heaplp_use_stlb=0

6535564

PCI hot-plug to PCI slot #0, #1 or External I/O Expansion Unit may fail on XSB added by DR.

For Solaris 10 11/06, this has been fixed in patch 120011-08.

 

Use DR instead of PCI hot-plug if you need to add or remove a PCI card on the XSB.

6539084

There is a low probability of a domain panic during reboot when the Sun Quad GbE UTP x8 PCIe (X4447A-Z) card is present in a domain.

A fix is available in patch 125670-01.

6539909

Do not use the following I/O cards for network access when you are using the boot net install command to install the Solaris OS:

  • X4447A-Z/X4447A-Z, PCIe Quad-port Gigabit Ethernet Adapter UTP
  • X1027A-Z/X1027A-Z, PCIe Dual 10 Gigabit Ethernet Fiber XFP

Use an alternative type of network card or onboard network device to install the Solaris OS via the network.

 

6542632

Memory leak in PCIe module if driver attach fails.

For Solaris 10 11/06, this has been fixed in patch 120011-09.

 

There is no workaround.



Documentation Updates

This section contains late-breaking information that became known after the documentation set was published.

The corrections for the Sun SPARC Enterprise M3000/M4000/M5000/M8000/M9000 Servers XSCF Reference Manual apply to the manual only; the XSCF man pages are correct.

TABLE 6 lists known documentation corrections.

 


TABLE 6 Documentation Corrections

Document

Issue

Change

SPARC Enterprise M3000/M4000/M5000/M8000/M9000 Servers XSCF User’s Guide

page 2-34

The description, “When you set the lockout time to 0 minutes to disable the account lockout function, the success of the first login with any user account will disable the function, whereas the failure of the first login will not disable the function. To disable the account lockout function you must set 0 minutes again.”, will be changed as follows:

“setloginlockout -s 0 will disable the account lockout.

When the account lockout is disabled, a user can attempt to login, and fail, an unlimited number of times. If a user needs to access their locked account before the lockout time is complete they must get an administrator to disable the account lockout to allow them to login and then re-enable the lockout by setting a lockout time.”

SPARC Enterprise M3000/M4000/M5000/M8000/M9000 Servers XSCF User’s Guide

page 3-4

The description, “The domain console is not forcibly logged out.”, will be changed as follows:

“When you return to XSCF shell console without logging out from the domain, the return causes automatically logging out from the domain. For detailed instructions on setting the session timeout value for domain console, see the Solaris OS manual.”

Sun SPARC Enterprise M3000/M4000/M5000/ M8000/M9000 Servers XSCF Reference Manual

 

The new commands setloginlockout(8) and showloginlockout(8) do not appear in the Reference Manual.

For details, refer to the man pages.

Sun SPARC Enterprise M3000/M4000/M5000/ M8000/M9000 Servers XSCF Reference Manual

setssh(8) command

The following new option does not appear in the manual:

-m dscp=mode

For details, refer to the setssh man page.

Sun SPARC Enterprise M3000/M4000/M5000/ M8000/M9000 Servers XSCF Reference Manual

showenvironment(8) command

The -power option does not appear in the manual.

For details, refer to the showenvironment man pages.

Sun SPARC Enterprise M3000/M4000/M5000/ M8000/M9000 Servers XSCF Reference Manual

showssh(8) command

The description of displaying the current settings of the SSH does not appear in the manual.

For details, refer to the showssh man pages.

Sun SPARC Enterprise M3000/M4000/M5000/ M8000/M9000 Servers XSCF Reference Manual

traceroute(8) command

The following description appears under Privileges but is incorrect:

  • To execute the command to DSCP address: fieldeng

The following description should appear under OPERANDS but does not:

When used to specify DSCP address to host, an error occurs.