Sun SPARC Enterprise M4000/M5000 Servers Product Notes
|
|
Sun SPARC Enterprise M4000/M5000 Servers Product Notes
|
This document includes these sections:
Supported Firmware and Software Versions
The following firmware and software versions are supported in this release:
- XSCF Control Package (XCP) 1040 or later is preinstalled in your server.
Note - If the XCP version preinstalled in your server is under XCP 1040, you must upgrade to XSCF Control Package (XCP) 1040 or later. Use the web browser interface, also known as the browser user interface (BUI), to download the flashupdate.
|
- The first version of the Solaris Operating System (OS) to support these servers is the Solaris 10 11/06 OS.
|
Caution - CR ID 6534471: The system may panic or trap during a normal operation. Implement the workaround for CR ID 6534471 or check for the availability of a patch and install it immediately. This CR is listed in the section, Solaris Issues and Workarounds.
|
If you plan to boot your SPARC Enterprise M4000/M5000 server from a Solaris WAN boot server on the network, you must upgrade the wanboot executable. See Booting From a WAN Boot Server for details.
Note - For the latest information on supported firmware and software versions, see Software Resources.
|
Solaris Patch Information
These are the mandatory patches for the Sun SPARC Enterprise M4000/M5000 servers. Install the patches in the following order:
- 118833-36 or later
After installing patch 118833-36, reboot your domain before proceeding.
- 125100-04 or later (Install 118833-36 before 125100-04.)
- 123839-07 or later
- 120068-03 or later
- 125424-01 or later
After installing patch 125424-01, reboot your domain before proceeding.
Note - See Software Resources for information on how to find the latest patches. Installation information and README files are included in the patch download.
|
Known Issues
This section describes known hardware and software issues in this release.
General Functionality Issues and Limitations
- Dynamic Reconfiguration (DR) is not recommended in the cases below, due to the following restrictions on the DR addboard, deleteboard, and moveboard commands:
|
Caution - Use of DR in an unsupported configuration might result in a domain panic or might hang the system.
|
- Domains using the ZFS filesystem cannot use DR.
- SPARC Enterprise M4000/M5000 servers are cold service machines. Hot-swapping of the CPU/Memory board unit (CMU), I/O Unit (IOU), or any eXtended System Control Facility (XSCF) unit is not supported.
- The XSCF web browser interface, also known as the browser user interface (BUI), has limited availability in this release. It can be used for importing the XSCF firmware and it supports the snapshot Full log set collection function. Use the command-line interface (CLI) instead on the Service Processor and the domains for other activities.
- Capacity on Demand (COD) is not supported.
- No more than two of the following cards (or a combination of the two cards) can be used per domain:
- 4447A-Z/X4447A-Z, PCIe Quad-port Gigabit Ethernet Adapter UTP
- 1027A-Z/X1027A-Z, PCIe Dual 10 Gigabit Ethernet Fiber XFP
- PCI Hot-Plug (PHP) is not available for this release. Check with technical support or your sales representative for additional information and software support.
- Power off all domains before upgrading the XCP firmware.
Hardware Installation and Service Issues
This section describes hardware specific issues and workarounds.
Specific Issues and Workarounds
TABLE 1 lists known issues for which a defect change request ID has been assigned. The table also lists possible workarounds.
TABLE 1 Hardware Issues and Workarounds
CR ID
|
Description
|
Workaround
|
6433420
|
The domain console might display a Mailbox timeout or IOCB interrupt timeout error during boot.
|
Issue a reset-all command from the OBP (OK) prompt and reboot.
|
6488846
|
During boot, the domain console might display a checksum error for the SG(X)PCI2SCSIU320-Z SCSI controller I/O card.
|
Check for the availability of the latest controller card firmware.
|
6498780
|
On the Sun SPARC Enterprise M4000/M5000 servers, the OpenBoot PROM (OBP) might not detect the on-board disk (HDD) boot device. Performing a boot disk results in a console message:
Can't locate boot device
|
The PCI or PCI-X plug-in adapter card might not be seated correctly. Reseat the card in slot 0 of the IOU.
|
U320 PCIe SCSI Card
U320 PCIe SCSI card, part numbers 375-3357-01/02, is not supported in PCI cassettes for SPARC Enterprise M4000/M5000 servers. Customers must use
375-3357-03 or later.
DIMM Replacement
The DIMMs are cold FRU replacement components. The entire server must be powered off and the power cords disconnected to replace the DIMMs.
You can mount up to 4 memory boards on the SPARC Enterprise M4000 server and up to 8 memory boards on the SPARC Enterprise M5000 server. The DIMMs on the memory board are grouped into group A and group B.
Here are the DIMM mount conditions:
- In each group, 4 DIMMs are mounted in a unit.
- The capacity of the DIMMs in group A must be equal to or larger than the capacity of the DIMMs in group B. You need not necessarily mount DIMMs in group B.
- In each of the groups, mount the DIMM of same capacity and of same rank. The DIMMs of different capacity can't be mixed in a group.
- The capacity and the rank of DIMM in each group must be the same in the same group in the same unit. You can mount the DIMMs of different capacity or different rank in the different domains or the groups.
To replace with the DIMMs of different capacity or different rank, you need to follow the above conditions on every memory board in the same CMU.
Electrical Specifications
The following changes belong in the SPARC Enterprise M4000/M5000 Servers Site Planning Guide.
TABLE 2 Midrange Servers Electrical Specifications
|
SPARC Enterprise M4000
|
SPARC Enterprise M5000
|
Power draw
|
2,016W
|
3,738W
|
Volt Ampere
|
2,058 VA
|
3,815 VA
|
Heat dissipation
|
6,879 BTUs/hr (7,258 kJ/hr)
|
12,754 BTUs/hr (13,457 kJ/hr)
|
Hardware Documentation Updates
This section contains late-breaking hardware information that became known after the documentation set was published.
TABLE 3 lists known documentation updates.
TABLE 3 Hardware Documentation Updates
Title
|
Page Number
|
Update
|
All SPARC Enterprise M4000/M5000 servers documentation
|
|
All DVD references are now referred to as CD-RW/DVD-RW.
|
Sun SPARC Enterprise M4000/M5000 Servers Site Planning Guide
|
1-7
|
TABLE 1-3 "Midrange Servres Physical Specifications"
Correct numerical value of "Depth" is 810mm/31.9 in. for the SPARC Enterprise M4000/M5000 servers.
|
|
2-4
|
TABLE 2-2 "Midrange Servres Electrical Specifications"
See Electrical Specifications for the changes.
|
Sun SPARC Enterprise M4000/M5000 Servers Service Manual
|
8-6
|
8.1.3, "Installing the PCI Cassette"
See The following changes belong in the Sun SPARC Enterprise M4000/M5000 Servers Service Manual. for the changes
|
|
11-7
|
11.2, "DIMM Replacement"
See DIMM Replacement for the changes.
|
|
C-7
|
TABLE C-5 "Power Supply Feature"
See Electrical Specifications for the changes.
|
The following changes belong in the Sun SPARC Enterprise M4000/M5000 Servers Service Manual.
Installing the PCI Cassette
|
Caution - Do not forcethe PCI cassette into a slot. Doing so can cause damage to the cassette and server.
|
1. Align the PCI cassette on the gray plastic guide and install it into the slot.
2. Lock the lever into place to seat the cassette.
Note - As the lever is moved pressure will build up, then just prior to locking into place the pressure will suddenly release. If the lever locks in place without the pressure release, the card may not be seated correctly. If this happens the card should be removed and reinstalled.
|
Note - When you insert the PCI cassette using hot-swap, the cassette is automatically powered on and configured. Check that the power LED on the cassette is LIT to be certain the cassette is correctly seated.
|
3. Connect all cables to the PCI cassette and reconnect the cable management arm if necessary.
Software Issues
This section describes specific software and firmware issues and workarounds.
XCP Issues and Workarounds
TABLE 4 lists known XCP issues and possible workarounds.
TABLE 4 XCP Issues and Workarounds
CR ID
|
Description
|
Workaround
|
6486286
|
Domain console connection does not cancel shell when disconnected.
|
Always log out of the Solaris (OS) before exiting the console connection.
If you accidentally disconnect the domain console without logging out:
- Log in again to the domain console
- Log out
- Exit the console connection
|
6519877
|
All domains must be powered off before upgrading the XCP firmware.
|
Power off domains before using the flashupdate command to upgrade XCP firmware.
|
6521896
|
If you log in to the XSCF Unit while it is still booting, you may get a bash$ prompt instead of the XSCF> prompt, and be unable to perform most operations.
|
Log out of the bash$ prompt and wait for the XSCF Unit to finish booting.
|
6529635
|
The showdomainstatus -a command shows domain status as Powered Off, but the showboards -a command shows the domain is testing.
|
Use the showboards command to check the status of system power.
The showdomainstatus command takes a longer time to show the correct status.
|
6532036
|
Some commands that update configuration data take a relatively long time to execute.
|
Do not cancel set* commands. They appear to hang, but eventually complete in about 30 seconds.
|
6533158
|
The fault (memory.block.ue) is reported every 10 hours. The error isencountered and reported periodically
|
An uncorrectable error exists in a DIMM and the DIMM should be replaced.
|
6537025
|
If the XSCF Unit reboots unexpectedly three times in a row without staying up for at least one hour, the XSCF Unit will no longer boot and must be replaced.
|
If user action causes the XSCF Unit to reboot, do not repeat the operation to see if it happens again. Allow the XSCF Unit to remain up for at least one hour or, if necessary, use the rebootxscf command to cause a clean reboot before retrying a failed operation.
|
6537345
|
When using the XSCF Web to import a firmware image, if the image is corrupted (for example, if the browser window is closed during import), the flashupdate command might later report an internal error. CR ID 6537996 is similar.
|
Use the command getflashimage -d to delete the corrupted image. If necessary, reboot the XSCF Unit, then use the flashupdate command again to clear the internal error.
|
6537408
|
Attempting to move a COD board using the moveboard command might fail.
|
Use the deleteboard and addboard commands instead of the moveboard command.
|
6538022
|
The XSCF firmware monitors itself and if it detects any inconsistencies, it forces a reboot.
|
There is no workaround. Allow the XSCF Unit to finish rebooting. It will return to normal operation within approximately five minutes.
|
6538564
|
Using the rebootxscf command might result in a process down error, and possibly an FMA event with MSG ID SCF-8005-NE
|
There is no workaround. Check for the availability of a patch for this defect.
|
6543260
|
The showaudit all command shows a long list of defaults in the policy section after the database is cleared.
|
To clear the non-existent user default settings, run the following commands:
setaudit -a opl=enable
setaudit -a opl=default
|
Solaris Issues and Workarounds
TABLE 5 lists Solaris issues and possible workarounds.
TABLE 5 Solaris Issues and Workarounds
CR ID
|
Description
|
Workaround
|
6459540
|
The DAT72 internal tape drive can time out during tape operations.
The device might also be identified by the system as a QIC drive.
Also causes device to be used as QIC vs. DAT"
|
Update the Solaris /kernel/drv/st.conf file with the following lines:
tape-config-list = "QUANTUM DAT DAT72-00", "QUANTUM DAT DAT72-00", "CFGQUANTUMDATDAT7200", "SEAGATE DAT DAT72-00", "SEAGATE DAT DAT72-00", "CFGSEAGATEDAT7200";
CFGQUANTUMDATDAT7200 = 2,0x34,0,0x18619,4,0x47,0x47,0x47,0x47,3,0,600,600,600,600,600,10800;
CFGSEAGATEDAT7200 = 2,0x34,0,0x18619,4,0x47,0x47,0x47,0x47,3,0,600,600,600,600,600,10800;
|
6472153
|
If you create a Solaris install image or boot image on a non-SPARC Enterprise M4000/M5000 sun4u server and use it on a SPARC Enterprise M4000/M5000 sun4u server, the console's TTY flags will not be set correctly. This can cause the console to lose characters during stress.
|
Telnet into the SPARC Enterprise M4000/M5000 server to reset the console's TTY flags as follows:
# sttydefs -r console
# sttydefs -a console -i "9600 hupcl opost onlcr crtscts" -f "9600"
|
6485555
|
On-board Gigabit Ethernet NVRAM corruption could occur due to a race condition. The window of opportunity for this race condition is very small.
|
If the NVRAM is corrupted, the device is not recognized as a network device. Contact your service representative to replace the FRU.
|
6495303
|
The use of a PCIe Dual-Port Ultra320 SCSI controller card (SG-(X)PCIE2SCSIU320Z) in IOU Slot 1 on a SPARC Enterprise M4000/M5000 server may result in a system panic.
|
Do not use this card in IOU Slot 1 on a SPARC Enterprise M4000/M5000 server.
|
6498283
|
Using the DR deleteboard command while psradm operations are running on a domain might cause a system panic.
|
There is no workaround. Check for the availability of a patch for this defect.
|
6505921
|
Correctable error on the system bus controller generates an invalid fault.
|
Create a file /etc/fm/fmd/fmd.conf containing the following lines;
setprop client.buflim 40m
setprop client.memlim 40m
|
6508432
|
A large number of spurious PCIe correctable errors can be recorded in the FMA error log.
|
To mask these errors, add the following entry to the /etc/system file and then reboot the system:
set pcie:pcie_aer_ce_mask=0x31c1
|
6510861
|
When using the PCIe Dual-Port Ultra320 SCSI controller card (SG-(X)PCIE2SCSIU320Z), a PCIe correctable error causes a Solaris panic.
|
To mask these errors, add the following entry to the /etc/system file and then reboot the system:
set pcie:pcie_aer_ce_mask=0x31c1
|
6522017
|
Domains using the ZFS file system cannot use DR.
|
There is no workaround.
|
6530178
|
DR addboard command can hang. Once problem is observed, further DR operations are blocked. Recovery requires reboot of the domain.
|
There is no workaround. Check for the availability of a patch for this defect.
|
6531036
|
The error message network initialization failed can appear repeatedly after boot net installation.
|
There is no workaround. Check for the availability of a patch for this defect.
|
6534471
|
Systems may panic/trap during normal operation.
|
- Make sure you have the correct /etc/system parameter:
set heaplp_use_stlb=0
- If a change to the parameter does not correct in the problem, check for the availability of a patch for this defect.
|
6536564
|
Faults in I/O devices might not be diagnosed correctly by the Solaris Fault Management Architecture and result in a defect.eft.undiagnosable_problem, or might be diagnosed as fault.io.* but identify the wrong IOU.
|
If Solaris panics and reboots due to an I/O fault, use fmdump -eV to view the error report. The device path in the error report will indicate where the error was detected, which will help to isolate the I/O fault.
|
6539084
|
PCIe Quad-port Gigabit Ethernet Adapter UTP card might panic during a reboot.
|
There is no workaround. Check for the availability of a patch for this defect.
|
6539909
|
Do not use the following I/O cards for network access when you are using the boot net install command to install the Solaris OS:
- 4447A-Z/X4447A-Z, PCIe Quad-port Gigabit Ethernet Adapter UTP
- 1027A-Z/X1027A-Z, PCIe Dual 10 Gigabit Ethernet Fiber XFP
|
Use an alternate type of network card or onboard network device to install the Solaris OS via the network.
|
6542632
|
Memory leak in PCIe module if driver attach fails.
|
There is no workaround. Check for the availability of a patch for this defect.
|
6545685
|
If the system has detected Correctible Memory Errors (CE) at power-on self-test (POST), the domains might incorrectly degrade 4 or 8 DIMMs.
|
Increase the memory patrol timeout values used via the following setting in /etc/system:
set mc-opl:mc_max_rewrite_loop = 10000
|
Identifying Permanent Memory in a Target Board
Dynamic reconfiguration is not recommended for production use if the target board (SB/XSB) has permanent memory.
1. Log in to XSCF.
2. Type the following command:
XSCF> showdevices -d domain_id
|
The following example shows a display of the showdevices -d command where 0 is the domain_id.
XSCF> showdevices -d 0
...
Memory:
-------
board perm base domain target deleted remaining
DID XSB mem MB mem MB address mem MB XSB mem MB mem MB
00 00-0 8192 0 0x0000000000000000 24576
00 00-2 8192 1674 0x000003c000000000 24576
00 00-3 8192 0 0x0000034000000000 24576
...
|
The entry for column 4 perm mem MB indicates the presence of permanent memory if the value is non-zero.
The example shows permanent memory on 00-2, with 1674 MB.
If the board includes permanent memory and executes the deleteboard command or moveboard command," the following notification will appear:
System may be temporarily suspended, proceed? [y|n]:
|
3. If a board includes permanent memory, enter n to cancel the DR command.
System may be temporarily suspended, proceed? [y|n]:n
disconnect SB5
DR operation canceled by operator.
XSCF>
|
Booting From a WAN Boot Server
To support booting the SPARC Enterprise M4000/M5000 server from a WAN boot server:
1. Install the Solaris 10 11/06 OS on the WAN boot server.
2. Copy the wanboot executable from that release to the appropriate location on the install server. If you need further instructions, refer to:
http://docs.sun.com/app/docs/doc/817-5504/6mkv4nh65?a=view
3. Create a WAN boot miniroot from the Solaris 10 11/06 OS. If you need further instructions, refer to:
http://docs.sun.com/app/docs/doc/817-5504/6mkv4nh63?a=view
If you do not upgrade the wanboot executable, the SPARC Enterprise M4000/M5000 server will panic, with messages similar to the following:
krtld: load_exec: fail to expand cpu/$CPUkrtld: error during initial load/link phasepanic - boot: exitto64 returned from client program
See http://docs.sun.com/app/docs/doc/817-5504/6mkv4nh5i?a=view for more information on WAN boot.
Abbreviated Man Page for getflashimage
Synopsis
getflashimage [-v] [[-q] -{y|n}] [-u user] [-p proxy [-t proxy_type]] url
getflashimage -l
getflashimage [[-q] -{y|n}] [-d]
getflashimage -h
Description
The getflashimage (8) command downloads a firmware image file for use by the flashupdate (8) command. If any previous image files of the firmware are present on the XSCF unit, they are deleted prior to downloading the new version. You must have platadm or fieldeng privileges to run this command.
Options and Operand
The following table describes the most commonly used options and operand.
-d
|
Deletes all previous firmware image files still on the XSCF unit, then exits.
|
-l
|
Lists firmware image files that are still on the XSCF unit, then exits.
|
-u user
|
Specifies the user name when logging in to a remote ftp or http server that requires authentication. You will be prompted for a password.
|
url
|
Specifies the URL of the firmware image to download.
|
Examples
CODE EXAMPLE 1 Downloading Using a User Name and Password
This example uses the optional -u user option.
XSCF> getflashimage -u jsmith \
http://imageserver/images/FFXCP1041.tar.gz
Existing versions:
Version Size Date
FFXCP1040.tar.gz 46827123 Wed Mar 14 19:11:40 2007
Warning: About to delete old versions.
Continue? [y|n]: y
Password: [not echoed]
Removing FFXCP1040.tar.gz.
0MB received
1MB received
2MB received
...
43MB received
44MB received
45MB received
Download successful: 46827KB at 1016.857KB/s
|
CODE EXAMPLE 2 Listing Available Firmware Image Files
XSCF> getflashimage -l
Existing versions:
Version Size Date
FFXCP1040.tar.gz 46827123 Wed Mar 14 19:11:40 2007
|
CODE EXAMPLE 3 Deleting All Previous Firmware Image Files
XSCF> getflashimage -d
Existing versions:
Version Size Date
FFXCP1040.tar.gz 46827123 Wed Mar 14 19:11:40 2007
Warning: About to delete old versions.
Continue? [y|n]: y
Removing FFXCP1040.tar.gz.
|
Software Documentation Updates
This section contains late-breaking information on the software documentation that became known after the documentation set was published.
TABLE 6 Software Documentation Updates
Document
|
Page Number
|
Change
|
All SPARC Enterprise M4000/M5000 servers documentation
|
|
All DVD references are now referred to as CD-RW/DVD-RW.
|
Sun SPARC Enterprise M4000/M5000/M8000/M9000 Servers XSCF User's Guide
|
|
The list of web browsers supported by the XSCF Web include:
- Microsoft Internet Explorer 6.0 or later
- Firefox 2.0 or later
- Mozilla 1.7.x or later
- Netscape Navigator 7.1 or later
|
Sun SPARC Enterprise M4000/M5000/M8000/M9000 Servers XSCF Reference Manual
|
ioxadm (8) man page
|
The required privileges for the ioxadm(8) command are as follows:
- With platop privileges, you can use the operands: env, list.
- With platadm privileges, you can use the operands: env, list, locator, poweroff, poweron.
- With fieldeng privileges, you can use the operands: env, list, locator, poweroff, poweron, reset, and setled.
The corrections here, if not otherwise specified, also apply to the man pages which XSCF provides. And they supersede the information on the man pages.
|
Sun SPARC Enterprise M4000/M5000/M8000/M9000 Servers XSCF Reference Manual
|
showldap (8) man page
showlookup (8) man page
showcodusage (8) man page
showemailreport (8) man page
|
The man pages for showldap, showlookup, showcodusage, and showemailreport do not state that these commands are available with the fieldeng privilege.
|
Sun SPARC Enterprise M4000/M5000/M8000/M9000 Servers XSCF Reference Manual
|
setaudit man (8) page
showaudit man (8) page
|
The setaudit and showaudit man pages are incorrect with respect to audit class information.
The following are the audit classes and their values:
ACS_SYSTEM 1
ACS_WRITE 2
ACS_READ 4
ACS_LOGIN 8
ACS_AUDIT 16
ACS_DOMAIN 32
ACS_USER 64
ACS_PLATFORM 128
ACS_MODES 256
|
|
getflashimage (8) man page
|
In XCP1040, the new command getflashimage is available, which can be used to download firmware images in place of the XSCF Web. The getflashimage man page is not included online or in the XSCF reference manual.
An abbreviated man page for getflashimage is included in Abbreviated Man Page for getflashimage.
|
Sun SPARC Enterprise M4000/M5000 Servers Product Notes
|
819-6105-11
|
|
Copyright © 2007, Sun Microsystems, Inc. All Rights Reserved.