本章讲述了关闭和引导群集及单个群集节点的过程。
有关本章中相关过程的高级说明,请参阅表 3–1 和表 3–2。
Sun Cluster scshutdown (1M) 命令有序停止群集服务并完全关闭整个群集。 在移动群集位置时可能会用到 scshutdown 命令。 如果因应用程序错误造成数据损坏,也可以使用该命令。
请使用 scshutdown 命令而不是 shutdown 命令或 halt 命令以确保正确关闭整个群集。 将 Solaris shutdown 命令与 scswitch(1M) 命令结合使用可关闭各个节点。 有关详细信息,请参阅如何关闭群集或关闭和引导单个群集节点。
scshutdown 命令通过以下方式停止群集中的所有节点:
使所有正在运行的资源组脱机。
卸载所有群集文件系统。
关闭活动的设备服务。
运行 init 0,并使所有节点转到基于 SPARC 系统上的 OpenBootTM PROM ok 提示下 ,或转到基于 x86 系统上的引导子系统中。 有关引导子系统的详细说明,请参阅 System Administration Guide: Basic Administration 中的 “Boot Subsystems”。
如有必要,您可以以非群集模式引导节点,使该节点不成为群集成员。 非群集模式在安装群集软件或执行某些管理过程时有用。 有关详细信息,请参阅如何在非群集模式下引导群集节点。
任务 |
有关说明 |
---|---|
停止群集 - 使用 scshutdown (1M) |
请参阅如何关闭群集 |
通过引导所有节点来启动群集。 节点必须具有到群集互连的有效连接才能获得群集成员的身份。 |
请参阅如何引导群集 |
重新引导群集 - 使用 scshutdown 在 ok 提示下或“当前引导参数”屏幕上的 Select (b)oot or (i)nterpreter 提示下,使用 boot(1M) 或b 命令分别引导每个节点。 节点必须具有到群集互连的有效连接才能获得群集成员的身份。 |
请参阅如何重新引导群集 |
不要在群集控制台上用 send brk 来关闭群集节点。 群集内部不支持该命令。
SPARC:如果群集上运行的是 Oracle Parallel Server 或 Real Application Cluster,请关闭数据库的所有实例。
有关关闭过程,请参阅 Oracle Parallel Server/Real Application Clusters 产品文档。
成为群集中任一节点上的超级用户。
立即关闭群集。
从群集中的单个节点,键入以下命令。
# scshutdown -g0 -y |
检验在基于 x86 的系统上的引导子系统中或在基于 SPARC 的系统上的所有节点是否都显示了 ok 提示。
在基于 SPARC 的系统上或基于 x86 的系统的引导子系统中,所有群集节点都显示 ok 提示后,才能关闭所有节点电源。
如有必要,关闭节点电源。
以下示例说明了在停止正常的群集操作,并关闭所有节点,以使 ok 提示被显示时控制台所输出的内容。 -g 0 选项将关闭宽延期设置为零,而 -y 表示在遇到确认问题时,自动回答 yes。 群集中其它节点的控制台上也显示关闭消息。
# scshutdown -g0 -y Wed Mar 10 13:47:32 phys-schost-1 cl_runtime: WARNING: CMM monitoring disabled. phys-schost-1# INIT: New run level: 0 The system is coming down. Please wait. System services are now being stopped. /etc/rc0.d/K05initrgm: Calling scswitch -S (evacuate) The system is down. syncing file systems... done Program terminated ok |
以下示例说明了在停止正常的群集操作,并关闭所有节点之后控制台输出的内容。 -g 0 选项将关闭宽延期设置为零,而 -y 表示在遇到确认问题时,自动回答 yes。 群集中其它节点的控制台上也显示关闭消息。
# scshutdown -g0 -y May 2 10:32:57 phys-schost-1 cl_runtime: WARNING: CMM: Monitoring disabled. root@phys-schost-1# INIT: New run level: 0 The system is coming down. Please wait. System services are now being stopped. /etc/rc0.d/K05initrgm: Calling scswitch -S (evacuate) failfasts already disabled on node 1 Print services already stopped. May 2 10:33:13 phys-schost-1 syslogd: going down on signal 15 The system is down. syncing file systems... done Type any key to continue |
要重新启动已关闭的群集,请参阅如何引导群集。
要启动一个群集,其中的节点已关闭,并且节点处于 ok 提示下或“当前引导参数”屏幕上的 Select (b)oot or (i)nterpreter 提示下,boot(1M) 每个节点。
如果在两次关闭操作之间进行了配置更改,则首先启动具有最新配置的节点。 除上述情形外,节点的引导顺序无关紧要。
SPARC:
ok boot |
x86:
<<< Current Boot Parameters >>> Boot path: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/pci8086,341a@7,1/ sd@0,0:a Boot args: Type b [file-name] [boot-flags] <ENTER> to boot with options or i <ENTER> to enter boot interpreter or <ENTER> to boot with defaults <<< timeout in 5 seconds >>> Select (b)oot or (i)nterpreter: b |
激活群集组件时,已引导节点的控制台上将显示消息。
群集节点必须具有到群集互连的有效连接才能获得群集成员的身份。
验证引导节点时未发生错误,而且节点现在处于联机状态。
scstat(1M) 命令将报告节点的状况。
# scstat -n |
如果群集节点的 /var 文件系统已满,可能无法在该节点上重新启动 Sun Cluster。 如果出现此问题,请参阅如何修复已满的 /var 文件系统。
以下示例说明了将节点 phys-schost-1 引导到群集中时控制台所输出的内容。 类似消息出现在群集中其它节点的控制台上。
ok boot Rebooting with command: boot ... Hostname: phys-schost-1 Booting as part of a cluster NOTICE: Node phys-schost-1 with votecount = 1 added. NOTICE: Node phys-schost-2 with votecount = 1 added. NOTICE: Node phys-schost-3 with votecount = 1 added. ... NOTICE: Node phys-schost-1: attempting to join cluster ... NOTICE: Node phys-schost-2 (incarnation # 937690106) has become reachable. NOTICE: Node phys-schost-3 (incarnation # 937690290) has become reachable. NOTICE: cluster has reached quorum. NOTICE: node phys-schost-1 is up; new incarnation number = 937846227. NOTICE: node phys-schost-2 is up; new incarnation number = 937690106. NOTICE: node phys-schost-3 is up; new incarnation number = 937690290. NOTICE: Cluster members: phys-schost-1 phys-schost-2 phys-schost-3. ... |
以下示例说明了将节点 phys-schost-1 引导到群集中时控制台所输出的内容。 群集中其它节点的控制台上会显示类似的消息。
ATI RAGE SDRAM BIOS P/N GR-xlint.007-4.330 * BIOS Lan-Console 2.0 Copyright (C) 1999-2001 Intel Corporation MAC ADDR: 00 02 47 31 38 3C AMIBIOS (C)1985-2002 American Megatrends Inc., Copyright 1996-2002 Intel Corporation SCB20.86B.1064.P18.0208191106 SCB2 Production BIOS Version 2.08 BIOS Build 1064 2 X Intel(R) Pentium(R) III CPU family 1400MHz Testing system memory, memory size=2048MB 2048MB Extended Memory Passed 512K L2 Cache SRAM Passed ATAPI CD-ROM SAMSUNG CD-ROM SN-124 Press <F2> to enter SETUP, <F12> Network Adaptec AIC-7899 SCSI BIOS v2.57S4 (c) 2000 Adaptec, Inc. All Rights Reserved. Press <Ctrl><A> for SCSISelect(TM) Utility! Ch B, SCSI ID: 0 SEAGATE ST336605LC 160 SCSI ID: 1 SEAGATE ST336605LC 160 SCSI ID: 6 ESG-SHV SCA HSBP M18 ASYN Ch A, SCSI ID: 2 SUN StorEdge 3310 160 SCSI ID: 3 SUN StorEdge 3310 160 AMIBIOS (C)1985-2002 American Megatrends Inc., Copyright 1996-2002 Intel Corporation SCB20.86B.1064.P18.0208191106 SCB2 Production BIOS Version 2.08 BIOS Build 1064 2 X Intel(R) Pentium(R) III CPU family 1400MHz Testing system memory, memory size=2048MB 2048MB Extended Memory Passed 512K L2 Cache SRAM Passed ATAPI CD-ROM SAMSUNG CD-ROM SN-124 SunOS - Intel Platform Edition Primary Boot Subsystem, vsn 2.0 Current Disk Partition Information Part# Status Type Start Length ================================================ 1 Active X86 BOOT 2428 21852 2 SOLARIS 24280 71662420 3 <unused> 4 <unused> Please select the partition you wish to boot: * * Solaris DCB loading /solaris/boot.bin SunOS Secondary Boot version 3.00 Solaris Intel Platform Edition Booting System Autobooting from bootpath: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/ pci8086,341a@7,1/sd@0,0:a If the system hardware has changed, or to boot from a different device, interrupt the autoboot process by pressing ESC. Press ESCape to interrupt autoboot in 2 seconds. Initializing system Please wait... Warning: Resource Conflict - both devices are added NON-ACPI device: ISY0050 Port: 3F0-3F5, 3F7; IRQ: 6; DMA: 2 ACPI device: ISY0050 Port: 3F2-3F3, 3F4-3F5, 3F7; IRQ: 6; DMA: 2 <<< Current Boot Parameters >>> Boot path: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/pci8086,341a@7,1/ sd@0,0:a Boot args: Type b [file-name] [boot-flags] <ENTER> to boot with options or i <ENTER> to enter boot interpreter or <ENTER> to boot with defaults <<< timeout in 5 seconds >>> Select (b)oot or (i)nterpreter: Size: 275683 + 22092 + 150244 Bytes /platform/i86pc/kernel/unix loaded - 0xac000 bytes used SunOS Release 5.9 Version Generic_112234-07 32-bit Copyright 1983-2003 Sun Microsystems, Inc. All rights reserved. Use is subject to license terms. configuring IPv4 interfaces: e1000g2. Hostname: phys-schost-1 Booting as part of a cluster NOTICE: CMM: Node phys-schost-1 (nodeid = 1) with votecount = 1 added. NOTICE: CMM: Node phys-schost-2 (nodeid = 2) with votecount = 1 added. NOTICE: CMM: Quorum device 1 (/dev/did/rdsk/d1s2) added; votecount = 1, bitmask of nodes with configured paths = 0x3. NOTICE: clcomm: Adapter e1000g3 constructed NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 being constructed NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 being initiated NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 online NOTICE: clcomm: Adapter e1000g0 constructed NOTICE: clcomm: Path phys-schost-1:e1000g0 - phys-schost-2:e1000g0 being constructed NOTICE: CMM: Node phys-schost-1: attempting to join cluster. NOTICE: clcomm: Path phys-schost-1:e1000g0 - phys-schost-2:e1000g0 being initiated NOTICE: CMM: Quorum device /dev/did/rdsk/d1s2: owner set to node 1. NOTICE: CMM: Cluster has reached quorum. NOTICE: CMM: Node phys-schost-1 (nodeid = 1) is up; new incarnation number = 1068496374. NOTICE: CMM: Node phys-schost-2 (nodeid = 2) is up; new incarnation number = 1068496374. NOTICE: CMM: Cluster members: phys-schost-1 phys-schost-2. NOTICE: CMM: node reconfiguration #1 completed. NOTICE: CMM: Node phys-schost-1: joined cluster. |
运行 scshutdown( 1M) 命令以关闭群集,然后在每个节点上使用 boot( 1M) 命令引导群集。
SPARC:如果群集运行的是 Oracle Parallel Server/Real Application Clusters,则应关闭数据库的所有实例。
有关关闭过程,请参阅 Oracle Parallel Server/Real Application Clusters 产品文档。
成为群集中任一节点上的超级用户。
关闭群集。
从群集中的单个节点中键入以下命令。
# scshutdown -g0 -y |
将关闭所有节点。
群集节点必须具有到群集互连的有效连接才能获得群集成员的身份。
引导每个节点。
节点的引导次序无关紧要,除非您在两次关闭操作之间进行了配置更改。 如果在两次关闭操作之间进行了配置更改,则首先启动具有最新配置的节点。
SPARC:
ok boot |
x86:
<<< Current Boot Parameters >>> Boot path: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/pci8086,341a@7,1/ sd@0,0:a Boot args: Type b [file-name] [boot-flags] <ENTER> to boot with options or i <ENTER> to enter boot interpreter or <ENTER> to boot with defaults <<< timeout in 5 seconds >>> Select (b)oot or (i)nterpreter: b |
验证引导节点时未发生错误,而且节点现在处于联机状态。
scstat 命令报告节点状态。
# scstat -n |
如果群集节点的 /var 文件系统已满,可能无法在该节点上重新启动 Sun Cluster。 如果出现此问题,请参阅如何修复已满的 /var 文件系统。
以下示例显示在停止了正常的群集操作,并关闭了所有节点,使其转到 ok 提示下然后再重新引导群集时控制台所输出的内容。 -g 0 选项将宽延期设置为零,而 -y 表示在遇到确认问题时,自动回答 yes。 群集中其它节点的控制台上也显示关闭消息。
# scshutdown -g0 -y Wed Mar 10 13:47:32 phys-schost-1 cl_runtime: WARNING: CMM monitoring disabled. phys-schost-1# INIT: New run level: 0 The system is coming down. Please wait. ... The system is down. syncing file systems... done Program terminated ok boot Rebooting with command: boot ... Hostname: phys-schost-1 Booting as part of a cluster ... NOTICE: Node phys-schost-1: attempting to join cluster ... NOTICE: Node phys-schost-2 (incarnation # 937690106) has become reachable. NOTICE: Node phys-schost-3 (incarnation # 937690290) has become reachable. NOTICE: cluster has reached quorum. ... NOTICE: Cluster members: phys-schost-1 phys-schost-2 phys-schost-3. ... NOTICE: Node phys-schost-1: joined cluster ... The system is coming up. Please wait. checking ufs filesystems ... reservation program successfully exiting Print services started. volume management starting. The system is ready. phys-schost-1 console login: NOTICE: Node phys-schost-1: joined cluster ... The system is coming up. Please wait. checking ufs filesystems ... reservation program successfully exiting Print services started. volume management starting. The system is ready. phys-schost-1 console login: |
以下示例显示在停止正常的群集操作,关闭所有节点,再重新引导群集时控制台所输出的内容。 -g 0 选项将宽延期设置为零,而 -y 表示在遇到确认问题时,自动回答 yes。 群集中其它节点的控制台上也显示关闭消息。
# scshutdown -g0 -y May 2 10:32:57 phys-schost-1 cl_runtime: WARNING: CMM: Monitoring disabled. root@phys-schost-1# INIT: New run level: 0 The system is coming down. Please wait. System services are now being stopped. /etc/rc0.d/K05initrgm: Calling scswitch -S (evacuate) failfasts already disabled on node 1 Print services already stopped. May 2 10:33:13 phys-schost-1 syslogd: going down on signal 15 The system is down. syncing file systems... done Type any key to continue ATI RAGE SDRAM BIOS P/N GR-xlint.007-4.330 * BIOS Lan-Console 2.0 Copyright (C) 1999-2001 Intel Corporation MAC ADDR: 00 02 47 31 38 3C AMIBIOS (C)1985-2002 American Megatrends Inc., Copyright 1996-2002 Intel Corporation SCB20.86B.1064.P18.0208191106 SCB2 Production BIOS Version 2.08 BIOS Build 1064 2 X Intel(R) Pentium(R) III CPU family 1400MHz Testing system memory, memory size=2048MB 2048MB Extended Memory Passed 512K L2 Cache SRAM Passed ATAPI CD-ROM SAMSUNG CD-ROM SN-124 Press <F2> to enter SETUP, <F12> Network Adaptec AIC-7899 SCSI BIOS v2.57S4 (c) 2000 Adaptec, Inc. All Rights Reserved. Press <Ctrl><A> for SCSISelect(TM) Utility! Ch B, SCSI ID: 0 SEAGATE ST336605LC 160 SCSI ID: 1 SEAGATE ST336605LC 160 SCSI ID: 6 ESG-SHV SCA HSBP M18 ASYN Ch A, SCSI ID: 2 SUN StorEdge 3310 160 SCSI ID: 3 SUN StorEdge 3310 160 AMIBIOS (C)1985-2002 American Megatrends Inc., Copyright 1996-2002 Intel Corporation SCB20.86B.1064.P18.0208191106 SCB2 Production BIOS Version 2.08 BIOS Build 1064 2 X Intel(R) Pentium(R) III CPU family 1400MHz Testing system memory, memory size=2048MB 2048MB Extended Memory Passed 512K L2 Cache SRAM Passed ATAPI CD-ROM SAMSUNG CD-ROM SN-124 SunOS - Intel Platform Edition Primary Boot Subsystem, vsn 2.0 Current Disk Partition Information Part# Status Type Start Length ================================================ 1 Active X86 BOOT 2428 21852 2 SOLARIS 24280 71662420 3 <unused> 4 <unused> Please select the partition you wish to boot: * * Solaris DCB loading /solaris/boot.bin SunOS Secondary Boot version 3.00 Solaris Intel Platform Edition Booting System Autobooting from bootpath: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/ pci8086,341a@7,1/sd@0,0:a If the system hardware has changed, or to boot from a different device, interrupt the autoboot process by pressing ESC. Press ESCape to interrupt autoboot in 2 seconds. Initializing system Please wait... Warning: Resource Conflict - both devices are added NON-ACPI device: ISY0050 Port: 3F0-3F5, 3F7; IRQ: 6; DMA: 2 ACPI device: ISY0050 Port: 3F2-3F3, 3F4-3F5, 3F7; IRQ: 6; DMA: 2 <<< Current Boot Parameters >>> Boot path: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/pci8086,341a@7,1/ sd@0,0:a Boot args: Type b [file-name] [boot-flags] <ENTER> to boot with options or i <ENTER> to enter boot interpreter or <ENTER> to boot with defaults <<< timeout in 5 seconds >>> Select (b)oot or (i)nterpreter: b Size: 275683 + 22092 + 150244 Bytes /platform/i86pc/kernel/unix loaded - 0xac000 bytes used SunOS Release 5.9 Version Generic_112234-07 32-bit Copyright 1983-2003 Sun Microsystems, Inc. All rights reserved. Use is subject to license terms. configuring IPv4 interfaces: e1000g2. Hostname: phys-schost-1 Booting as part of a cluster NOTICE: CMM: Node phys-schost-1 (nodeid = 1) with votecount = 1 added. NOTICE: CMM: Node phys-schost-2 (nodeid = 2) with votecount = 1 added. NOTICE: CMM: Quorum device 1 (/dev/did/rdsk/d1s2) added; votecount = 1, bitmask of nodes with configured paths = 0x3. NOTICE: clcomm: Adapter e1000g3 constructed NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 being constructed NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 being initiated NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 online NOTICE: clcomm: Adapter e1000g0 constructed NOTICE: clcomm: Path phys-schost-1:e1000g0 - phys-schost-2:e1000g0 being constructed NOTICE: CMM: Node phys-schost-1: attempting to join cluster. NOTICE: clcomm: Path phys-schost-1:e1000g0 - phys-schost-2:e1000g0 being initiated NOTICE: CMM: Quorum device /dev/did/rdsk/d1s2: owner set to node 1. NOTICE: CMM: Cluster has reached quorum. NOTICE: CMM: Node phys-schost-1 (nodeid = 1) is up; new incarnation number = 1068496374. NOTICE: CMM: Node phys-schost-2 (nodeid = 2) is up; new incarnation number = 1068496374. NOTICE: CMM: Cluster members: phys-schost-1 phys-schost-2. NOTICE: CMM: node reconfiguration #1 completed. NOTICE: CMM: Node phys-schost-1: joined cluster. WARNING: mod_installdrv: no major number for rsmrdt ip: joining multicasts failed (18) on clprivnet0 - will use link layer broadcasts for multicast The system is coming up. Please wait. checking ufs filesystems /dev/rdsk/c1t0d0s5: is clean. NOTICE: clcomm: Path phys-schost-1:e1000g0 - phys-schost-2:e1000g0 online NIS domain name is dev.eng.mycompany.com starting rpc services: rpcbind keyserv ypbind done. Setting netmask of e1000g2 to 255.255.255.0 Setting netmask of e1000g3 to 255.255.255.128 Setting netmask of e1000g0 to 255.255.255.128 Setting netmask of clprivnet0 to 255.255.255.0 Setting default IPv4 interface for multicast: add net 224.0/4: gateway phys-schost-1 syslog service starting. obtaining access to all attached disks ***************************************************************************** * * The X-server can not be started on display :0... * ***************************************************************************** volume management starting. Starting Fault Injection Server... The system is ready. phys-schost-1 console login: |
结合使用 scswitch( 1M) 命令和 Solaris shutdown(1M) 命令来关闭单个节点。 仅在关闭整个群集时使用 scshutdown 命令。
任务 |
有关说明 |
---|---|
停止群集节点 - 使用 scswitch (1M) 和 shutdown(1M) | |
启动节点 节点必须具有到群集互连的有效连接才能获得群集成员的身份。 | |
停止和重新启动(重新引导)群集节点 - 使用 scswitch 和 shutdown 节点必须具有到群集互连的有效连接才能获得群集成员的身份。 | |
引导一个节点,使节点不成为群集成员 - 使用 scswitch 和 shutdown,然后使用 boot -x 或 b -x |
不要在群集控制台上用 send brk 来关闭群集节点。 群集内部不支持该命令。
SPARC:如果群集节点运行的是 Oracle Parallel Server/Real Application Clusters,则应关闭数据库的所有实例。
有关关闭过程,请参阅 Oracle Parallel Server/Real Application Clusters 产品文档。
成为要关闭的群集节点上的超级用户。
将所有资源组、资源和设备组从要关闭的节点切换到其它群集成员。
在要关闭的节点上,键入以下命令。
# scswitch -S -h node |
从指定的节点抽出所有的设备服务和资源组。
指定从中切换资源组和设备组的节点。
关闭群集节点。
在要关闭的节点上,键入以下命令。
# shutdown -g0 -y -i0 |
检验群集节点是否显示 ok 提示或在“当前引导参数”屏幕上显示 Select (b)oot or (i)nterpreter 提示。
如有必要,请关闭节点电源。
以下示例显示了关闭节点 phys-schost-1 时控制台所输出的内容。 -g0 选项将宽延期设置为零,-y 表示在遇到确认问题时,自动回答 yes,而 -i0 则调用运行级 0(零)。 群集中其它节点的控制台上也显示此节点的关闭消息。
# scswitch -S -h phys-schost-1 # shutdown -g0 -y -i0 Wed Mar 10 13:47:32 phys-schost-1 cl_runtime: WARNING: CMM monitoring disabled. phys-schost-1# INIT: New run level: 0 The system is coming down. Please wait. Notice: rgmd is being stopped. Notice: rpc.pmfd is being stopped. Notice: rpc.fed is being stopped. umount: /global/.devices/node@1 busy umount: /global/phys-schost-1 busy The system is down. syncing file systems... done Program terminated ok |
以下示例显示了关闭节点 phys-schost-1 时控制台所输出的内容。 -g0 选项将宽延期设置为零,-y 表示在遇到确认问题时,自动回答 yes,而 -i0 则调用运行级 0(零)。 群集中其它节点的控制台上也显示此节点的关闭消息。
# scswitch -S -h phys-schost-1 # shutdown -g0 -y -i0 Shutdown started. Wed Mar 10 13:47:32 PST 2004 Changing to init state 0 - please wait Broadcast Message from root (console) on phys-schost-1 Wed Mar 10 13:47:32... THE SYSTEM phys-schost-1 IS BEING SHUT DOWN NOW ! ! ! Log off now or risk your files being damaged phys-schost-1# INIT: New run level: 0 The system is coming down. Please wait. System services are now being stopped. /etc/rc0.d/K05initrgm: Calling scswitch -S (evacuate) failfasts disabled on node 1 Print services already stopped. Mar 10 13:47:44 phys-schost-1 syslogd: going down on signal 15 umount: /global/.devices/node@2 busy umount: /global/.devices/node@1 busy The system is down. syncing file systems... done WARNING: CMM: Node being shut down. Type any key to continue |
要重新启动已关闭的群集节点,请参阅如何引导群集节点。
群集节点的启动可能受定额配置的影响。 在双节点群集中,您必须配置定额设备,以使该群集的定额总计数为三个。 每个节点应有一个定额计数,定额设备有一个定额计数。 在这种情况下,当第一个节点关闭后,第二个节点仍然有定额设备,并以唯一群集成员的身份运行。 由于第一个节点在群集中要作为群集节点返回,所以第二个节点必须处于打开和运行状态。 必须存在所需的群集定额计数(两个)。
SPARC:
ok boot |
x86:
<<< Current Boot Parameters >>> Boot path: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/pci8086,341a@7,1/ sd@0,0:a Boot args: Type b [file-name] [boot-flags] <ENTER> to boot with options or i <ENTER> to enter boot interpreter or <ENTER> to boot with defaults <<< timeout in 5 seconds >>> Select (b)oot or (i)nterpreter: b |
群集节点必须具有到群集互连的有效连接才能获得群集成员的身份。
验证引导节点时未发生错误,而且节点现在处于联机状态。
scstat 命令报告节点的状况。
# scstat -n |
如果群集节点的 /var 文件系统已满,可能无法在该节点上重新启动 Sun Cluster。 如果出现此问题,请参阅如何修复已满的 /var 文件系统。
以下示例说明了将节点 phys-schost-1 引导到群集中时控制台所输出的内容。
ok boot Rebooting with command: boot ... Hostname: phys-schost-1 Booting as part of a cluster ... NOTICE: Node phys-schost-1: attempting to join cluster ... NOTICE: Node phys-schost-1: joined cluster ... The system is coming up. Please wait. checking ufs filesystems ... reservation program successfully exiting Print services started. volume management starting. The system is ready. phys-schost-1 console login: |
以下示例说明了将节点 phys-schost-1 引导到群集中时控制台所输出的内容。
<<< Current Boot Parameters >>> Boot path: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/pci8086,341a@7,1/sd@0,0:a Boot args: Type b [file-name] [boot-flags] <ENTER> to boot with options or i <ENTER> to enter boot interpreter or <ENTER> to boot with defaults <<< timeout in 5 seconds >>> Select (b)oot or (i)nterpreter: Size: 276915 + 22156 + 150372 Bytes /platform/i86pc/kernel/unix loaded - 0xac000 bytes used SunOS Release 5.9 Version on81-feature-patch:08/30/2003 32-bit Copyright 1983-2003 Sun Microsystems, Inc. All rights reserved. Use is subject to license terms. configuring IPv4 interfaces: e1000g2. Hostname: phys-schost-1 Booting as part of a cluster NOTICE: CMM: Node phys-schost-1 (nodeid = 1) with votecount = 1 added. NOTICE: CMM: Node phys-schost-2 (nodeid = 2) with votecount = 1 added. NOTICE: CMM: Quorum device 1 (/dev/did/rdsk/d1s2) added; votecount = 1, bitmask of nodes with configured paths = 0x3. WARNING: CMM: Initialization for quorum device /dev/did/rdsk/d1s2 failed with error EACCES. Will retry later. NOTICE: clcomm: Adapter e1000g3 constructed NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 being constructed NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 being initiated NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 online NOTICE: clcomm: Adapter e1000g0 constructed NOTICE: clcomm: Path phys-schost-1:e1000g0 - phys-schost-2:e1000g0 being constructed NOTICE: CMM: Node phys-schost-1: attempting to join cluster. WARNING: CMM: Reading reservation keys from quorum device /dev/did/rdsk/d1s2 failed with error 2. NOTICE: CMM: Cluster has reached quorum. NOTICE: CMM: Node phys-schost-1 (nodeid = 1) is up; new incarnation number = 1068503958. NOTICE: CMM: Node phys-schost-2 (nodeid = 2) is up; new incarnation number = 1068496374. NOTICE: CMM: Cluster members: phys-schost-1 phys-schost-2. NOTICE: CMM: node reconfiguration #3 completed. NOTICE: CMM: Node phys-schost-1: joined cluster. NOTICE: clcomm: Path phys-schost-1:e1000g0 - phys-schost-2:e1000g0 being initiated NOTICE: clcomm: Path phys-schost-1:e1000g0 - phys-schost-2:e1000g0 online NOTICE: CMM: Retry of initialization for quorum device /dev/did/rdsk/d1s2 was successful. WARNING: mod_installdrv: no major number for rsmrdt ip: joining multicasts failed (18) on clprivnet0 - will use link layer broadcasts for multicast The system is coming up. Please wait. checking ufs filesystems /dev/rdsk/c1t0d0s5: is clean. NIS domain name is dev.eng.mycompany.com starting rpc services: rpcbind keyserv ypbind done. Setting netmask of e1000g2 to 255.255.255.0 Setting netmask of e1000g3 to 255.255.255.128 Setting netmask of e1000g0 to 255.255.255.128 Setting netmask of clprivnet0 to 255.255.255.0 Setting default IPv4 interface for multicast: add net 224.0/4: gateway phys-schost-1 syslog service starting. obtaining access to all attached disks ***************************************************************************** * * The X-server can not be started on display :0... * ***************************************************************************** volume management starting. Starting Fault Injection Server... The system is ready. phys-schost-1 console login: |
SPARC:如果群集节点运行的是 Oracle Parallel Server/Real Application Clusters,则应关闭数据库的所有实例。
有关关闭过程,请参阅 Oracle Parallel Server/Real Application Clusters 产品文档。
成为要关闭的群集节点上的超级用户。
使用 scswitch 和 shutdown 命令关闭群集节点。
在要关闭的节点上,输入以下命令。 如果结合使用 shutdown 命令和 -i 6 选项,将使节点关闭后重新进行引导。
# scswitch -S -h node # shutdown -g0 -y -i6 |
群集节点必须具有到群集互连的有效连接才能获得群集成员的身份。
验证引导节点时未发生错误,而且节点现在处于联机状态。
# scstat -n |
以下示例显示了重新引导节点 physschost-1 时控制台所输出的内容。 有关该节点的消息(例如关闭和启动通知)会出现在群集中其他节点的控制台上。
# scswitch -S -h phys-schost-1 # shutdown -g0 -y -i6 Shutdown started. Wed Mar 10 13:47:32 phys-schost-1 cl_runtime: WARNING: CMM monitoring disabled. phys-schost-1# INIT: New run level: 6 The system is coming down. Please wait. System services are now being stopped. Notice: rgmd is being stopped. Notice: rpc.pmfd is being stopped. Notice: rpc.fed is being stopped. umount: /global/.devices/node@1 busy umount: /global/phys-schost-1 busy The system is down. syncing file systems... done rebooting... Resetting ... ,,, Sun Ultra 1 SBus (UltraSPARC 143MHz), No Keyboard OpenBoot 3.11, 128 MB memory installed, Serial #5932401. Ethernet address 8:8:20:99:ab:77, Host ID: 8899ab77. ... Rebooting with command: boot ... Hostname: phys-schost-1 Booting as part of a cluster ... NOTICE: Node phys-schost-1: attempting to join cluster ... NOTICE: Node phys-schost-1: joined cluster ... The system is coming up. Please wait. The system is ready. phys-schost-1 console login: |
以下示例显示了重新引导节点 physschost-1 时控制台所输出的内容。 有关该节点的消息(例如关闭和启动通知)会出现在群集中其他节点的控制台上。
# scswitch -S -h phys-schost-1 # shutdown -g0 -y -i6 Shutdown started. Wed Mar 10 13:47:32 PST 2004 Changing to init state 6 - please wait Broadcast Message from root (console) on phys-schost-1 Wed Mar 10 13:47:32... THE SYSTEM phys-schost-1 IS BEING SHUT DOWN NOW ! ! ! Log off now or risk your files being damaged phys-schost-1# INIT: New run level: 6 The system is coming down. Please wait. System services are now being stopped. /etc/rc0.d/K05initrgm: Calling scswitch -S (evacuate) Print services already stopped. Mar 10 13:47:44 phys-schost-1 syslogd: going down on signal 15 umount: /global/.devices/node@2 busy umount: /global/.devices/node@1 busy The system is down. syncing file systems... done WARNING: CMM: Node being shut down. rebooting... ATI RAGE SDRAM BIOS P/N GR-xlint.007-4.330 * BIOS Lan-Console 2.0 Copyright (C) 1999-2001 Intel Corporation MAC ADDR: 00 02 47 31 38 3C AMIBIOS (C)1985-2002 American Megatrends Inc., Copyright 1996-2002 Intel Corporation SCB20.86B.1064.P18.0208191106 SCB2 Production BIOS Version 2.08 BIOS Build 1064 2 X Intel(R) Pentium(R) III CPU family 1400MHz Testing system memory, memory size=2048MB 2048MB Extended Memory Passed 512K L2 Cache SRAM Passed ATAPI CD-ROM SAMSUNG CD-ROM SN-124 Press <F2> to enter SETUP, <F12> Network Adaptec AIC-7899 SCSI BIOS v2.57S4 (c) 2000 Adaptec, Inc. All Rights Reserved. Press <Ctrl><A> for SCSISelect(TM) Utility! Ch B, SCSI ID: 0 SEAGATE ST336605LC 160 SCSI ID: 1 SEAGATE ST336605LC 160 SCSI ID: 6 ESG-SHV SCA HSBP M18 ASYN Ch A, SCSI ID: 2 SUN StorEdge 3310 160 SCSI ID: 3 SUN StorEdge 3310 160 AMIBIOS (C)1985-2002 American Megatrends Inc., Copyright 1996-2002 Intel Corporation SCB20.86B.1064.P18.0208191106 SCB2 Production BIOS Version 2.08 BIOS Build 1064 2 X Intel(R) Pentium(R) III CPU family 1400MHz Testing system memory, memory size=2048MB 2048MB Extended Memory Passed 512K L2 Cache SRAM Passed ATAPI CD-ROM SAMSUNG CD-ROM SN-124 SunOS - Intel Platform Edition Primary Boot Subsystem, vsn 2.0 Current Disk Partition Information Part# Status Type Start Length ================================================ 1 Active X86 BOOT 2428 21852 2 SOLARIS 24280 71662420 3 <unused> 4 <unused> Please select the partition you wish to boot: * * Solaris DCB loading /solaris/boot.bin SunOS Secondary Boot version 3.00 Solaris Intel Platform Edition Booting System Autobooting from bootpath: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/ pci8086,341a@7,1/sd@0,0:a If the system hardware has changed, or to boot from a different device, interrupt the autoboot process by pressing ESC. Press ESCape to interrupt autoboot in 2 seconds. Initializing system Please wait... Warning: Resource Conflict - both devices are added NON-ACPI device: ISY0050 Port: 3F0-3F5, 3F7; IRQ: 6; DMA: 2 ACPI device: ISY0050 Port: 3F2-3F3, 3F4-3F5, 3F7; IRQ: 6; DMA: 2 <<< Current Boot Parameters >>> Boot path: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/pci8086,341a@7,1/ sd@0,0:a Boot args: Type b [file-name] [boot-flags] <ENTER> to boot with options or i <ENTER> to enter boot interpreter or <ENTER> to boot with defaults <<< timeout in 5 seconds >>> Select (b)oot or (i)nterpreter: Size: 276915 + 22156 + 150372 Bytes /platform/i86pc/kernel/unix loaded - 0xac000 bytes used SunOS Release 5.9 Version on81-feature-patch:08/30/2003 32-bit Copyright 1983-2003 Sun Microsystems, Inc. All rights reserved. Use is subject to license terms. configuring IPv4 interfaces: e1000g2. Hostname: phys-schost-1 Booting as part of a cluster NOTICE: CMM: Node phys-schost-1 (nodeid = 1) with votecount = 1 added. NOTICE: CMM: Node phys-schost-2 (nodeid = 2) with votecount = 1 added. NOTICE: CMM: Quorum device 1 (/dev/did/rdsk/d1s2) added; votecount = 1, bitmask of nodes with configured paths = 0x3. WARNING: CMM: Initialization for quorum device /dev/did/rdsk/d1s2 failed with error EACCES. Will retry later. NOTICE: clcomm: Adapter e1000g3 constructed NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 being constructed NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 being initiated NOTICE: clcomm: Path phys-schost-1:e1000g3 - phys-schost-2:e1000g3 online NOTICE: clcomm: Adapter e1000g0 constructed NOTICE: clcomm: Path phys-schost-1:e1000g0 - phys-schost-2:e1000g0 being constructed NOTICE: CMM: Node phys-schost-1: attempting to join cluster. WARNING: CMM: Reading reservation keys from quorum device /dev/did/rdsk/d1s2 failed with error 2. NOTICE: CMM: Cluster has reached quorum. NOTICE: CMM: Node phys-schost-1 (nodeid = 1) is up; new incarnation number = 1068503958. NOTICE: CMM: Node phys-schost-2 (nodeid = 2) is up; new incarnation number = 1068496374. NOTICE: CMM: Cluster members: phys-schost-1 phys-schost-2. NOTICE: CMM: node reconfiguration #3 completed. NOTICE: CMM: Node phys-schost-1: joined cluster. NOTICE: clcomm: Path phys-schost-1:e1000g0 - phys-schost-2:e1000g0 being initiated NOTICE: clcomm: Path phys-schost-1:e1000g0 - phys-schost-2:e1000g0 online NOTICE: CMM: Retry of initialization for quorum device /dev/did/rdsk/d1s2 was successful. WARNING: mod_installdrv: no major number for rsmrdt ip: joining multicasts failed (18) on clprivnet0 - will use link layer broadcasts for multicast The system is coming up. Please wait. checking ufs filesystems /dev/rdsk/c1t0d0s5: is clean. NIS domain name is dev.eng.mycompany.com starting rpc services: rpcbind keyserv ypbind done. Setting netmask of e1000g2 to 255.255.255.0 Setting netmask of e1000g3 to 255.255.255.128 Setting netmask of e1000g0 to 255.255.255.128 Setting netmask of clprivnet0 to 255.255.255.0 Setting default IPv4 interface for multicast: add net 224.0/4: gateway phys-schost-1 syslog service starting. obtaining access to all attached disks ***************************************************************************** * * The X-server can not be started on display :0... * ***************************************************************************** volume management starting. Starting Fault Injection Server... The system is ready. phys-schost-1 console login: |
您还可以引导一个节点,使该节点不成为群集成员(即在非群集模式下)。 非群集模式在安装群集软件或执行某些管理过程(如修补一个节点)时有用。
成为要在非群集模式下启动的节点上的超级用户。
使用 scswitch 和 shutdown 命令关闭节点。
# scswitch -S -h node # shutdown -g0 -y -i0 |
检验节点是否显示 ok 提示或在“当前引导参数”屏幕中显示 Select (b)oot or (i)nterpreter 提示。
使用 boot(1M) 或结合使用 b 命令和 -x 选项在非群集模式下引导节点。
SPARC:
ok boot -x |
x86:
<<< Current Boot Parameters >>> Boot path: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/pci8086,341a@7,1/ sd@0,0:a Boot args: Type b [file-name] [boot-flags] <ENTER> to boot with options or i <ENTER> to enter boot interpreter or <ENTER> to boot with defaults <<< timeout in 5 seconds >>> Select (b)oot or (i)nterpreter: b -x |
以下示例说明关闭节点 phys-schost-1、然后在非群集模式下重新启动该节点时控制台所输出的内容。 -g0 选项将宽延期设置为零,-y 表示在遇到确认问题时,自动回答 yes,而 -i0 则调用运行级 0(零)。 群集中其它节点的控制台上也显示此节点的关闭消息。
# scswitch -S -h phys-schost-1 # shutdown -g0 -y -i0 Shutdown started. Wed Mar 10 13:47:32 phys-schost-1 cl_runtime: WARNING: CMM monitoring disabled. phys-schost-1# ... rg_name = schost-sa-1 ... offline node = phys-schost-2 ... num of node = 0 ... phys-schost-1# INIT: New run level: 0 The system is coming down. Please wait. System services are now being stopped. Print services stopped. syslogd: going down on signal 15 ... The system is down. syncing file systems... done WARNING: node phys-schost-1 is being shut down. Program terminated ok boot -x ... Not booting as part of cluster ... The system is ready. phys-schost-1 console login: |
以下示例说明关闭节点 phys-schost-1、然后在非群集模式下重新启动该节点时控制台所输出的内容。 -g0 选项将宽延期设置为零,-y 表示在遇到确认问题时,自动回答 yes,而 -i0 则调用运行级 0(零)。 群集中其它节点的控制台上也显示此节点的关闭消息。
# scswitch -S -h phys-schost-1 # shutdown -g0 -y -i0 Shutdown started. Wed Mar 10 13:47:32 PST 2004 phys-schost-1# INIT: New run level: 0 The system is coming down. Please wait. System services are now being stopped. Print services already stopped. Mar 10 13:47:44 phys-schost-1 syslogd: going down on signal 15 ... The system is down. syncing file systems... done WARNING: CMM: Node being shut down. Type any key to continue ATI RAGE SDRAM BIOS P/N GR-xlint.007-4.330 * BIOS Lan-Console 2.0 Copyright (C) 1999-2001 Intel Corporation MAC ADDR: 00 02 47 31 38 3C AMIBIOS (C)1985-2002 American Megatrends Inc., Copyright 1996-2002 Intel Corporation SCB20.86B.1064.P18.0208191106 SCB2 Production BIOS Version 2.08 BIOS Build 1064 2 X Intel(R) Pentium(R) III CPU family 1400MHz Testing system memory, memory size=2048MB 2048MB Extended Memory Passed 512K L2 Cache SRAM Passed ATAPI CD-ROM SAMSUNG CD-ROM SN-124 Press <F2> to enter SETUP, <F12> Network Adaptec AIC-7899 SCSI BIOS v2.57S4 (c) 2000 Adaptec, Inc. All Rights Reserved. Press <Ctrl><A> for SCSISelect(TM) Utility! Ch B, SCSI ID: 0 SEAGATE ST336605LC 160 SCSI ID: 1 SEAGATE ST336605LC 160 SCSI ID: 6 ESG-SHV SCA HSBP M18 ASYN Ch A, SCSI ID: 2 SUN StorEdge 3310 160 SCSI ID: 3 SUN StorEdge 3310 160 AMIBIOS (C)1985-2002 American Megatrends Inc., Copyright 1996-2002 Intel Corporation SCB20.86B.1064.P18.0208191106 SCB2 Production BIOS Version 2.08 BIOS Build 1064 2 X Intel(R) Pentium(R) III CPU family 1400MHz Testing system memory, memory size=2048MB 2048MB Extended Memory Passed 512K L2 Cache SRAM Passed ATAPI CD-ROM SAMSUNG CD-ROM SN-124 SunOS - Intel Platform Edition Primary Boot Subsystem, vsn 2.0 Current Disk Partition Information Part# Status Type Start Length ================================================ 1 Active X86 BOOT 2428 21852 2 SOLARIS 24280 71662420 3 <unused> 4 <unused> Please select the partition you wish to boot: * * Solaris DCB loading /solaris/boot.bin SunOS Secondary Boot version 3.00 Solaris Intel Platform Edition Booting System Autobooting from bootpath: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/ pci8086,341a@7,1/sd@0,0:a If the system hardware has changed, or to boot from a different device, interrupt the autoboot process by pressing ESC. Press ESCape to interrupt autoboot in 2 seconds. Initializing system Please wait... Warning: Resource Conflict - both devices are added NON-ACPI device: ISY0050 Port: 3F0-3F5, 3F7; IRQ: 6; DMA: 2 ACPI device: ISY0050 Port: 3F2-3F3, 3F4-3F5, 3F7; IRQ: 6; DMA: 2 <<< Current Boot Parameters >>> Boot path: /pci@0,0/pci8086,2545@3/pci8086,1460@1d/pci8086,341a@7,1/ sd@0,0:a Boot args: Type b [file-name] [boot-flags] <ENTER> to boot with options or i <ENTER> to enter boot interpreter or <ENTER> to boot with defaults <<< timeout in 5 seconds >>> Select (b)oot or (i)nterpreter: b -x ... Not booting as part of cluster ... The system is ready. phys-schost-1 console login: |
Solaris 和 Sun Cluster 软件均将错误消息写入 /var/adm/messages 文件,经过一段时间可能会占满 /var 文件系统。 如果群集节点的 /var 文件系统已满,可能无法在该节点上重新启动 Sun Cluster。 此外,您可能无法登录到此节点。
如果某个节点报告 /var 文件系统已满而且继续运行 Sun Cluster 服务,则请按以下过程来清理被占满的文件系统。 有关详细信息,请参阅 System Administration Guide: Advanced Administration 中的 “Viewing System Messages”。