Sun Cluster Software Installation Guide for Solaris OS

Procedure: How to Prepare the Cluster for Upgrade (Standard)

Perform this procedure to remove the cluster from production before you perform a standard upgrade. On the Solaris 10 OS, perform all steps from the global zone only.

Before You Begin

Perform the following tasks:

  1. Ensure that the cluster is functioning normally.

    1. View the current status of the cluster by running the following command from any node.


      phys-schost% scstat
      

      See the scstat(1M) man page for more information.

    2. Search the /var/adm/messages log on the same node for unresolved error messages or warning messages.
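
      For example, you might scan the log for recent problems with a command similar to the following; the search pattern is only an illustration, so adjust it to the messages that you are looking for.


      phys-schost# egrep -i "error|warning" /var/adm/messages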

    3. Check the volume-manager status.
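
      For example, depending on whether the cluster uses Solaris Volume Manager or VERITAS Volume Manager, you might run one of the following commands as superuser and review the output for error states.


      phys-schost# metastat
      phys-schost# vxprint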

  2. Notify users that cluster services will be unavailable during the upgrade.

  3. Become superuser on a node of the cluster.

  4. Take each resource group offline and disable all resources.

    Take offline all resource groups in the cluster, including those that are in non-global zones. Then disable all resources to prevent the cluster from bringing them online automatically if a node is mistakenly rebooted into cluster mode.

    • If you are upgrading from Sun Cluster 3.1 software and want to use the scsetup utility, perform the following steps:

      1. Start the scsetup utility.


        phys-schost# scsetup
        

        The scsetup Main Menu is displayed.

      2. Type the number that corresponds to the option for Resource groups and press the Return key.

        The Resource Group Menu is displayed.

      3. Type the number that corresponds to the option for Online/Offline or Switchover a resource group and press the Return key.

      4. Follow the prompts to take offline all resource groups and to put them in the unmanaged state.

      5. When all resource groups are offline, type q to return to the Resource Group Menu.

      6. Exit the scsetup utility.

        Type q to back out of each submenu or press Ctrl-C.

    • To use the command line, perform the following steps:

      1. Take each resource group offline.


        phys-schost# scswitch -F -g resource-group
        
        -F

        Switches a resource group offline.

        -g resource-group

        Specifies the name of the resource group to take offline.

      2. From any node, list all enabled resources in the cluster.


        phys-schost# scrgadm -pv | grep "Res enabled"
        (resource-group:resource) Res enabled: True
      3. Identify those resources that depend on other resources.

        You must disable dependent resources before you disable the resources that they depend on.
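
        One way to identify dependencies is to list verbose resource properties and search for dependency entries. The property names in the output vary by release, so the pattern shown here is only illustrative.


        phys-schost# scrgadm -pvv | grep -i dependencies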

      4. Disable each enabled resource in the cluster.


        phys-schost# scswitch -n -j resource
        
        -n

        Disables the specified resource.

        -j resource

        Specifies the resource.

        See the scswitch(1M) man page for more information.

      5. Verify that all resources are disabled.


        phys-schost# scrgadm -pv | grep "Res enabled"
        (resource-group:resource) Res enabled: False
      6. Move each resource group to the unmanaged state.


        phys-schost# scswitch -u -g resource-group
        
        -u

        Moves the specified resource group to the unmanaged state.

        -g resource-group

        Specifies the name of the resource group to move into the unmanaged state.

  5. Verify that all resources on all nodes are Offline and that all resource groups are in the Unmanaged state.


    phys-schost# scstat
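
    To limit the output to resource-group and resource status, you can instead run the scstat command with the -g option.


    phys-schost# scstat -g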
    
  6. For a two-node cluster that uses Sun StorEdge Availability Suite software or Sun StorageTek™ Availability Suite software, ensure that the configuration data for availability services resides on the quorum disk.

    The configuration data must reside on a quorum disk to ensure the proper functioning of Availability Suite after you upgrade the cluster software.

    1. Become superuser on a node of the cluster that runs Availability Suite software.

    2. Identify the device ID and the slice that is used by the Availability Suite configuration file.


      phys-schost# /usr/opt/SUNWscm/sbin/dscfg
      /dev/did/rdsk/dNsS
      

      In this example output, N is the device ID and S the slice of device N.

    3. Identify the existing quorum device.


      phys-schost# scstat -q
      -- Quorum Votes by Device --
                           Device Name         Present Possible Status
                           -----------         ------- -------- ------
         Device votes:     /dev/did/rdsk/dQsS  1       1        Online

      In this example output, dQsS is the existing quorum device.

    4. If the quorum device is not the same as the Availability Suite configuration-data device, move the configuration data to an available slice on the quorum device.


      phys-schost# dd if=`/usr/opt/SUNWesm/sbin/dscfg` of=/dev/did/rdsk/dQsS
      

      Note –

      You must use the name of the raw DID device, /dev/did/rdsk/, not the block DID device, /dev/did/dsk/.


    5. If you moved the configuration data, configure Availability Suite software to use the new location.

      As superuser, issue the following command on each node that runs Availability Suite software.


      phys-schost# /usr/opt/SUNWesm/sbin/dscfg -s /dev/did/rdsk/dQsS
      
  7. (Optional) If you are upgrading from a version of Sun Cluster 3.0 software and do not want your ntp.conf file renamed to ntp.conf.cluster, create an ntp.conf.cluster file.

    On each node, copy the /etc/inet/ntp.cluster file to /etc/inet/ntp.conf.cluster.


    phys-schost# cp /etc/inet/ntp.cluster /etc/inet/ntp.conf.cluster
    

    The existence of an ntp.conf.cluster file prevents upgrade processing from renaming the ntp.conf file. The ntp.conf file will still be used to synchronize NTP among the cluster nodes.

  8. Stop all applications that are running on each node of the cluster.

  9. Ensure that all shared data is backed up.

  10. If you will upgrade the Solaris OS and your cluster uses dual-string mediators for Solaris Volume Manager software, unconfigure your mediators.

    See Configuring Dual-String Mediators for more information about mediators.

    1. Run the following command to verify that no mediator data problems exist.


      phys-schost# medstat -s setname
      
      -s setname

      Specifies the disk set name.

      If the value in the Status field is Bad, repair the affected mediator host. Follow the procedure How to Fix Bad Mediator Data.

    2. List all mediators.

      Save this information for when you restore the mediators during the procedure How to Finish Upgrade to Sun Cluster 3.2 Software.
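
      For example, you might record the mediator hosts that the metaset command reports for each disk set. When mediators are configured, the output includes a Mediator Host(s) section.


      phys-schost# metaset -s setname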

    3. For a disk set that uses mediators, take ownership of the disk set if no node already has ownership.


      phys-schost# scswitch -z -D setname -h node
      
      -z

      Changes mastery.

      -D setname

      Specifies the name of the disk set.

      -h node

      Specifies the name of the node to become primary of the disk set.

    4. Unconfigure all mediators for the disk set.


      phys-schost# metaset -s setname -d -m mediator-host-list
      
      -s setname

      Specifies the disk set name.

      -d

      Deletes from the disk set.

      -m mediator-host-list

      Specifies the name of the node to remove as a mediator host for the disk set.

      See the mediator(7D) man page for further information about mediator-specific options to the metaset command.

    5. Repeat Step c through Step d for each remaining disk set that uses mediators.

  11. From one node, shut down the cluster.


    # scshutdown -g0 -y
    

    See the scshutdown(1M) man page for more information.

  12. Boot each node into noncluster mode.

    • On SPARC based systems, perform the following command:


      ok boot -x
      
    • On x86 based systems, perform the following commands:

      1. In the GRUB menu, use the arrow keys to select the appropriate Solaris entry and type e to edit its commands.

        The GRUB menu appears similar to the following:


        GNU GRUB version 0.95 (631K lower / 2095488K upper memory)
        +-------------------------------------------------------------------------+
        | Solaris 10 /sol_10_x86                                                  |
        | Solaris failsafe                                                        |
        |                                                                         |
        +-------------------------------------------------------------------------+
        Use the ^ and v keys to select which entry is highlighted.
        Press enter to boot the selected OS, 'e' to edit the
        commands before booting, or 'c' for a command-line.

        For more information about GRUB based booting, see Chapter 11, GRUB Based Booting (Tasks), in System Administration Guide: Basic Administration.

      2. In the boot parameters screen, use the arrow keys to select the kernel entry and type e to edit the entry.

        The GRUB boot parameters screen appears similar to the following:


        GNU GRUB version 0.95 (615K lower / 2095552K upper memory)
        +----------------------------------------------------------------------+
        | root (hd0,0,a)                                                       |
        | kernel /platform/i86pc/multiboot                                     |
        | module /platform/i86pc/boot_archive                                  |
        +----------------------------------------------------------------------+
        Use the ^ and v keys to select which entry is highlighted.
        Press 'b' to boot, 'e' to edit the selected command in the
        boot sequence, 'c' for a command-line, 'o' to open a new line
        after ('O' for before) the selected line, 'd' to remove the
        selected line, or escape to go back to the main menu.
      3. Add -x to the command to specify that the system boot into noncluster mode.


        [ Minimal BASH-like line editing is supported. For the first word, TAB
        lists possible command completions. Anywhere else TAB lists the possible
        completions of a device/filename. ESC at any time exits. ]
        
        grub edit> kernel /platform/i86pc/multiboot -x
        
      4. Press Enter to accept the change and return to the boot parameters screen.

        The screen displays the edited command.


        GNU GRUB version 0.95 (615K lower / 2095552K upper memory)
        +----------------------------------------------------------------------+
        | root (hd0,0,a)                                                       |
        | kernel /platform/i86pc/multiboot -x                                  |
        | module /platform/i86pc/boot_archive                                  |
        +----------------------------------------------------------------------+
        Use the ^ and v keys to select which entry is highlighted.
        Press 'b' to boot, 'e' to edit the selected command in the
        boot sequence, 'c' for a command-line, 'o' to open a new line
        after ('O' for before) the selected line, 'd' to remove the
        selected line, or escape to go back to the main menu.
      5. Type b to boot the node into noncluster mode.


        Note –

        This change to the kernel boot parameter command does not persist over the system boot. The next time you reboot the node, it will boot into cluster mode. To boot into noncluster mode instead, perform these steps again to add the -x option to the kernel boot parameter command.


  13. Ensure that each system disk is backed up.
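
    One common approach for a UFS root file system is the ufsdump command. The tape device in this example is only a placeholder; substitute your actual backup device and repeat the command for each file system on the system disk.


    phys-schost# ufsdump 0ucf /dev/rmt/0 /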

Next Steps

Upgrade software on each node.