C H A P T E R 3 |
This chapter explains how to install Sun HPC ClusterTools software on Solaris using the installation utilities. For information about how to install Sun HPC ClusterTools software on a Linux-based system, see Chapter 4.
The Sun HPC ClusterTools software command-line interface (CLI) utilities are:
ctremove – Remove the software from the cluster nodes.
Before you can install and configure the software, you must make sure you have downloaded the correct software archive for your hardware platform. Once you have downloaded the software, you need to extract it to the correct directory. If you have installed a previous version of the software, there are additional steps you need to do to prepare for installation. The following procedure explains these steps.
Download and extract the archive file containing the Sun HPC ClusterTools software to a location (such as a shared file system) that is visible to all the nodes in the cluster.
Make sure that all the compute and administrative nodes have access to the shared file system.
Make sure that the file system is readable by superuser and accessible through a common path from all nodes.
For centralized installations, do this on the central host as well.
You can obtain the correct HPC ClusterTools archive file for your platform from the following location:
Log in as superuser on the system from which you will be initiating the command: the central host or, if initiating the command locally, directly on a cluster node.
If the Sun HPC ClusterTools software has not been installed yet, change directory to one of the following:
If the software was previously installed, remove the previous version of the software by changing the directory to $INSTALL_LOC/SUNWhpc/HPC8.2/bin/Install_Utilities/bin.
$INSTALL_LOC is the location where the software was installed.
To remove a previous version of the software, see Remove the Previous Sun HPC ClusterTools Software Installation.
You can now start using the CLI commands. They are described separately below, with examples of common applications given for each.
For usage information on any command, either enter the command without options or with the –h option.
# ./command or # ./command –h |
If you have previously installed Sun HPC ClusterTools software on your cluster nodes, you must remove it before you can reinstall the software.
Use the ctremove command to remove Sun HPC ClusterTools software from cluster nodes.
See Table 1 for a summary of the ctremove options.
Note - If the nodes are active at the time ctremove is initiated, they will be deactivated automatically before the removal process begins. |
This section shows examples of software removal in which the ctremove command is initiated from a central host.
# ./ctremove –N /tmp/nodelist –r rsh |
This command removes the software from the nodes listed in /tmp/nodelist. The remote connection method is rsh.
# ./ctremove –N /tmp/nodelist –r rsh –k /tmp/cluster-logs –g |
This command is the same as the command in the previous section, except that it specifies the options –k and –g.
Use the ctinstall command to install Sun HPC ClusterTools software on cluster nodes. See TABLE 3-2 for a summary of the ctinstall options.
Note - The options –g, –k, –n, –N, –r, and –S are incompatible with local (non-centralized) installations. If the –l option is used with any of these options, an error message is displayed. |
This section shows examples of software installations in which the ctinstall command is initiated from a central host.
# ./ctinstall –n node1,node2 –r rsh |
This command installs the full Sun HPC ClusterTools software suite on node1 and node2 from a central host. The node list is specified on the command line. The remote connection method is ssh. This requires a trusted hosts setup.
The software will be ready for use when the installation process completes.
# ./ctinstall –n node1,node2 –r ssh |
This example is the same as that in the previous section, except that the remote connection method is ssh. This method requires that the initiating node be able to log in as superuser to the target nodes without being prompted for any interaction, such as a password.
# ./ctinstall –N /tmp/nodelist –r telnet |
This command installs the full Sun HPC ClusterTools software suite on the set of nodes listed in the file /tmp/nodelist from a central host. A node list file is particularly useful when you have a large set of nodes or you want to run operations on the same set of nodes repeatedly.
The node list file has the following contents:
# Node list for the above example node1 node2 |
The remote connection method is telnet. All cluster nodes must share the same password. If some nodes do not use the same password as others, install the software in groups, each group consisting of nodes that use a common password.
The software will be ready for use when the installation process completes.
# ./ctinstall –N /tmp/nodelist –r telnet –k /tmp/cluster-logs –g |
The command in this section is the same as that shown in the previous section, except that it includes the –k and –g options.
In this example, the –k option causes the local log files of all specified nodes to be saved in /tmp/cluster-logs on the central host.
The –g option causes a pair of node list files to be created on the central host in /var/sadm/system/logs/hpc/nodelists. One file, ctinstall.pass$$, contains a list of the nodes on which the installation was successful. The other file, ctinstall.fail$$, lists the nodes on which the installation was unsuccessful. The $$ symbol is replaced by the process number associated with the installation.
These generated node list files can then be used for command retries or in subsequent operations using the –N switch.
The following command installs only the specified software packages.
# ./ctinstall –N /tmp/nodelist –r telnet –p SUNWompi |
This command installs the packages SUNWompi and SUNWompimn on the set of nodes listed in the file /tmp/nodelist. No other packages are installed. The remote connection method is telnet.
Solaris OS Packages lists the packages in the Sun HPC ClusterTools 8.2 installation.
The –p option can be useful if individual packages were not installed on the nodes by ctinstall.
# ./ctinstall –N /tmp/nodelist –r rsh |
This command installs and activates the full Sun HPC ClusterTools software suite on the nodes listed in the file /tmp/nodelist. The remote connection method is rsh.
The following is the Solaris OS package breakdown for the Sun HPC ClusterTools 8.2 (Open MPI) release.
This section shows examples of software installations in which the ctinstall command is initiated on the local node. The installation options shown in this example include:
Note - The options –g, –k, –n, –N, –r, and –S are incompatible with local (non-centralized) installations. If the –l option is used with any of these options, an error message is displayed. |
# ./ctinstall –l |
This command installs the full Sun HPC ClusterTools software suite on the local node only.
# ./ctinstall –l –p SUNWompi,SUNWompimn |
The command in this section installs the packages SUNWompi and SUNWompimn on the local node.
Solaris OS Packages lists the packages in the Sun HPC ClusterTools 8.2 installation.
This chapter describes the final steps needed to get your Sun HPC system ready for use after installing the software, and information about viewing installation log files.
Open Run-Time Environment (ORTE) provides close integration with several distributed resource management (DRM) systems, such as Sun Grid Engine. For information on how close integration works and how to set up the integration for each of the supported resource managers, refer to the Sun HPC ClusterTools Software Administrator’s Guide.
The Sun HPC ClusterTools 8.2 Software User’s Guide contains information about how to set up close integration with Sun Grid Engine.
The Sun HPC ClusterTools 8.2 installation tools log information about installation-related tasks locally on the nodes where installation tasks are performed. The default location for the log files is /var/sadm/system/logs/hpc. If installation tasks are initiated from a central host, a summary log file is also created on the central host.
Two types of log files are created locally on each cluster node where installation operations take place.
Task-specific logs – Separate log files are created for each installation-related task. They are:
These log files contain detailed logging information for the most recent associated task. Each time a task is repeated, its log file is overwritten.
History log – A ct_history.log file is created to store all installation-related tasks performed on the local node. This provides a convenient record of the Sun HPC ClusterTools 8.2 software installation history on the local node. Each time a new installation task is performed on the node, a new log entry is appended to the history log.
These node specific installation log files are created regardless of the installation method used, local or centralized.
When installation tasks are initiated from a central host, a summary log file named ct_summary.log is created on the central host. This log file records the final summary report that is generated by the CLI. The ct_summary.log is not overwritten when a new task is performed. As with the ct_history.log file, new entries are appended to the summary log file.
Copyright © 2009, Sun Microsystems, Inc. All rights reserved.