C H A P T E R  1

Introduction to System Management Services

This manual describes the System Management Services (SMS) 1.4 software that is available with the Sun Fire high-end server system.

This chapter includes the following sections:


Sun Fire High-End Systems

Sun Fire high-end system is a member of the next-generation Sun Fire server family.

The system controller (SC) in Sun Fire high-end systems is a multifunction, Nordica- or Othello-based printed circuit board (PCB), which provides critical services and resources required for the operation and control of the Sun Fire system. In this book, the system controller is called the SC.

A Sun Fire high-end system is often referred to as the platform. System boards within the platform can be logically grouped together into separately bootable systems called dynamic system domains, or simply domains.

Up to 18 domains on the Sun Fire 15K, and up to 9 domains on the Sun Fire 12K can exist simultaneously on a single platform. (Domains are introduced in this chapter, and are described in more detail in SMS Configuration.) The system management services (SMS) software lets you control and monitor domains, as well as the platform itself.

The following list is an overview of the many services the SC provides for the Sun Fire system:

Redundant SCs

There are two SCs within Sun Fire platform. The SC that controls the platform is referred to as the main SC, while the other SC acts as a backup and is called the spare SC. The software running on the SC monitors the SCs to determine when an automatic failover should be performed.

We strongly recommend that the two SCs have the same configuration. This duplication includes the Solaris operating environment, SMS software, security modifications, patch installations, and all other system configurations.

The failover functionality between the SCs is controlled by the daemons running on the main and spare SCs. These daemons communicate across private communication paths built into the Sun Fire platform. Other than the communication of these daemons, there is no special trust relationship between the two SCs.

SMS software packages are installed on the SC. In addition, SMS communicates with the Sun Fire high-end system over an Ethernet connection, see Management Network Services.

SMS 1.4 cannot communicate with SMS 1.3 across the I2 network. If one of the SC's is running SMS 1.3 and the other is running SMS 1.4, the I2 network tests will fail, and the SC's will communicate through HASRAM. For information about the I2 network, see I2 Network.


SMS Features

SMS 1.4 supports Sun Fire high-end servers running the Solaris 8 and Solaris 9 operating environments.



Note - SMS 1.3 is available for Solaris 8 02/02 software. That version will not run on Solaris 9 software without replacing specific driver packages. Conversely, the Solaris 9 version of SMS 1.4 will not run on Solaris 8 02/02 software without replacing specific driver packages. For more information contact your Sun service representative.



SMS 1.4 is compatible with Sun Fire high-end system domains that are running the Solaris 8 02/02 through Solaris 9 operating environment. The commands provided with the SMS software can be used remotely.



Note - Graphical user interfaces for many of the commands in SMS are provided by Sun Management Center. For more information, see Sun Management Center.



SMS enables the platform administrator to perform the following tasks:

In addition, SMS:

SMS enables the domain administrator to perform the following tasks:

The following features are provided in this release of SMS:


System Architecture

SMS architecture is best described as distributed client-server. init(1M) starts (and restarts as necessary) one process: ssd(1M). ssd is responsible for monitoring all other SMS processes and restarting them as necessary. See FIGURE 3-1.

The Sun Fire high-end systems platform, the SC, and other workstations communicate over Ethernet. You perform SMS operations by entering commands on the SC console after remotely logging in to the SC from another workstation on the local area network. You must log in as a user with the appropriate platform or domain privileges if you want to perform SMS operations (such as monitoring and controlling the platform).



Note - If SMS is stopped on the main SC and the other SC is powered off, the domains gracefully shutdown and the platform is powered down. If the remaining SC is simply powered off without a shutdown of SMS, SMS won't have time to power off the platform and the domains will crash.



Dual system controllers are supported within the Sun Fire high-end systems platform. One SC is designated as the primary or main system controller, and the other is designated as the spare system controller. If the main SC fails, the failover capability automatically switches to the spare SC as described in SC Failover.

Most domain configurable units are active components and you need to check the system state before powering off any DCU.



Note - Circuit breakers must be on whenever a board is present, including expander boards, whether or not the board is powered on.



For details, see Power Control.


SMS Administration Environment

Administration tasks on the Sun Fire high-end system are secured by group privilege requirements. Upon installation, SMS installs the following 39 UNIX groups to the /etc/group file.

smsconfig(1M) allows an administrator to add, remove, and list members of platform and domain groups as well as set platform and domain directory privileges using the -a, -r, and -l options.

smsconfig also can configure SMS to use alternate group names including NIS managed groups using the -g option. Group information entries can come from any of the sources for groups specified in the/etc/nsswitch.conf file (refer to nsswitch.conf(4)). For instance, if domain A was known by its domain tag as the "Production Domain," an administrator could create a NIS group with the same name and configure SMS to use this group as the domain A administrator group instead of the default, dmnaadmn. For more information, refer to the System Management Services (SMS) 1.4 Installation Guide, Administration Privileges, and refer to the smsconfig man page.

Network Connections for Administrators

The nature of the Sun Fire high-end systems physical architecture, with an embedded system controller, as well as the supported administrative model (with multiple administrative privileges, and hence multiple administrators) dictates that an administrator utilize a remote network connection from a workstation to access SMS command interfaces to manage the Sun Fire high-end system.



caution icon

Caution - Shutting down a remote workstation while a tip session is active into a Sun Fire high-end system SC will bring both SCs down to the OpenBoot OK prompt. This will not affect the domains and after powering the remote system back on you can restore the SCs by typing go at the OK prompt; however, you should end all tip sessions before shutting down a remote workstation.



Since the administrators provide information to verify their identity (passwords) and might possibly need to display sensitive data, it is important that the remote network connection be secure. Physical separation of the administrative networks provides some security on the Sun Fire high-end system. Multiple external physical network connections are available on each SC. SMS software supports up to two external network communities.

For more information on Sun Fire high-end system networks, see Management Network Services. For more information on securing the Sun Fire high-end system see Security Options.

SMS Operating Environment

You can interact with the SC and the domains on the Sun Fire high-end system by using SMS commands.

SMS provides a command-line interface to the various functions and features it contains.


procedure icon  To Begin Using the SC

1. Boot the SC.

For the examples in this guide, the sc_name is sc0 and sms-user is the user-name of the administrator, operator, configurator, or service personnel logged onto the system.

The privileges allotted to the user are determined by the platform or domain groups to which the user belongs. In these examples, the sms-user is assumed to have both platform and domain administrator privileges, unless otherwise noted.

For more information on the function and creation of SMS user groups, refer to the System Management Services (SMS) 1.4 Installation Guide and see Administration Privileges.



Note - This procedure assumes that smsconfig -m has already been run. If smsconfig -m has not been run, you will receive the following error when SMS attempts to start and SMS will exit.

sms: smsconfig(1M) has not been run. Unable to start sms services.



2. Log in to the SC and verify that SMS software startup has completed. Type:

sc0:sms-user:> showplatform

3. Wait until showplatform finishes displaying platform status.

At this point you can begin using SMS programs.

SMS Console Window

An SMS console window provides a command-line interface from the SC to the Solaris operating environment on the domain(s).


procedure icon  To Display a Console Window Locally

1. Log in to the SC, if you have not already done so.


Note - You must have domain privileges for the domain on which you wish to run console.



2. Type:

sc0:sms-user:> console -d domain_indicator  option

where:

-d

Specifies the domain using a domain_indicator:

domain_id - ID for a domain. Valid domain_ids are 'A'...'R' and are case insensitive.

domain_tag - Name assigned to a domain using addtag(1M).

-f

Force

Opens a domain console window with "locked write" permission, terminates all other open sessions, and prevents new ones from being opened. This constitutes an "exclusive session." Use it only when you need exclusive use of the console (for example, for private debugging). To restore multiple-session mode, either release the lock (~^) or terminate the console session (~.).

-g

Grab

Opens a console window with "unlocked write" permission. If another session has "unlocked write" permission, the new console window takes it away. If another session has "locked" permission, this request is denied and a read-only session is started.

-l

Lock

Opens a console window with "locked write" permission. If another session has "unlocked write" permission, the new console window takes it away. If another session has "locked" permission, the request is denied and a read-only session is started.

-r

Read Only

Opens a console window in read-only mode


console creates a remote connection to the domain's virtual console driver, making the window in which the command is executed a "console window" for the specified domain (domain_id or domain_tag).

If console is invoked without any options when no other console windows are running for that domain, it comes up in exclusive "locked write" mode session.

If console is invoked without any options when one or more non-exclusive console windows are running for that domain, it will come up in "read-only" mode.

Locked write permission is more secure. It can only be taken away if another console is opened using console -f or if ~*(tilde-asterisk) is entered from another running console window. In both cases, the new console session is an "exclusive session", and all other sessions are forcibly detached from the domain virtual console.

console can utilize either Input Output Static Random Access Memory (IOSRAM) or the internal management network for domain console communication. You can manually toggle the communication path by using the ~= (tilde-equal sign) command. Doing so is useful if the network becomes inoperable, in which case the console sessions appears to be hung.

Many console sessions can be attached simultaneously to a domain, but only one console will have write permissions; all others will have read-only permissions. Write permissions are in either "locked" or "unlocked" mode.

Tilde Usage

In a domain console window, a tilde ( ~ ) that appears as the first character of a line is interpreted as an escape signal that directs console to perform some special action, as follows:

Character

Description

~?

Status message

~.

Disconnects console session

~#

Breaks to OpenBoottrademark PROM or kadb

~@

Acquires unlocked write permission. See option -g

~^

Releases write permission

~=

Toggles the communication path between the network and IOSRAM interfaces. You can use ~= only in private mode (see ~* ).

~&

Acquires locked write permission; see option -l . You may issue this signal during a read-only or unlocked write session.

~*

Acquires locked write permission, terminates all other open sessions, and prevent new sessions from being opened; see option -f . To restore multiple-session mode, either release the lock or terminate this session.


rlogin also processes tilde-escape sequences whenever a tilde is seen at the beginning of a new line. If you need to send a tilde sequence at the beginning of a line and you are connected using rlogin, use two tildes (the first escapes the second for rlogin). Alternatively, do not enter a tilde at the beginning of a line when running inside of an rlogin window.

If you use a kill -9 command to terminate a console session, the window or terminal in which the console command was executed goes into raw mode, and appears hung. Type CTRL-j, then stty sane, then CTRL-j to escape this condition,

In the domain console window, vi(1) runs properly and the escape sequences (tilde commands) work as intended only if the environment variable TERM has the same setting as that of the console window.

For example:

sc0:sms-user:> setenv TERM xterm 

If you need to resize the window, type:

sc0:sms-user:> stty rows 20 cols 80

For more information on domain console, see Domain Console and refer to the console man page.

Remote Console Session

In the event that a system controller hangs and that console cannot be reached directly, SMS provides the smsconnectsc command to remotely connect to the hung SC. This command works from either the main or spare SC. For more information and examples, refer to the smsconnectsc man page.

Your other option is to connect to the hung SC using an external console connection but you cannot run smsconnectsc and use an external console at the same time.


Sun Management Center

Sun Management Center for Sun Fire high-end systems is an extensible monitoring and management tool that provides a system administrator with the ability to manage the Sun Fire high-end system. Sun Management Center integrates standard SNMP based management structures with new intelligent and autonomous agent and management technology based on the client/ server paradigm.

Sun Management Center is used as the GUI and SNMP manager/agent infrastructure for the Sun Fire system. The features and functions of Sun Management Center are not covered in this manual. For more information, refer to the latest Sun Management Center documentation available at www.docs.sun.com.