Sun Logo


System Management Services (SMS) 1.5 Administrator Guide

for Sun Firetrademark High-End Systems

817-7295-10



Contents

Preface

1. Introduction to System Management Services

Sun Fire High-End Systems

Redundant SCs

SMS Features

Features Provided in Previous Releases of SMS

New Features Provided in SMS 1.5 Release

VCMON

System Architecture

SMS Administration Environment

Network Connections for Administrators

SMS Operating System

procedure iconsmall spaceTo Begin Using the SC

SMS Console Window

procedure iconsmall spaceTo Display a Console Window Locally

Tilde Usage

Remote Console Session

Sun Management Center

2. SMS 1.5 Security

Domain Security Overview

System Controller Security Overview

Redundant System Controllers

SC Network Interfaces

Main SC Network Interfaces

Domain-to-SC Communication (scman0) Interface

SC-to-SC Communication (scman1) Interface

Spare SC Network Interfaces

Main and Spare Network Interface Sample Configurations

What Has Changed in SMS 1.5

Secure By Default (Fresh Install)

Secure By Choice (Upgrade)

Changes

Assumptions and Limitations

Obtaining Support

Initial/Fresh SMS Install Using smsinstall Command (Secure by Default)

Customizing the Solaris Security Toolkit

Optionally Securing Domains

SMS Upgrade Install Using smsupgrade Command (Secure by Choice)

Optionally Securing Domains

Using Solaris Security Toolkit to Secure the System Controller

Solaris Security Toolkit Software

Customizing the Solaris Security Toolkit Driver

procedure iconsmall spaceTo Disable I1 Traffic (Domain Exclusion)

procedure iconsmall spaceTo Enable ftp or telnet

procedure iconsmall spaceTo View the Contents of the Driver File

procedure iconsmall spaceTo Undo a Solaris Security Toolkit Run

3. SMS Administrative Privileges

Platform Administrator Group

Platform Operator Group

Platform Service Group

Domain Administrator Group

Domain Configuration Group

Superuser Privileges

All Privileges

4. SMS Internals

Startup Flow

SMS Daemons

Capacity on Demand Daemon

Domain Configuration Agent

Domain Status Monitoring Daemon

Domain X Server

Error and Fault Handling Daemon

Event Log Access Daemon

Event Reporting Daemon

Environmental Status Monitoring Daemon

Failover Management Daemon

FRU Access Daemon

Hardware Access Daemon

Key Management Daemon

Management Network Daemon

Message Logging Daemon

OpenBoot PROM Support Daemon

Platform Configuration Database Daemon

Platform Configuration

Domain Configuration

System Board Configuration

SMS Startup Daemon

Scripts

Spare Mode

Main Mode

Domain-Specific Process Startup

Monitoring and Restarts

SMS Shut Down

Task Management Daemon

Environment Variables

5. SMS Domain Configuration

Domain Configuration Units

Domain Configuration Requirements

DCU Assignment

Static Versus Dynamic Domain Configuration

Global Automatic Dynamic Reconfiguration

Configuration for Platform Administrators

Available Component List

procedure iconsmall spaceTo Set Up the Available Component List

Configuring Domains

procedure iconsmall spaceTo Name or Change Domain Names From the Command Line

procedure iconsmall spaceTo Add Boards to a Domain From the Command Line

procedure iconsmall spaceTo Delete Boards From a Domain From the Command Line

procedure iconsmall spaceTo Move Boards Between Domains From the Command Line

procedure iconsmall spaceTo Set Domain Defaults

procedure iconsmall spaceTo Obtain Board Status

procedure iconsmall spaceTo Obtain Domain Status

Virtual Time of Day

Setting the Date and Time

procedure iconsmall spaceTo Set the Date on the SC

procedure iconsmall spaceTo Set the Date for Domain eng2

procedure iconsmall spaceTo Display the Date on the SC

procedure iconsmall spaceTo Display the Date on Domain eng2

Configuring NTP

procedure iconsmall spaceTo Create the ntp.conf File

Virtual ID PROM

The flashupdate Command

Configuration for Domain Administrators

Configuring Domains

procedure iconsmall spaceTo Add Boards to a Domain From the Command Line

procedure iconsmall spaceTo Delete Boards From a Domain From the Command Line

procedure iconsmall spaceTo Move Boards Between Domains From the Command Line

procedure iconsmall spaceTo Set Domain Defaults

procedure iconsmall spaceTo Obtain Board Status

procedure iconsmall spaceTo Obtain Domain Status

procedure iconsmall spaceTo Obtain Device Status

Virtual Keyswitch

The setkeyswitch Command

procedure iconsmall spaceTo Set the Virtual Keyswitch On in Domain A

procedure iconsmall spaceTo Display the Virtual Keyswitch Setting in Domain A

Virtual NVRAM

Setting the OpenBoot PROM Variables

procedure iconsmall spaceTo Recover From a Repeated Domain Panic

procedure iconsmall spaceTo Set the OpenBoot PROM Security Mode Variable in Domain A

procedure iconsmall spaceTo See the OpenBoot PROM Variables

Degraded Configuration Preferences

The setbus Command

procedure iconsmall spaceTo Set All Buses on All Active Domains to Use Both CSBs

The showbus Command

procedure iconsmall spaceTo Show All Buses on All Active Domains

6. Automatic Diagnosis and Recovery

Automatic Diagnosis and Recovery Overview

Hardware Errors Associated with Domain Stops

Non-Fatal Domain Hardware Errors

POST-Detected Hardware Failures

Enabling Email Event Notification

procedure iconsmall spaceTo Enable Email Event Notification

Configuring an Email Template

Configuring the Email Control File

Testing Email Event Notification

procedure iconsmall spaceTo Test Email Event Notification

What To Do If Test Email Fails

Obtaining Diagnosis and Recovery Information

Reviewing Diagnosis Events

Reviewing the Event Log

7. Capacity on Demand

COD Overview

COD Licensing Process

COD RTU License Allocation

Instant Access CPUs

Instant Access CPUs as Hot Spares

Resource Monitoring

Getting Started with COD

Managing COD RTU Licenses

procedure iconsmall spaceTo Obtain and Add a COD RTU License Key to the COD License Database

procedure iconsmall spaceTo Delete a COD License Key From the COD License Database

procedure iconsmall spaceTo Review COD License Information

Activating COD Resources

procedure iconsmall spaceTo Enable Instant Access CPUs and Reserve Domain RTU Licenses

Monitoring COD Resources

COD System Boards

procedure iconsmall spaceTo Identify COD system Boards

COD Resource Usage

procedure iconsmall spaceTo View COD Usage By Resource

procedure iconsmall spaceTo View COD Usage by Domain

procedure iconsmall spaceTo View COD Usage by Resource and Domain

Deconfigured and Unlicensed COD CPUs

Other COD Information

8. Domain Control

Domain Boot

Keyswitch On

Power

procedure iconsmall spaceTo Power System Boards On and Off From the Command Line

procedure iconsmall spaceTo Recover From Power Failure

Domain-Requested Reboot

Automatic System Recovery (ASR)

Fast Boot

Domain Abort/Reset

Hardware Control

Power-On Self-Test (POST)

Blacklist Editing

Platform and Domain Blacklisting

procedure iconsmall spaceTo Blacklist a Component

procedure iconsmall spaceTo Remove a Component From the Blacklist

ASR Blacklist

Power Control

Fan Control

Hot-Plug

Hot-Unplug

Hot-Plug

SC Reset and Reboot

procedure iconsmall spaceTo Reset the Main or Spare SC

HPU LEDs

9. Domain Services

Management Network Overview

I1 Network

I2 Network

External Network Monitoring

MAN Daemons and Drivers

Management Network Services

Domain Console

Message Logging

Dynamic Reconfiguration

Network Boot and Solaris Software Installation

SC Heartbeats

10. Domain Status

Software Status

Status Commands

showboards Command

showdevices Command

showenvironment Command

showobpparams Command

showplatform Command

showxirstate Command

Solaris Software Heartbeat

Hardware Status

Hardware Configuration

Environmental Status

procedure iconsmall spaceTo Display the Environment Status for Domain A

Hardware Error Status

SC Hardware and Software Status

11. Domain Events

Message Logging

Log File Maintenance

Log File Management

Domain Reboot Events

Domain Reboot Initiation

Domain Boot Failure

Domain Panic Events

Domain Panic

Domain Panic Hang

Repeated Domain Panic

Solaris Software Hang Events

Hardware Configuration Events

Hot-Plug Events

Hot-Unplug Events

POST-Initiated Configuration Events

Environmental Events

Over-Temperature Events

Power Failure Events

Out-of-Range Voltage Events

Under-Power Events

Fan Failure Events

Clock Failure Events

Hardware Error Events

Domain Stop Events

CPU-Detected Events

Record Stop Events

Other ASIC Failure Events

SC Failure Events

12. SC Failover

Overview

Fault Monitoring

File Propagation

Failover Management

Startup

Main SC

Spare SC

Failover CLI Commands

setfailover Command

showfailover Command

Command Synchronization

cmdsync CLIs

initcmdsync Command

savecmdsync Command

cancelcmdsync Command

runcmdsync Command

showcmdsync Command

Data Synchronization

setdatasync Command

showdatasync Command

Failure and Recovery

Failover on Main SC (Main-Controlled Failover)

Fault on Main SC (Spare Takes Over Main Role)

I2 Network Fault

Fault on Main SC (I2 Network Is Also Down)

Fault Recovery and Reboot

I2 Fault Recovery

Reboot and Recovery

Client Failover Recovery

Security

13. SMS Utilities

SMS Backup Utility

SMS Restore Utility

SMS Version Utility

Version Switching

procedure iconsmall spaceTo Switch Between Two Adjacent, Co-resident Installations of SMS

SMS Configuration Utility

UNIX Groups

Access Control List (ACL)

Network Configuration

MAN Configuration

A. SMS man Pages

B. Error Messages

Installing SMSHelp

procedure iconsmall spaceTo Install theSUNWSMSjh Package

procedure iconsmall spaceTo Start smshelp

Types of Errors

Error Categories

Glossary

Index