C H A P T E R  7

Monitoring the System Using the CLI

This chapter describes the CLI commands you can use to monitor the 5800 system. For information on using the GUI to monitor the system, see Monitoring the 5800 System Using the GUI.

This chapter contains the following sections:



Note - For instructions on accessing the CLI commands described in this chapter, see Using the Administrative Interfaces.



Obtaining System Status



Note - In a multicell configuration, you can specify a cell ID with the -c or --cellid option to see information about a particular cell. If you do not specify a cell ID, information about all cells is displayed. If you use -v (--verbose) or -i (--interval) options with the sysstat command in a multicell configuration, you must specify the cell ID.


For example:


ST5800 $ sysstat
Cell 23: Online. Estimated Free Space: 14.96 TB
16 nodes online, 64 disks online.
Data VIP 10.7.226.22, Admin VIP 10.7.226.21
Data services Online, Query Engine Status: HAFaultTolerant
Data Integrity check not completed since boot
Data Reliability check not completed since boot
Query Integrity not established
NDMP status: Backup ready.

The output that the sysstat command produces is explained below. Data reported is for all online disks in the entire system.

HAFaultTolerant - Query services are available and highly fault tolerant.

FaultTolerant - Query services are available, but not as fault tolerant as in the HAFaultTolerant state.

Operational - Query services are available, but not fault tolerant.

Starting - The query engine is starting up. This process may include creating the query database or recreating the connection to the database. Query services are not available during this process.

Unknown - The query engine is in an undetermined state. This may be because it is too early in the starting up process to establish a connection to the query engine, or the query engine is in the process of restarting.

Stopped - The query engine is stopped; query services are not available.

Unavailable - The query engine is not returning any status at this time, probably because it is in a transitional state; query services may not be available.

Nonoperational - The query engine is corrupted; no query services will be available until the system has completed recreating the engine.

Verbose output includes the online/offline status of each node and disk in the system. The online/offline status reported by the command refers to the logical system status. To see the state of hardware components, refer to the hwstat command described in Obtaining FRU Listings.

For example:


ST5800 $ sysstat --verbose
NODE-101     [ONLINE]
DISK-101:0   [ONLINE]
DISK-101:1   [OFFLINE]
DISK-101:2   [ONLINE]
DISK-101:3   [ONLINE]
NODE-102     [ONLINE]
DISK-102:0   [ONLINE]
DISK-102:1   [ONLINE]
DISK-102:2   [ONLINE]
DISK-102:3   [ONLINE]
NODE-103     [ONLINE]
DISK-103:0   [ONLINE]

 



Note - If a disk is listed as off line, the disk should be replaced.



Displaying Performance Statistics

The metrics displayed reflect activity during a specified time internal (default is 15 seconds). There is a delay between the time an action occurred on the system and the time it is displayed by the perfstats command.

For example:


ST5800 $ perfstats
 
Cell Performance Statistics:
 
                                 Avg         Avg
                     # Ops      Op/sec     KB/sec
                   -------- ----------  ------------
  Add MD:                 0     0.00          0.00
  Store:                  0     0.00          0.00
  Retrieve:               1     0.20          0.15
  Retrieve MD:            0     0.00          0.00
  Delete:                 0     0.00             -
  Query:                687    22.90             -
  WebDAV Put:             0     0.00          0.00
  WebDAV Get:             0     0.00          0.00
 
Hive Performance Statistics:
 
Load 1m: 4.12 Load 5m: 4.21 Load 15m: 4.43
Disk Used: 241.28 GB  Disk Total: 13.38 TB  Usage: 1.8% 



Note - To specify that the perfstats command should run indefinitely, use --howlong 0.


For example:


ST5800 $ perfstats --node NODE-101
 
NODE-101 Performance Statistics:
 
 
 
                                 Avg         Avg
                     # Ops      Op/sec     KB/sec
                   -------- ----------  ------------
  Add MD:                 0     0.00          0.00
  Store:                  0     0.00          0.00
  Retrieve:               1     0.20          0.15
  Retrieve MD:            0     0.00          0.00
  Delete:                 0     0.00             -
  Query:                687    22.90             -
  WebDAV Put:             0     0.00          0.00
  WebDAV Get:             0     0.00          0.00
 
Hive Performance Statistics:
 
Load 1m: 4.12 Load 5m: 4.21 Load 15m: 4.43
Disk Used: 241.28 GB  Disk Total: 13.38 TB  Usage: 1.8%


Viewing the System Software Version

For example:

ST5800 $ version
ST5800 1.1 release [1.1-11076]

For example:



Note - In normal operation, all nodes should be running the same version of Service Management Daughter Card (SMDC) and the same version of Basic Input Output System (BIOS).


 

ST5800 $ version --verbose
ST5800 1.1 release [1.1-11076]
Service Node:
        BIOS Version: 1.1.3
        SMDC Version: 4.13
Switch:
        Overlay Version (sw#1): 11068
        Overlay Version (sw#2): 11068
 
NODE-101:
        BIOS version: 0.1.8
        SMDC version: 4.18
NODE-102:
        BIOS version: 0.1.8
        SMDC version: 4.18
NODE-103:
        BIOS version: 0.1.8
        SMDC version: 4.18
NODE-104:
        BIOS version: 0.1.8
        SMDC version: 4.18
 
.
.
.
 
 
ST5800 $


Obtaining FRU Listings


Obtaining Disk Status

Use the df command to display a summary of disk usage. In a multicell configuration, you can specify a cell ID with the -c or --cellid option to see information about a particular cell. If you do not specify a cell ID, information about all cells is displayed.



Note - In a multicell configuration, the df -p or df --physical option, which displays the physical free space on all disks, requires a cell ID.


Note the following information about the utilization numbers displayed:

The displayed numbers refer to the logical space used by, or available for, the user’s data storage. Reserved space is reserved by the system for data recovery operations, and is not available to the user. Total space is a sum of Available + Used + Reserved. The use percentage is calculated as Used / (Used + Available).

For example:


ST5800 $ df -h
Contacting all cells, please wait.
 
All Cells:
Total:    52.18 TB; Avail:    51.33 TB; Used:   864.16 GB; Usage: 1.6%
 
Cell  22:
Total:    26.71 TB; Avail:    26.29 TB; Used:   438.70 GB; Usage: 1.6%
 
Cell  23:
Total:    25.46 TB; Avail:    25.05 TB; Used:   425.46 GB; Usage: 1.6%



Note - The system can no longer accept objects for storage when any disk in the system reaches 80% capacity.


For example:


ST5800 $ df --physical
All sizes expressed in 1K blocks
DISK-101:0: Total: 449128448; Avail: 434057216; Used:  15071232; Usage: 3.4%
DISK-101:1: Total: 449128448; Avail: 444561408; Used:   4567040; Usage: 1.0%
DISK-101:2: Total: 449128448; Avail: 444561408; Used:   4567040; Usage: 1.0%
DISK-101:3: Total: 449128448; Avail: 444561408; Used:   4567040; Usage: 1.0%
DISK-102:0: Total: 449128448; Avail: 444561408; Used:   4567040; Usage: 1.0%.
.
.
.

 


Obtaining Voltage, Temperature, and Fan Speed Data

Use the command sensors to display voltage, temperature, and fan speed data, as collected by system sensors.

For example:


ST5800 $ sensors
 
NODE-101:
        DDR Voltage           2.60 Volts
        CPU Voltage           1.42 Volts
        VCC 3.3V              3.32 Volts
        VCC 5V                5.12 Volts
        VCC 12V               12.03 Volts
        Battery Voltage       2.98 Volts
        CPU Temperature       49 degrees C
        System Temperature    32 degrees C
        System Fan 1 speed    11340 RPM
        System Fan 2 speed    11340 RPM
        System Fan 3 speed    11070 RPM
        System Fan 4 speed    10980 RPM
        System Fan 5 speed    11070 RPM
 
NODE-102:
        DDR Voltage           2.60 Volts
        CPU Voltage           1.43 Volts
        VCC 3.3V              3.32 Volts
        VCC 5V                5.10 Volts
        VCC 12V               12.10 Volts
        Battery Voltage       2.98 Volts
        CPU Temperature       49 degrees C
        System Temperature    33 degrees C
        System Fan 1 speed    11700 RPM
        System Fan 2 speed    11430 RPM
.
.
.
 
ST5800 $