C H A P T E R 6 |
Disk Control and Monitor Utility (DCMU) for RHEL |
This chapter describes how to use the Disk Control and Monitor Utility (DCMU) on a Red Hat Enterprise Linux (RHEL4.7 or 5.2) 64-bit operating system. It includes the following sections:
The Disk Control and Monitor Utility (DCMU) controls and monitors all 48-disk drives on the Sun Fire X4540 server and provides the following features:
You may access these logs individually for specific information to aid in the administration or troubleshooting of the disk array. DCMU for RHEL consist of three components. Each component updates the FRU, SDR (Sensor Data Record), SEL (System Event Log) and service processor logs:
IPMItool is an additional command line component that may be used as a supplement to DCMU to access service processor sensor data, read the SEL and FRU information, and perform chassis power control operations via the server’s Service Processor.
To use Disk Control and Monitor Utility (DCMU), you must install the application. To install the application, you should perform the following steps:
The installation of DCMU consists of one step because the package is in rpm format.
To Install DCMU |
# rpm -i dcmu-rhel4-1.2-0.x86_64.rpm
# rpm -i dcmu-rhel5-1.2-0.x86_64.rpm
The following files are installed as components of the DCMU installation:
The initial installation of the DCMU components prepares the system for running the DCMU utilities described in this chapter. However, since the DCMU utilities also require that the IPMI service is running, you have two options before you can start using the DCMU utilities: manually start the IPMI service, or reboot the server, which will automatically start faultmond and IPMI.
If rebooting the server after the initial DCMU installation is not possible, and you wish to run DCMU utilities, you must first start the IPMI service by entering the following command:
Note - After the initial installation of DCMU, rebooting the server will start both IPMI and faultmond. |
If you need to uninstall DCMU, perform the following procedure.
To Uninstall DCMU |
The cfgdisk command is a command-line utility and that queries and provides status of all 48-disk drives located in the Sun Fire X4540 server. The cfgdisk command also allows you to connect and disconnect disk drives from the OS while also allowing you to monitor disks connected to the server.
Use the cfgdisk command to connect and disconnect disk drives and determine disk drive status by using the parameters shown in TABLE 6-1. The following options are supported:
This section contains examples of common cfgdisk commands. For more information and options, refer to the cfgdisk man page.
The following command displays a map of all disk drives:
Here is an example of cfgdisk command output listing physical slot number, logical name, and status information.
Use the cfgdisk command to disconnect a disk before performing the hot plug event of physically removing it. The following command shows an example of how to use cfgdisk to disconnect a disk drive.
# cfgdisk -o disconnect -d sata5/1
The command returns the following prompts. Enter Y at both to disconnect the disk:
Are you sure (y/n)? y Are you sure sata5/1 device is not in use(y/n)? y Device sata5/1 has been successfully disconnected. |
After performing the hot plug event of physically adding a disk into the system, use the cfgdisk command to connect it. The following command shows an example of how to use cfgdisk to connect a disk drive.
# cfgdisk -o connect -d sata5/1
The command returns the following:
Command has been issued to connect sata5/1 device, it may take a few seconds to connect sata5/1, check status by re-running cfgdisk command. |
The following command show how to use the cfgdisk command to display help information:
A component of DCMU, faultmond is a daemon which is started at boot time. It scans all disk at polling intervals, and then reports FRU, SDR, and SEL information to the service processor.
The command line parameters shown in TABLE 6-2 are available for faultmond:
This section contains examples of common faultmond commands. For more information, refer to the faultmond man page.
The following command shows the use of faultmond.
The command, for example, returns the following:
To start faultmond, enter the following command:
To stop faultmond, enter the following command:
To set the polling interval with faultmond, do the following:
1. Stop faultmond from the command line.
2. Set the polling interval. For example, to set the polling interval to be 1 minute, you would enter:
3. Check the polling interval.
The output would look like the following:
# ps -ef |grep faultmond root 15357 1 5 15:49 ? 00:00:00 faultmond -t 1 root 15364 15307 0 15:50 pts/4 00:00:00 grep faultmond |
A component of DCMU, hotplugmon is not a command line utility. It monitors hotplug events and then reports them to service processor.
This section describes how to view individual log file information from the command line.
The IPMItool is a command line utility that reads the sensor data repository (SDR) and displays sensor values, System Event Log (SEL), Field Replaceable Unit (FRU) inventory information, gets and sets LAN configuration parameters, and performs chassis power control operations via the server’s Service Processor.
For your convenience, this tool is available as an installable rpm (x86 64-bit or i386 32-bit versions) on your server’s Tools and Drivers CD, or as a downloadable package from the Sun software download site:
http://www.sun.com/servers/x64/x4540/downloads.jsp
For information about using the IPMItool, refer to your Sun Integrated Lights Out Manager 2.0 User’s Guide (820-1188)
The following commands show how view the SDR log file, either at the server:
# ipmitool -I lan -H SP-IP -U root -P SP-password sdr elist
Where SP-IP represents the IP address of the service processor and SP-password represents the password for the service processor.
The following commands show how view the FRU log file, either at the server:
# ipmitool -I lan -H SP-IP -U root -P SP-password fru
Where SP-IP represents the IP address of the service processor and SP-password represents the password for the service processor.
The following commands show how view the SEL log file, either at the server:
# ipmitool -I lan -H SP-IP -U root -P SP-password sel elist
Where SP-IP represents the IP address of the service processor and SP-password represents the password for the service processor.
All events and error information from DCMU are logged in syslog (default: /var/log/messages). These include hard drive hotplug events, drive disconnect and connect events, and drive fault polling events.
Copyright © 2009 Sun Microsystems, Inc. All rights reserved.