Verifying the RAID Status of a Compute Server

Oracle recommends that you periodically verify the status of the compute server RAID devices. The impact is minimal. In contrast, the impact of corrective action varies depending on the specific issue uncovered, and can range from simple reconfiguration to an outage.

Log in to each compute server as root and perform the following procedure.

To verify the RAID status:

  1. Check the current disk controller configuration:
    # /opt/MegaRAID/MegaCli/MegaCli64 -AdpAllInfo -aALL | grep "Device Present" -A 8
    
                    Device Present
                    ================
    Virtual Drives    : 1 
      Degraded        : 0 
      Offline         : 0 
    Physical Devices  : 5 
      Disks           : 4 
      Critical Disks  : 0 
      Failed Disks    : 0 
    

    Verify that the output shows one virtual drive, none degraded or offline, five physical devices (one controller + four disks), four disks, and no critical or failed disks.

    If the output is different, then investigate and correct the problem. Degraded virtual drives usually indicate absent or failed physical disks. Replace critical disks and failed disks immediately. Otherwise, you risk data loss if the number of working disks in the server is less than the number required to sustain normal operation.

  2. Check the current virtual drive configuration:
    # /opt/MegaRAID/MegaCli/MegaCli64 CfgDsply -aALL | grep "Virtual Drive:";    \
    /opt/MegaRAID/MegaCli/MegaCli64 CfgDsply -aALL | grep "Number Of Drives";  \
    /opt/MegaRAID/MegaCli/MegaCli64 CfgDsply -aALL | grep "^State" 
    
    Virtual Drive                 : 0 (Target Id: 0)
    Number Of Drives              : 4
    State                         : Optimal
    

    Verify that virtual device 0 has four drives, and the state is Optimal. If the output is different, then investigate and correct the problem.

  3. Check the current physical drive configuration:
    # /opt/MegaRAID/MegaCli/MegaCli64 -PDList -aALL | grep "Firmware state"
    Firmware state: Online, Spun Up
    Firmware state: Online, Spun Up
    Firmware state: Online, Spun Up
    Firmware state: Online, Spun Up
    

    Ensure that all drives are Online, Spun Up. If the output is different, then investigate and correct the problem.

    If the output is different, then investigate and correct the problem. Degraded virtual drives usually indicate absent or failed physical disks. Replace critical disks and failed disks immediately. Otherwise, you risk data loss if the number of working disks in the server is less than the number required to sustain normal operation.