6.7 Checking Exadata Storage Servers

After powering up the rack, check the Oracle Exadata Storage Servers.

Perform the following checks on each Exadata Storage Server:

  1. Connect to the storage server console.

    Use one of the following connection options:

    • Attach a serial cable between a console and the SER MGT port on the server.

      Connections through the modular jack serial port must use shielded cables to comply with radio frequency emission limits.

      Set the terminal device communication properties to these values: 9600 baud, 8 bit, no parity, 1 stop bit.

      After you connect to ILOM, run the start /SP/console command to connect to the host console.

      For more details, see Connect to Oracle ILOM.

    • Attach an Ethernet cable to the NET MGT port on the server and then connect using a workstation on the network.

      After you connect to ILOM, run the start /SP/console command to connect to the host console.

      For more details, see Connect to Oracle ILOM.

    • Connect using the KVM switch, if one is available.
  2. Log in to the storage server as the root user.

    Note:

    If you do not have the password for the root user, then contact Oracle Support Services.
  3. Verify the SysSN label for the storage server matches the product_serial_number value using the following command:

    # ipmitool sunoem cli "show /SYS product_serial_number"
    Connected.Use ^ D to exit.
    ->show /SYS product_serial_number
    /SYS
    Properties:
    product_serial_number =0937XFG03B
    ->Session closed
    Disconnected
    
  4. Check the hardware and firmware using the following command:

    # /opt/oracle.SupportTools/CheckHWnFWProfile
    

    If output does not show success, then examine the output for mismatched hardware or firmware and take steps to correct those as explained in the following steps.

  5. Verify the disks on the server are visible and online using the following command. The disks are numbered slot 0 to slot 11.

    • If you are using Oracle Exadata System Software release 19c or later, then get the enclosure ID:

      # cd /opt/MegaRAID/storcli
      # ./storcli64 /c0/eall show
      Controller = 0
      Status = Success
      Description = None
      
      Properties :
      ==========
      
      -----------------------------------------------------------------------------------------
      EID State Slots PD PS Fans TSs Alms SIM Port#                      ProdID VendorSpecific
      -----------------------------------------------------------------------------------------
        8 OK       14 12  0    0   0    0   0 Port 0 - 3 & Port 4 - 7 x8 X5-2L  SPMS   v`
      252 OK        8  0  0    0   0    0   1 Internal                   SGPIO        
      -----------------------------------------------------------------------------------------
      
      EID-Enclosure Device ID |PD-Physical drive count |PS-Power Supply count|
      TSs-Temperature sensor count |Alms-Alarm count |SIM-SIM Count

      Then show the state of the physical devices in that enclosure:

      # cd /opt/MegaRAID/storcli
      # ./storcli64 /c0/e8/sall show all | grep "8:"
      8:0      16 Onln   0 7.151 TB SAS  HDD N   N  512B H7280A520SUN8.0T U
      8:1      17 Onln   1 7.151 TB SAS  HDD N   N  512B H7280A520SUN8.0T U
      8:2      20 Onln   2 7.151 TB SAS  HDD N   N  512B H7280A520SUN8.0T U
      8:3      15 Onln   3 7.151 TB SAS  HDD N   N  512B H7280A520SUN8.0T U
      8:4      12 Onln   4 7.151 TB SAS  HDD N   N  512B H7280A520SUN8.0T U
      8:5      18 Onln   5 7.151 TB SAS  HDD N   N  512B H7280A520SUN8.0T U
      8:6      11 Onln   6 7.151 TB SAS  HDD N   N  512B H7280A520SUN8.0T U
      8:7       9 Onln   7 7.151 TB SAS  HDD N   N  512B H7280A520SUN8.0T U
      8:8      14 Onln   8 7.151 TB SAS  HDD N   N  512B H7280A520SUN8.0T U
      8:9      21 Onln  11 7.151 TB SAS  HDD N   N  512B H7280B520SUN8.0T U
      8:10     13 Onln   9 7.151 TB SAS  HDD N   N  512B H7280A520SUN8.0T U
      8:11     10 Onln  10 7.151 TB SAS  HDD N   N  512B H7280A520SUN8.0T U

      The rows displayed are from a table that displays the following information:

      
      EID:Slt DID State DG     Size Intf Med SED PI SeSz Model            Sp
      -----------------------------------------------------------------------
      ...
      EID-Enclosure Device ID|Slt-Slot No.|DID-Device ID|DG-DriveGroup
      DHS-Dedicated Hot Spare|UGood-Unconfigured Good|GHS-Global Hotspare
      UBad-Unconfigured Bad|Onln-Online|Offln-Offline|Intf-Interface
      Med-Media Type|SED-Self Encryptive Drive|PI-Protection Info
      SeSz-Sector Size|Sp-Spun|U-Up|D-Down|T-Transition|F-Foreign
      UGUnsp-Unsupported|UGShld-UnConfigured shielded|HSPShld-Hotspare shielded
      CFShld-Configured shielded|Cpybck-CopyBack|CBShld-Copyback Shielded
    • If you are using Oracle Exadata System Software release 18c or earlier, then use the following command:

      # cd /opt/MegaRAID/MegaCli
      # ./MegaCli64 -Pdlist -aAll |grep "Slot \|Firmware"
      Slot Number:0
      Firmware state:Online, Spun Up
      Slot Number:1
      Firmware state:Online, Spun Up
      Slot Number:2
      Firmware state:Online, Spun Up
      Slot Number:3
      Firmware state:Online, Spun Up
      Slot Number:4
      Firmware state:Online, Spun Up
      Slot Number:5
      Firmware state:Online, Spun Up
      Slot Number:6
      Firmware state:Online, Spun Up
      Slot Number:7
      Firmware state:Online, Spun Up
      Slot Number:8
      Firmware state:Online, Spun Up
      Slot Number:9
      Firmware state:Online, Spun Up
      Slot Number:10
      Firmware state:Online, Spun Up
      Slot Number:11
      Firmware state:Online, Spun Up
  6. If applicable, verify the PMEM modules are visible.

    # cellcli -e list physicaldisk where disktype=PMEM
  7. Verify the flash modules are visible. The number of cards and modules varies depending on the model.

    # cellcli -e list physicaldisk where disktype=flashdisk
    

    If any flash modules are missing, then power cycle the system, and wait 10 minutes after restarting before checking again. If this does not resolve the problem, then re-seat the module and flash card. If the problem continues, then replace the card with the on-site spare.

  8. Check the rear of the rack to ensure the flash card LEDs are green all the way across. If any LED is amber, then check again after the next step. If the LED is still amber after restart, then re-seat the card and ESM or flash module. If the problem continues, then replace the card with the on-site spare.

    Note:

    This step can be done when all systems are on. It does not have to be done on a per-server basis.
  9. Log out of the system using the following command:

    # logout