34 Monitoring Servers

The following features and topics are covered in this chapter:

34.1 Get Started With Server Management

You can discover hardware assets in Oracle Enterprise Manager by using the existing discovery method, then deploying the Enterprise Manager agent onto the system. Once the assets are discovered, you can view monitoring information about the hardware, including incidents, power usage, network information, service processor configuration, and fan and temperature information. The user interface also shows the relationship between managed hardware and the operating systems, virtualization platforms, and other software installed on it.

34.2 Location of Server Information in the UI

You can select any target that is a child of a server (virtual platform, guest, or host) and the server will appear in the Navigation pane of the target.

From the All Targets page, you can also click Systems Infrastructure Server to see the list of servers. You can click any server in this list to open the server's home page.

34.3 Actions for Server Management

You can perform the following actions:

  • Discover a server

  • View the server's hardware components

  • View the server's configuration

  • View the server's utilization of resources

34.4 About the Hardware Dashboard

The dashboard is located near the top of the main window. It contains basic information about the server's status, including incidents, power usage, temperature information, core information, and recent events.

The information in the dashboard is automatically displayed. Three dashlets are visible. Click the icon beneath the dashboard to switch to another set of dashlets.

The following dashlets are displayed:

34.4.1 About Basic Hardware Information

The first dashlet contains basic information about the hardware. The heading includes the full hardware name and power status.

The following fields are displayed:

  • IP Address

  • Model

  • Serial Number

  • Health

  • CPU

  • Memory

  • Firmware

  • Locator

34.4.2 About Open Incidents

The second dashlet contains information about open incidents for the hardware.

The following fields are displayed:

  • Fatal

  • Critical

  • Warning

Click a category to open a detailed view of the incidents in that category, including their target, summary, date of last update, acknowledgment state, and status. Click the X icon in the upper right to close the detailed view.
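As a rough illustration of how the dashlet's per-category counts relate to the detailed view, the following Python sketch groups incident records by severity. It is not Enterprise Manager code: the `Incident` record, its field names, and the `"New"` and `"Closed"` status values are assumptions made for this example only.

```python
from collections import Counter
from dataclasses import dataclass

# Severity categories shown in the Open Incidents dashlet.
SEVERITIES = ("Fatal", "Critical", "Warning")


@dataclass
class Incident:
    # Hypothetical record mirroring the columns of the detailed view.
    target: str
    summary: str
    last_updated: str
    acknowledged: bool
    status: str      # assumed values: "New", "Closed"
    severity: str    # one of SEVERITIES


def count_open_incidents(incidents):
    """Return the number of open incidents for each severity category."""
    counts = Counter(i.severity for i in incidents if i.status != "Closed")
    return {severity: counts.get(severity, 0) for severity in SEVERITIES}


def incidents_in_category(incidents, severity):
    """Return the open incidents listed when a category is clicked."""
    return [i for i in incidents
            if i.severity == severity and i.status != "Closed"]


sample = [
    Incident("server01", "Fan module fault", "2024-01-10", False, "New", "Critical"),
    Incident("server01", "Inlet temperature high", "2024-01-11", True, "New", "Warning"),
    Incident("server01", "Power supply removed", "2024-01-09", True, "Closed", "Critical"),
]

print(count_open_incidents(sample))
```

In this sketch, closed incidents are excluded from both the counts and the per-category list, which matches the dashlet's focus on open incidents.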

34.4.3 About Fan and Temperature Information

The third dashlet displays fan and temperature information.

The following fields are displayed:

  • Fan Usage: This displays the fan usage as a percentage of its maximum.

  • Temperature: This displays the temperature of the hardware in degrees Celsius.

34.4.4 About Power Usage

The fourth dashlet displays power usage information. A chart displays the power usage as a percentage of the maximum.

The following fields are displayed:

  • Available Power

  • Peak Permitted

  • Used Power

  • Power Policy (SPARC servers only)
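The percentage shown in the chart can be approximated with simple arithmetic. The Python sketch below assumes that the maximum is the Peak Permitted value and that both values are in watts. This chapter does not state which field the chart uses as its maximum, so treat that choice, and the function name, as assumptions.

```python
def power_usage_percent(used_watts, peak_permitted_watts):
    """Return used power as a percentage of the permitted peak.

    Assumption: Peak Permitted is the maximum the chart is relative to.
    """
    if peak_permitted_watts <= 0:
        raise ValueError("peak_permitted_watts must be greater than zero")
    return 100.0 * used_watts / peak_permitted_watts


# Example: 150 W used out of a 600 W permitted peak.
print(power_usage_percent(150, 600))
```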

34.4.5 About Core Information

The fifth dashlet displays a pie chart showing the number of active and inactive cores.

34.4.6 About the Last Configuration Change and Incident

The sixth dashlet displays the date and time of the last configuration change and the last reported incident.

34.5 Viewing the Hardware Dashboard

  1. Select All Targets from the Targets list.

  2. Under the Servers, Storage, and Network heading, select Systems Infrastructure Server.

    A list of the target servers is displayed.

  3. Click the target name to open the Summary page for the server. The dashlets appear on the top of the page and provide the summary information.

34.6 About Server Metrics

You can view a complete list of the metrics for a selected server.

34.7 Viewing Server Metrics

  1. Select All Targets from the Targets list.

  2. Under the Servers, Storage, and Network heading, select Systems Infrastructure Server.

    A list of the target servers is displayed.

  3. Click the target name to open the Summary page for the server.

  4. Click Systems Infrastructure Server in the upper left corner of the page. Click Monitoring, then click All Metrics.

  5. Click a metric to view its details, including the collection schedule and upload interval.

34.8 About the Photorealistic Image of the Hardware

The Hardware View tab in the main window displays a photorealistic view of compatible hardware, including the front, top, and rear, and a table view of the hardware's components.

Select Photorealistic View to display the photorealistic view of the hardware. Components with incidents are outlined in red.

You can click any component displayed in the photorealistic view to view additional information about that component. The following information is displayed if it is available and relevant to the component:

  • Component Name

  • Manufacturer

  • Serial Number

  • Part Number

  • Total Cores: The number of cores for a CPU

  • Enabled Cores: The number of enabled cores for a CPU

  • Size: The size of memory components in GB

Select Table to display a table of the hardware components. The following information is displayed for each component:

  • Slot Number

  • Component Name

  • Component Type

34.9 Viewing the Photorealistic Image of the Hardware

  1. Select All Targets from the Targets list.

  2. Under the Servers, Storage, and Network heading, select Systems Infrastructure Server.

    A list of the target servers is displayed.

  3. Click the target name to open the Summary page for the server.

  4. Click the Hardware View tab.

34.10 About the Logical View

The Logical View tab in the main window displays detailed information about the hardware components and capabilities. You can select one of the tabs to view detailed information about it.

34.10.1 About CPU Information

The CPU tab shows CPU and CPU usage information.

The top section shows summary information. A pie chart displays the number of installed and available CPUs.

The following fields are displayed:

  • Architecture

  • Clock Speed

  • Model

  • CPU Power Consumption (Watts)

  • Overall Status

The bottom section displays a table showing the available processors, including name, active cores, serial number, part number, overall cache in KB, component location, and operational status.

34.10.2 About Memory Information

The Memory tab shows overall memory and DIMM-specific information.

The top section shows summary information. A pie chart displays the number of installed and available DIMMs.

The following fields are displayed:

  • Memory (GB)

  • Memory Power Consumption (Watts)

  • Overall Status

The bottom section displays a table showing the memory modules, including the memory component name, size in GB, manufacturer, part number, serial number, location, and operational status.

34.10.3 About Power Information

The Power tab shows power and power supply information.

The top section shows summary information. A pie chart displays the number of installed and available power supplies. The overall status of the power supply is displayed.

The bottom section displays a table showing the available power supplies, including name, manufacturer, part number, serial number, output power in watts, location, and operational status.

34.10.4 About Fan Information

The Fan tab shows cooling and fan information.

The top section shows summary information. A pie chart displays the number of total and available power supply unit fans. The overall status of the cooling is displayed.

The bottom section displays a table showing the available fans, including name, RPM as a percentage of maximum, location, and operational status.

34.10.5 About Storage Information

The Storage tab shows information about the available storage.

The top section shows summary information. The following fields are displayed:

  • Total Installed Storage (GB)

  • Installed Disk

The bottom section displays a table showing the available storage disks, including the name, size in gigabytes, manufacturer, serial number, part number, and operational status. This information is displayed only if the ILOM is discovered.

34.10.6 About Disk Controller Information

The Disk Controller tab displays a table of the available disk controllers, including name, model, manufacturer, serial number, and operational status.

This information is displayed only if the ILOM is discovered.

34.10.7 About Disk Expander Information

The Disk Expander tab displays a table of the available disk expanders, including name, manufacturer, version, model, firmware version, and chassis ID.

This information is displayed only if the ILOM is discovered.

34.10.8 About Network Ports Information

The Network Ports tab displays information about network interface controllers and network adapters.

The top section shows NIC and status information. The following fields are displayed:

  • Installed Ethernet NICs

  • Overall Status

The bottom section shows a table of network ports, including name, MAC address, description, and operational status.

34.10.9 About PCI Devices Information

The PCI Devices tab displays a table of the PCI devices, including name, description, device class, PCI device ID, PCI vendor ID, PCI end point, PCI sub device ID, and PCI sub device vendor ID.

This information is displayed only if an Agent is deployed on an operating system on the server.

34.10.10 About PDOMs Information

The PDOMs tab displays a table of the physical domains for M-series hardware, including name, configuration status, assigned DCUs, and operational status.

34.10.11 About DCUs Information

The DCUs tab displays a table of the DCUs for M-series hardware, including name, number of CPUs, memory in GB, number of fans, PDOM ID, power status, and operational status.

34.11 Viewing the Logical View

  1. Select All Targets from the Targets list.

  2. Under the Servers, Storage, and Network heading, select Systems Infrastructure Server.

    A list of the target servers is displayed.

  3. Click the target name to open the Summary page for the server.

  4. Click the Logical View tab.

34.12 About Energy Consumption

The Energy tab in the main window displays information about the hardware's energy consumption.

The summary section displays three graphs showing basic temperature, fan speed, and power information. You can use the Time Range dropdown to select a different time interval to display.

The first graph shows the inlet and exhaust temperatures in degrees Celsius.

The second graph shows the fan speed as a percentage of the maximum.

The third graph shows the power consumption and utilization in watts.

You can click the Table View link to view a table of the data points used to create the graph.

34.13 Viewing the Energy Consumption

  1. Select All Targets from the Targets list.

  2. Under the Servers, Storage, and Network heading, select Systems Infrastructure Server.

    A list of the target servers is displayed.

  3. Click the target name to open the Summary page for the server.

  4. Click the Energy tab.

34.14 About Network Connectivity

The Network Connectivity tab in the main window displays information about the hardware's network interfaces, data links, and ports. You can select one of these three options to view detailed information about it.

34.14.1 About Network Interfaces

The Network Interfaces page shows a table of the hardware's network interfaces, including IP address, netmask, and an icon indicating the current state.

You can sort the list by interface name or interface state.

Click the more link for additional information.

34.14.2 About Network Data Links

The Network Data Links page shows a table of the hardware's data links, including name, physical address, media, and VLAN ID.

You can sort the list by data link name or data link state.

Click the more link for additional information, including device and device path.

34.14.3 About Network Ports

The Network Ports page shows a table of the hardware's ports and their types.

You can sort the list by state, connector, or number and name.

Click the more link for additional information, including errors and throughput.

34.15 Viewing the Network Connectivity

  1. Select All Targets from the Targets list.

  2. Under the Servers, Storage, and Network heading, select Systems Infrastructure Server.

    A list of the target servers is displayed.

  3. Click the target name to open the Summary page for the server.

  4. Click the Network Connectivity tab.

34.16 About the Service Processor Configuration

The Service Processor Configuration tab in the main window displays information about the firmware, host policy configuration, power on self test configuration, SP alert configuration, and DNS and NTP settings. You can select one of these tabs to view detailed information about it.

34.16.1 About Firmware Information

The Firmware Information tab shows a table with the component identifier for all installed firmware, the type, the version, and the release date.

34.16.2 About the Host Policy Configuration

The Host Policy Configuration tab shows a table with a list of the host policy names and their current values.

34.16.3 About the Power On Self Test Configuration

The Power On Self Test Configuration tab shows a table with a list of the power on self test setting names and their current values.

34.16.4 About the SP Alert Configuration

The SP Alert Configuration tab shows a table with the service processor alert names. For each alert, the table provides the alert type, alert level, destination address, destination port, SNMP version, and community.

34.16.5 About the DNS & NTP Information

The DNS & NTP tab shows information about the DNS and NTP settings. The following fields are displayed:

  • Auto DNS/DHCP: Indicates whether DNS or DHCP is being used.

  • DNS Servers: Lists the DNS servers in use.

  • Search Path: Lists the search path for the DNS servers.

  • Time: Lists the current time and time zone for the hardware.

  • Use NTP Server: Indicates whether an NTP server is being used.

  • NTP Server 1: The IP address of the first NTP server.

  • NTP Server 2: The IP address of the second NTP server.

34.17 Viewing the Service Processor Configuration

  1. Select All Targets from the Targets list.

  2. Under the Servers, Storage, and Network heading, select Systems Infrastructure Server.

    A list of the target servers is displayed.

  3. Click the target name to open the Summary page for the server.

  4. Click the Service Processor Configuration tab.

34.18 Managing Metrics and Incident Notifications

You can perform the following tasks to manage monitoring and incident notification:

34.18.1 Viewing Metric Collection Errors

Metric collection errors are usually caused by installation or configuration issues. You can view errors for a server.

  1. Click Systems Infrastructure Server from the All Targets page.

  2. Click the target name to open the home page.

  3. Click Systems Infrastructure Server in the upper left corner of the page. Click Monitoring, then click Metric Collection Errors.

34.18.2 Editing Metric and Collection Settings

The Metrics tab displays all of the monitored attributes. The default view is metrics with thresholds. For these types of monitored attributes, you can modify the comparison operator, the threshold limits, the corrective action, and the collection schedule.

  1. Click Systems Infrastructure Server from the All Targets page.

  2. Click the target name to open the home page.

  3. Click Systems Infrastructure Server in the upper left corner of the page. Click Monitoring, then click Metric and Collection Settings.

  4. Modify threshold limits or collection schedule. When a threshold field is empty, the alert is disabled for that metric.

  5. Click the Edit icon for advanced settings.

    Click the Other Collected Items tab to view non-threshold monitored attributes. You can modify the collection period for these attributes, or disable monitoring.

  6. Click OK to save your changes.
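The interaction between the comparison operator and the threshold limits can be sketched in Python as follows. This is an illustrative model rather than Enterprise Manager's implementation: the function name, the operator symbols, the warning and critical parameter names, and the returned labels are assumptions. An empty threshold field is represented as `None`, which disables the alert for that level.

```python
import operator

# Assumed set of comparison operators, keyed by symbol.
OPERATORS = {
    ">": operator.gt,
    ">=": operator.ge,
    "<": operator.lt,
    "<=": operator.le,
    "=": operator.eq,
    "!=": operator.ne,
}


def evaluate_metric(value, comparison, warning=None, critical=None):
    """Return "Critical", "Warning", or "Clear" for a collected value.

    A threshold left as None (an empty field) never raises an alert.
    """
    compare = OPERATORS[comparison]
    # Check the more severe level first so it takes precedence.
    if critical is not None and compare(value, critical):
        return "Critical"
    if warning is not None and compare(value, warning):
        return "Warning"
    return "Clear"


print(evaluate_metric(85, ">", warning=80, critical=90))
print(evaluate_metric(95, ">", warning=80, critical=90))
print(evaluate_metric(95, ">"))  # both thresholds empty: alert disabled
```

With the `>` operator, a value of 85 crosses only the warning threshold, a value of 95 crosses the critical threshold, and a metric with both fields empty never raises an alert.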

34.18.3 Editing a Monitoring Configuration

  1. Click Systems Infrastructure Server from the All Targets page.

  2. Click the target name to open the home page.

  3. Click Systems Infrastructure Server in the upper left corner of the page. Click Target Setup.

  4. Click Monitoring Configuration.

34.18.4 Suspending Monitoring Notifications

Brownouts enable you to temporarily suppress notifications on a target. The Agent continues to monitor the target under brownout. You can view the actual target status along with an indication that the target is currently under brownout.

You can create a brownout for a server.

  1. Click Systems Infrastructure Server from the All Targets page.

  2. Click the target name to open the home page.

  3. Click Systems Infrastructure Server in the upper left corner of the page. Click Control.

  4. Click Create Brownout.

  5. Enter a name for the brownout event.

  6. Select a reason from the menu and add comments, as needed.

  7. Click the options to define how jobs will run and the maintenance window.

  8. Click Submit.
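The difference between monitoring and notification during a brownout can be modeled with a short Python sketch. The `Brownout` and `TargetMonitor` classes below are hypothetical and exist only for this illustration. They show the behavior described above: status is still recorded while the brownout is active, but no notification is sent.

```python
from dataclasses import dataclass, field
from datetime import datetime


@dataclass
class Brownout:
    # Hypothetical brownout window for a single target.
    name: str
    start: datetime
    end: datetime

    def active_at(self, when):
        return self.start <= when < self.end


@dataclass
class TargetMonitor:
    brownouts: list = field(default_factory=list)
    recorded: list = field(default_factory=list)   # every observed status
    notified: list = field(default_factory=list)   # notifications sent

    def report(self, when, status):
        under_brownout = any(b.active_at(when) for b in self.brownouts)
        # Monitoring continues: the actual status is always recorded,
        # together with a flag showing the target is under brownout.
        self.recorded.append((when, status, under_brownout))
        # Only notifications are suppressed.
        if not under_brownout:
            self.notified.append((when, status))


monitor = TargetMonitor(brownouts=[
    Brownout("firmware-update",
             datetime(2024, 1, 10, 10, 0),
             datetime(2024, 1, 10, 11, 0)),
])
monitor.report(datetime(2024, 1, 10, 10, 30), "Down")  # inside the window
monitor.report(datetime(2024, 1, 10, 11, 30), "Down")  # after the window

print(len(monitor.recorded), len(monitor.notified))
```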

34.18.5 Suspending Monitoring for Maintenance

Blackouts enable you to suspend monitoring on one or more targets in order to perform maintenance operations. To place a target under blackout, you must have at least the Blackout Target privilege on the target. If you select a host, then by default all the targets on that host are included in the blackout. Similarly, if you select a target that has members, then by default all the members are included in the blackout.

  1. Click Systems Infrastructure Server from the All Targets page.

  2. Click the target name to open the home page.

  3. Click Systems Infrastructure Server in the upper left corner of the page. Click Control.

  4. Click Create Blackout.

  5. Select a reason from the menu.

  6. Add comments, as needed.

  7. Click Submit.
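The default scope of a blackout can be pictured as the selected target plus its direct members. The Python sketch below is a simplified model under that assumption. The `blackout_scope` function, the `members` mapping, and the target names are hypothetical, and nested membership is not modeled.

```python
def blackout_scope(selected, members, include_members=True):
    """Return the set of targets covered by a blackout on `selected`.

    `members` maps a target name to the targets that run on it or
    belong to it. By default, all of them are included in the blackout.
    """
    scope = {selected}
    if include_members:
        scope.update(members.get(selected, ()))
    return scope


# Hypothetical inventory: a host with two targets running on it.
members = {"host01": ["database01", "listener01"]}

print(sorted(blackout_scope("host01", members)))
print(sorted(blackout_scope("host01", members, include_members=False)))
```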

34.18.6 Ending a Monitoring Brownout or Blackout

You can end a blackout or brownout for a server.

  1. Click Systems Infrastructure Server from the All Targets page.

  2. Click the target name to open the home page.

  3. Click Systems Infrastructure Server in the upper left corner of the page. Click Control.

  4. Click End Blackout or End Brownout.

34.19 Administering Servers

You can perform the following tasks to manage and administer servers:

34.19.1 Viewing Compliance

The Compliance pages enable you to view the compliance framework, standards, and the server's compliance.

  1. Click Systems Infrastructure Server from the All Targets page.

  2. Click the target name to open the home page.

  3. Click Systems Infrastructure Server in the upper left corner of the page. Click Compliance.

  4. Click the option to view Results, Standard Associations, or Real-time Observations.

34.19.2 Identifying Changes in a Server Configuration

When an administrator changes a system's configuration, it can be helpful to know when the configuration was last changed. This information appears in the configuration dashlet on the Summary page.

To view more detailed information for a server:

  1. Click Systems Infrastructure Server from the All Targets page.

  2. Click the target name to open the home page.

  3. Click Systems Infrastructure Server in the upper left corner of the page. Click Configuration.

  4. Click the option to view Last Collected, Comparison and Drift Management, Compare, Search, History, Save, Saved, or Topology.

34.19.3 Editing Server Administrator Access

  1. Click Systems Infrastructure Server from the All Targets page.

  2. Click the target name to open the home page.

  3. Click Systems Infrastructure Server in the upper left corner of the page. Click Target Setup.

  4. Click Administrator Access.

34.19.4 Adding a Server to a Group

  1. Click Systems Infrastructure Server from the All Targets page.

  2. Click the target name to open the home page.

  3. Click Systems Infrastructure Server in the upper left corner of the page. Click Target Setup.

  4. Click Add to Group.

34.19.5 Editing Server Properties

  1. Click Systems Infrastructure Server from the All Targets page.

  2. Click the target name to open the home page.

  3. Click Systems Infrastructure Server in the upper left corner of the page. Click Target Setup.

  4. Click Properties.

34.20 Related Resources for Server Management

See the following chapters for more information: