Oracle Light Weight Availability Collection Tool User Guide

Important Fields in the Availability Datagram

This section identifies and describes the important fields of the Availability datagram (listed here in alphabetical order).

Field Name

Definition

Adjusted availability

Represented in percentage as: ((Total uptime + Total Planned downtime) /Total elapsed time) * 100


Note –

Planned downtime is considered as uptime in this instance; hence, the term adjusted availability.


Downtime

The duration during which the host was out of run level 3 is considered as downtime (that is, the difference in coordinated universal time (UTC) between the outage event and its corresponding boot event). Downtime is recorded as a part of the outage event (panic/halt). It is decided by the wasPlanned field. wasPlanned can be one of the following designations:

  • Undefined (value of 0)

  • Planned (value of 1)

  • Unplanned (value of 2)

In the sample datagram (above), event #2 is a panic event, and event #3 is its corresponding boot event; the difference in UTC of event #3 and event #2 is the downtime. Therefore, downtime = 1207861340 - 1207861339 (= 1 sec)

Since the wasPlanned flag is 2, the downtime is marked against the field dwnUnplnd (Unplanned downtime)

Total availability

Represented in percentage as: (Total uptime/Total elapsed time) * 100

Types of Events

The following types of events are recorded in the Availability datagram by the Oracle Lightweight Availability Collection Tool:

  • epoch

    Marks the beginning of event tracking. It is recorded only once in the Availability datagram (at the inception). The UTC of this event marks the inception time of the Oracle Lightweight Availability Collection Tool on the monitored host.

  • boot

    Whenever the host returns to run level 3, a boot event is recorded in the datagram along with the corresponding timestamp.

  • halt

    Whenever the host leaves run level 3 to any other level, a halt event is created with the time of halt being the time the host left run level 3.

  • panic

    If the host encounters an un-natural downing such as system crash, upon the subsequent boot of the host (that is, a return to run level 3), a panic event is recorded where the time of the panic event is the time at which the Oracle Lightweight Availability Collection Tool stopped running.

  • time

    Indicates the last recorded UTC for offline reporting. This event contains the consolidated uptime and downtime information. It also reports the elapsed time (measured as the duration in UTC that the Oracle Lightweight Availability Collection Tool is monitoring this host since inception). Apart from this information, the time event also reports system availability in two forms: Total availability and Adjusted availability.

Uptime

The difference in UTC between the current outage event and the last event before it, which would be a boot event, is measured as uptime.

In the sample datagram (above), if the uptime field in event #1 (boot event) is calculated as the difference in UTC between event #3 and event #2uptime = 1207861339 - 1207784519 (= 76820 secs)