HA Media Interface Keepalive

In an HA node, it is possible for the two systems in the node to lose communication via the management (rear, wancom) interfaces. For example, wancom 1 and wancom 2 might become disconnected, and cause the heartbeat synchronization to fail. This type of failure causes communication errors because both systems try to assume the active role and thereby access resources reserved for the active system.

To avoid these types of conditions, you can enable an option instructing the standby system to take additional time before going to the active state. This check occurs through the system’s media interfaces. Using it, the standby can determine whether or not there has been a true active failure.

In cases when the standby determines the active system has not truly failed, it goes out of service because it has determined that it no longer has up-to-date data from its active counterpart. You can restore functionality by re-establishing management (rear) interface communication between the systems in the node, and then re-synchronizing the standby by rebooting it.

When you enable the media interface keepalive, the standby system in the HA node sends ARP requests to determine whether the media interfaces’ virtual IP addresses are active. There are two possible outcomes:

  • If it receives responses to its ARP requests, the standby takes itself out of service to prevent a conflict with the active system.
  • If it does not receive responses to its ARP requests within a timeout value you set, then the standby assumes the active role in the HA node.

WARNING:

When you use this feature (the media-if-peercheck-time parameter), rebooting the active system does not result in a successful switchover. If you want to force a switchover while using the media interface keepalive timer, use the notify berpd force command.

Impact to Boot-Up Behavior

With the HA media interface keepalive enabled, the Oracle® Enterprise Session Border Controller might be in the initial state longer than if the feature were disabled because it requires more information about the media (front) interfaces.

HA Media Interface Keepalive Configuration

You turn the HA media interface keepalive on by setting a timeout value for the standby to receive responses to its ARP requests before it assumes the active role in the HA node. Keeping this parameter set to 0, its default, disables the keepalive.

To enable the HA media interface keepalive:

  1. In Superuser mode, type configure terminal and press Enter.
    ORACLE# configure terminal
    ORACLE(configure)#
  2. Type system and press Enter.
    ORACLE(configure)# system
    ORACLE(system)#
  3. Type redundancy and press Enter.
    ORACLE(system)# redundancy
    ORACLE(redundancy)#

    If you are adding this feature to an existing configuration, then you will need to select the configuration you want to edit.

  4. media-if-peercheck-time—Enter the amount of time in milliseconds for the standby system in an HA node to receive responses to its ARP requests via the media interface before it takes over the active role from its counterpart.

    The default is 0, which turns the HA media interface keepalive off. The maximum value is 500 milliseconds.

  5. Save and activate your configuration.
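
As an illustration, steps 4 and 5 might look like the following session. The value 250 is only an example (any value from 1 to 500 milliseconds enables the keepalive), and the sketch assumes the standard ACLI done, exit, save-config, and activate-config commands; the prompts and output on your system may differ.

    ORACLE(redundancy)# media-if-peercheck-time 250
    ORACLE(redundancy)# done
    ORACLE(redundancy)# exit
    ORACLE(system)# exit
    ORACLE(configure)# exit
    ORACLE# save-config
    ORACLE# activate-config

Setting the parameter back to 0 and then saving and activating the configuration again would disable the keepalive.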