Overview

Disaster Recovery is a specific subset of High Availability. Traditionally, NMS High Availability refers to mechanisms within a given NMS site that enable recovery or failover to backup hardware within the same site—typically based on some form of Unix clustering. This approach is distinct from Disaster Recovery, which is the topic of this chapter.

Note: Within this chapter, Disaster Recovery may sometimes be referred to as High Availability.

The optional NMS Disaster Recovery module consists of a group of components that work together to support the continual monitoring and disaster recovery of NMS across two or more sites. The High Availability module provides:

• Scripts to perform switchover or failover from an NMS instance at one site to an NMS instance at another site (Disaster Recovery). An NMS site is defined as a logical grouping of software and associated hardware components that support one or more sets of NMS end-user applications (an active NMS instance).

• Continual monitoring of the state of each critical software component within each NMS site.

• A comprehensive set of web pages for reporting the state of each site and its significant software components, including configuration of the NMS sites being monitored.