Fault monitoring for the HA for SAP MaxDB data service is provided by the following fault monitors:
The SAP MaxDB fault monitor
The SAP xserver fault monitor
Each fault monitor is contained in a resource whose resource type is shown in the following table.
|
System properties and extension properties of these resources control the behavior of the fault monitors. The default values of these properties determine the preset behavior of the fault monitors. The preset behavior should be suitable for most Oracle Solaris Cluster installations. Therefore, you should tune the HA for SAP MaxDB fault monitors only if you need to modify this preset behavior.
Tuning the HA for SAP MaxDB fault monitors involves the following tasks:
Setting the interval between fault monitor probes
Setting the timeout for fault monitor probes
Defining the criteria for persistent faults
Specifying the failover behavior of a resource
For more information, see Tuning Fault Monitors for Oracle Solaris Cluster Data Services in Planning and Administering Data Services for Oracle Solaris Cluster 4.4. Information about the HA for SAP MaxDB fault monitors that you need to perform these tasks is provided in the subsections that follow.
HA for SAP MaxDB also enables you to control how the fault monitor responds if the SAP MaxDB parent kernel process is not running. For more information, see Forcing the SAP MaxDB Database Instance to Be Restarted if the Parent Kernel Process Is Terminated.
Tune the HA for SAP MaxDB fault monitors when you register and configure HA for SAP MaxDB. For more information, see Registering and Configuring HA for SAP MaxDB.
To determine whether SAP xserver and the SAP MaxDB database instance are operating correctly, the HA for SAP MaxDB fault monitors probe these resources periodically. The optimum interval between fault monitor probes depends on the time that is required to respond to a fault in a resource. This time depends on how the complexity of the resource affects the time that is required for operations such as restarting the resource.
For example, SAP xserver is a much simpler resource and can be restarted much quicker than SAP MaxDB. Therefore, the optimum interval between fault monitor probes of SAP xserver is shorter than the optimum interval between probes of SAP MaxDB.
The optimum timeout for fault monitor probes depends on the operations that a fault monitor performs to probe the resource.
During a probe, the SAP MaxDB fault monitor performs the following operations:
The SAP MaxDB fault monitor determines whether the SAP MaxDB database instance is online.
If the SAP MaxDB database instance is online, the SAP MaxDB fault monitor determines whether the parent kernel process of the SAP MaxDB database instance is running. You can control how the fault monitor responds if the parent kernel process is not running. For more information, see Forcing the SAP MaxDB Database Instance to Be Restarted if the Parent Kernel Process Is Terminated.
The SAP MaxDB fault monitor determines whether SAP xserver is available. This fault monitoring supplements the fault monitoring that the SAP xserver fault monitor provides.
During a probe, the SAP xserver fault monitor determines whether SAP xserver is available.
Faults that each HA for SAP MaxDB fault monitor detects are described in the subsections that follow.
The SAP MaxDB fault monitor detects the following faults in SAP MaxDB:
A status of the SAP MaxDB database instance that is not ONLINE, for example, OFFLINE or ADMIN
Unexpected termination of the parent kernel process of the SAP MaxDB database instance
The SAP MaxDB fault monitor also detects the unavailability of SAP xserver. This fault monitoring supplements the fault monitoring that the SAP xserver fault monitor provides.
The SAP xserver fault monitor detects following faults:
Unavailability of SAP xserver. Unavailability of SAP xserver is also detected by the SAP MaxDB fault monitor.
Persistent system errors. A persistent system error is a system error that occurs four times within the retry interval. If a persistent system error occurs, the fault monitor restarts SAP xserver.
To minimize the disruption that transient faults in a resource cause, a fault monitor restarts the resource in response to such faults. For persistent faults, more disruptive action than restarting the resource is required:
For the SAP MaxDB resource, the fault monitor fails over the resource to another node. The SAP MaxDB resource is a failover resource.
For the SAP xserver resource, the fault monitor takes the resource offline. The SAP xserver is a multiple-master resource.
By default, unexpected termination of the parent kernel process does not cause the SAP MaxDB fault monitor to restart the SAP MaxDB database instance. The SAP MaxDB database instance can continue to function without the parent kernel process. Restarting the SAP MaxDB database instance in this situation might cause unnecessary unavailability of the SAP MaxDB database instance. Therefore, you should force the SAP MaxDB database instance to be restarted only if you require a feature that the parent kernel process provides. An example of such a feature is maintaining the integrity of the log history.
To force the SAP MaxDB database instance to be restarted if the parent kernel process is terminated, set the Restart_if_Parent_Terminated extension property of the SUNW.sapdb resource to True.