Skip Headers

Oracle Real Application Clusters Guard Concepts and Administration Guide
Release 3.2.1 for Windows NT and Windows 2000

Part Number A95197-01
Go To Table Of Contents
Contents
Go To Index
Index

Go to previous page Go to next page

6
Troubleshooting

This chapter provides general information on the troubleshooting tools provided with Oracle Real Application Clusters Guard Manager. The following table shows the information provided in this chapter:

Topic  Reference 

Verify Operations 

Section 6.1 

Dump Cluster 

Section 6.2 

Handling Errors and Troubleshooting Problems with Configured Databases 

Section 6.3 

Finding Additional Troubleshooting Information 

Section 6.4 

Note that Oracle Real Application Clusters Guard provides a centralized message facility. When you perform an action that results in an error, the system locates the message associated with the error and displays it. You can find more information about these messages in the Oracle Services for MSCS Error Messages manual.

6.1 Verify Operations

Oracle Real Application Clusters Guard provides a family of tools to help you verify cluster components and the cluster environment to validate the status of nodes, groups, and resources. If a discrepancy or a problem is found, the verify operation takes the appropriate action to fix any potential or actual problems.

Figure 6-1 shows the commands in the Troubleshooting menu.

Figure 6-1 Troubleshooting Menu and Verify Commands


Text description of menutrbl.gif follows.
Text description of the illustration menutrbl.gif

Table 6-1 describes the verify commands and provides references for more information.

Table 6-1  Verify Commands for Troubleshooting
Tool  Description  Reference 

Verify Cluster 

Validates the Oracle Real Application Clusters Guard installation, the Oracle Real Application Clusters product installation (including Oracle homes and product version numbers), cluster network configuration, and cluster resource DLL registration. 

Section 6.1.1 

Verify Group 

Validates that the group resources and their dependencies are configured correctly. 

Section 6.1.2 

Verify Real Application Clusters Database 

Validates the Oracle Real Application Clusters database instances. 

Section 6.1.3 

You can use the verify commands at any time to validate your cluster, group, or configured Oracle Real Application Clusters database. If problems are found during verification, Oracle Real Application Clusters Guard prompts you to fix them or returns an error message that further describes the problem. Oracle Real Application Clusters Guard does not make any changes without first prompting you.

6.1.1 Verify Cluster

The Verify Cluster operation validates the installation and network configuration of the cluster. You can perform a cluster verification at any time by choosing:

Troubleshooting -> Verify Cluster

The first time you connect to a cluster after installing or upgrading the Oracle Real Application Clusters Guard software, you are prompted to run Verify Cluster. You can run Verify Cluster at any time, however, and you should run it whenever the cluster configuration changes. The Verify Cluster operation verifies that:

The Verify Cluster operation also registers Oracle resource DLLs with Microsoft Cluster Server (MSCS).

Figure 6-2 shows the output from a typical Verify Cluster operation.

Figure 6-2 Clusterwide Operation Window for Verify Cluster


Text description of cwo_vc.gif follows.
Text description of the illustration cwo_vc.gif

If you run the Verify Cluster operation and it does not complete successfully, it might indicate one or more of the following problems:

If the operation completes successfully, but you are having problems with Oracle Real Application Clusters Guard, the problem is based in the Oracle Real Application Clusters configuration.

6.1.2 Verify Group

The Verify Group operation does the following to ensure that a group will perform correctly:

You can run the Verify Group operation at any time. However, you should run it when any of the following occurs:

You can verify a group by selecting the group from the Oracle Real Application Clusters Guard Manager tree view and then choosing:

Troubleshooting -> Verify Group

Or, you can run a Verify Group operation using the ORACGCMD command VERIFYGROUP (see Chapter 5). The ORACGCMD command also provides a VERIFYALLGROUPS command that allows you to verify all groups configured by Oracle Real Application Clusters Guard on a given cluster. You can run the VERIFYGROUP and VERIFYALLGROUPS commands in scripts as batch jobs.

You can watch the progress of the Verify Group operation and view the status of the individual resources in the group as Oracle Real Application Clusters Guard verifies the group.

Figure 6-3 shows the output from a Verify Group operation.

Figure 6-3 Clusterwide Operation Window for Verify Group


Text description of cwo_vg.gif follows.
Text description of the illustration cwo_vg.gif

6.1.3 Verify Oracle Real Application Clusters Database

You can validate a configured Oracle Real Application Clusters database at any time using the Verify Real Application Clusters Database operation. To issue the Verify Real Application Clusters Database command, select the database from the Oracle Real Application Clusters Guard Manager tree view, and then choose:

Troubleshooting -> Verify Real Application Clusters Database

The Verify Real Application Clusters Database operation verifies that all databases instances are configured and then calls the verify group operation to verify each of the instance groups. The verify database operation returns warning messages if it finds that not all database instances are configured or if the cluster metadata indicates there are more instances configured for the database than are actually associated with the database. (Use the Configure Additional Instances command or Unconfigure command to remedy these situations. See Section 3.4 or Section 3.5 for information on these commands.)

Figure 6-4 shows the output from a typical Verify Real Application Clusters Database operation in a Clusterwide Operation window.

Figure 6-4 Clusterwide Operation Window for Verify Real Application Clusters Database


Text description of cwo_verifyorac.gif follows.
Text description of the illustration cwo_verifyorac.gif

If any problems are found during verification, the Verify Real Application Clusters Database operation prompts you before it attempts to fix them.

6.2 Dump Cluster

Oracle Real Application Clusters Guard provides the Dump Cluster command to display Oracle Real Application Clusters Guard cluster data in a window. You might issue this command periodically (and save the output) to maintain a record of changes made to the cluster over time, or you might issue it at the request of customer support to provide a snapshot of the cluster environment.

Data presented when you issue the Dump Cluster command includes:

You can optionally save the Dump Cluster data to a file by clicking the Save As button.

To issue the Dump Cluster command, from the Troubleshooting menu, choose the Dump Cluster command.

Figure 6-5 shows a portion of the Dump Cluster output.

Figure 6-5 Dump Cluster Clusterwide Operation


Text description of dumpcluster.gif follows.
Text description of the illustration dumpcluster.gif

6.3 Handling Errors and Troubleshooting Problems with Configured Databases

The following sections describe how to troubleshoot specific problems that you may encounter with configured Oracle Real Application Clusters databases. For general information about troubleshooting Oracle Real Application Clusters databases, see the Oracle Real Application Clusters documentation.

In most cases, the first step in troubleshooting a problem is to issue the Verify Cluster, Verify Group, or Verify Real Application Clusters Database command.

6.3.1 Problems Placing a Group Online

If there is a problem placing a group online, try the following:

6.3.2 Instance Group Fails Over During Processing-Intensive Operations

Sometimes, processing-intensive operations (such as an Import operation) can cause Is Alive polling to fail and may result in an undesired group failover. In such cases, you can disable Is Alive polling for the instance by issuing the ORACGCMD DISABLEISALIVE command. However, be aware that when you disable Is Alive polling, Oracle Real Application Clusters Guard suspends monitoring the instance until Is Alive polling is reenabled. You reenable Is Alive polling with the ORACGCMD ENABLEISALIVE command.

Oracle Corporation recommends that you issue these ORACGCMD commands from within a script so that you can ensure that Is Alive polling is reenabled when the processing-intensive operation completes.

For information on the ORACGCMD commands, see Chapter 5.

6.3.3 ORA-12500 Errors and the listener.log File

An entry is logged to the listener.log file every time a connection is made to a database. Because Oracle Real Application Clusters Guard connects to the database each time it performs Is Alive polling, the listener.log file can grow quite large, very quickly. When it becomes very large (2 Mb or more), it can create a drain on resources, which can lead to the ORA-12500 error being returned.

To avoid this error, you can do any one of the following:

6.3.4 Problems Creating Listeners

Oracle Real Application Clusters Guard Manager uses the Listener Control Utility (LSNRCTL) to create new listeners, and captures the output in a file located in your Oracle home.

For example, if the Oracle home and network directory path is C:\ORANT\NETWORK\ADMIN, the listener output files will be written to the following directory and file:

C:\ORANT\NETWORK\LOG\fslnode.out

Each listener has its own output file that is named using the listener name and the .out extension. (In the example, the listener name is fslnode.) If you experience difficulties when creating a new listener, you can use the output file to help you diagnose the problem.

6.3.5 Archived listener.ora or tnsnames.ora Files

Whenever Oracle Real Application Clusters Guard makes changes in the listener.ora or tnsnames.ora files, the original version of the file is archived. If you need to reference an Oracle Net net service name definition or a listener definition as it was before Oracle Real Application Clusters Guard changed the definition, you can look at the archived versions of the configuration files.

Oracle Real Application Clusters Guard keeps up to two archived versions of configuration files. The file name of the archived version has a format of <filename>_000.ora and <filename>_001.ora. Note that <filename>_000.ora is the most recent file.

6.3.6 Rollback Files

Whenever Oracle Real Application Clusters Guard encounters an error during an operation after Oracle Net configuration files have been changed, the updated version of the file is saved as <filename>_rlb.ora. Then, the original version of the file is restored.

The rollback version of the file may be useful for problem diagnosis.

6.3.7 Clients Cannot Access an Oracle Real Application Clusters Database

If users and client applications are unable to access a database that is configured into an MSCS cluster, perform the following steps to fix the problem:

  1. Update the tnsnames.ora file to use the virtual server for each instance group.

  2. Run the Verify Group command with each instance group to validate the network (Oracle Net) configuration.

  3. Run the Verify Real Application Clusters Database command to validate the database.

6.4 Finding Additional Troubleshooting Information

This chapter described how to use the Oracle Real Application Clusters Guard Manager family of troubleshooting tools and how to troubleshoot problems with configured Oracle Real Application Clusters databases. Additional information is available as follows:


Go to previous page Go to next page
Oracle
Copyright © 2001 Oracle Corporation.

All Rights Reserved.
Go To Table Of Contents
Contents
Go To Index
Index