Oracle Real Application Clusters Guard Administration and Reference Guide
Release 9.0.1

Part Number A88810-02

6
Troubleshooting Oracle Real Application Clusters Guard

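This chapter describes how to troubleshoot an Oracle Real Application Clusters Guard system. It includes the following topics:

  • Interpreting Message Output in Oracle Real Application Clusters Guard

  • Troubleshooting Start-Up Problems

  • Troubleshooting Command Line Problems

  • Troubleshooting Oracle Real Application Clusters Guard Monitors

  • Troubleshooting the System Outside of the Packs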

Interpreting Message Output in Oracle Real Application Clusters Guard

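Oracle Real Application Clusters Guard provides detailed error messages that can help in troubleshooting. Error messages from the Oracle database server and from third-party media vendors also provide useful troubleshooting output. This section contains the following topics:

  • Identifying Types of Message Output

  • Identifying Error Codes

  • Interpreting Oracle Real Application Clusters Guard Error Messages

  • Interpreting Debugging Output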

Identifying Types of Message Output

Table 6-1 shows the types of message output that are useful for troubleshooting Oracle Real Application Clusters Guard.

Table 6-1 Message Output

Type of Output:  PFSCTL messages
Produced By:     PFSCTL command line
Location:        Standard output (terminal) and PFSCTL.log

Type of Output:  PFS messages
Produced By:     Packs and monitors
Location:        OFA: $ORACLE_BASE/admin/$DB_NAME/pfs/pfsdump/pfs_$ORACLE_SERVICE_Host.log
                 Non-OFA: $ORACLE_HOME/pfs/$DB_NAME/log/pfs_$ORACLE_SERVICE_Host.log
Description:     Contains a chronological log of actions that are relevant to Oracle Real Application Clusters Guard, error messages that are generated by Oracle Real Application Clusters Guard and the Oracle database server, and administrative operations

Type of Output:  PFS debug file
Produced By:     $PFS_DEBUGGING command
Location:        OFA: $ORACLE_BASE/admin/$DB_NAME/pfs/pfsdump/pfs_$ORACLE_SERVICE_Host.debug
                 Non-OFA: $ORACLE_HOME/pfs/$DB_NAME/log/pfs_$ORACLE_SERVICE_Host.debug
Description:     Contains detailed output generated by Oracle Real Application Clusters Guard processes. This file is appended to when $PFS_DEBUGGING is set to $PFS_TRUE.

Type of Output:  Monitor log files
Produced By:     Heartbeat monitor
Location:        OFA: $ORACLE_BASE/admin/$DB_NAME/pfs/pfsdump/pfs_$ORACLE_SERVICE_Host_oraping.log
                 Non-OFA: $ORACLE_HOME/pfs/$DB_NAME/log/pfs_$ORACLE_SERVICE_Host_oraping.log
Description:     Contains information about the functioning of the heartbeat monitor

Type of Output:  PFS trace file
Produced By:     Heartbeat monitor
Location:        The directory specified by the USER_DUMP_DEST initialization parameter
Description:     Contains SQL*Trace output, including wait and bind data. This file is created when TRACE_FLAG is set to $PFS_TRUE in the ORAPING_CONFIG table.
                 See Also: "Making Online Changes to the ORAPING_CONFIG Table"

Type of Output:  Fault data capture
Produced By:     Listener monitor
Location:        $ORACLE_HOME/network/trace/pfs_lsnr_LsnrPid.trc
                 or
                 $ORACLE_HOME/network/trace/pfs_lsnr_Time.trc
                 where Time is the time in seconds when the listener terminated
Description:     Contains output generated by the netstat UNIX command

Type of Output:  Fault data capture
Produced By:     Pack
Location:        The directory specified by the USER_DUMP_DEST initialization parameter
Description:     Contains output generated by the Oracle system state dump and the utlclust.sql script

Type of Output:  Alert log
Produced By:     Oracle database server
Location:        The directory specified by the BACKGROUND_DUMP_DEST initialization parameter
Description:     Contains a chronological log of errors, initialization parameter file settings, and administrative operations

Type of Output:  Oracle trace file
Produced By:     Oracle database server
Location:        The directory specified by the USER_DUMP_DEST initialization parameter
Description:     Contains detailed output generated by Oracle server processes

Type of Output:  System logs
Produced By:     Operating system

Identifying Error Codes

The following types of error codes are found in the Oracle Real Application Clusters Guard logs and trace files:

Oracle Real Application Clusters Guard Error Message Numbers

Table 6-2 shows the error ranges for Oracle Real Application Clusters Guard error messages:

Table 6-2 Oracle Real Application Clusters Guard Error Ranges

Error Range    Cause
0000-0999      Generic
1000-2999      Main layer
3000-3999      OS-dependent layer
4000-4999      Pack layer
5000-5999      PFSCTL command line
6000-6999      Instance monitor
7000-7999      Listener monitor
8000-8999      Heartbeat monitor
9000-9999      Internal Oracle Real Application Clusters Guard process and role change notification
10000-19999    Oracle Real Application Clusters Guard Setup Utility (PFSSETUP)
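
The error ranges in Table 6-2 can also be applied mechanically when scanning a long log. The following Python sketch is not part of Oracle Real Application Clusters Guard. The names PFS_ERROR_RANGES and pfs_layer are hypothetical, and the range boundaries are transcribed from Table 6-2.

```python
# Error ranges transcribed from Table 6-2: (low, high, reporting component).
PFS_ERROR_RANGES = [
    (0, 999, "Generic"),
    (1000, 2999, "Main layer"),
    (3000, 3999, "OS-dependent layer"),
    (4000, 4999, "Pack layer"),
    (5000, 5999, "PFSCTL command line"),
    (6000, 6999, "Instance monitor"),
    (7000, 7999, "Listener monitor"),
    (8000, 8999, "Heartbeat monitor"),
    (9000, 9999, "Internal process and role change notification"),
    (10000, 19999, "Setup utility (PFSSETUP)"),
]


def pfs_layer(message_id):
    """Return the Table 6-2 component for an ID such as 'PFS-6016' or 6016."""
    number = int(str(message_id).upper().replace("PFS-", ""))
    for low, high, layer in PFS_ERROR_RANGES:
        if low <= number <= high:
            return layer
    return "Unknown range"


print(pfs_layer("PFS-6016"))  # Instance monitor
print(pfs_layer("PFS-8005"))  # Heartbeat monitor
```

A message number that falls outside every range in the table is reported as "Unknown range" rather than guessed.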

Interpreting Oracle Real Application Clusters Guard Error Messages

Note the following suggestions for identifying useful messages in the Oracle Real Application Clusters Guard log files:

Example: Interpreting Oracle Real Application Clusters Guard Errors

The following is an example of messages from the Oracle Real Application Clusters Guard log file, pfs_SALES_hostA.log:

Wed Jan 10 11:57:14 2001 PFS-6014: Info: Routine connecting to instance.
Wed Jan 10 11:57:14 2001  ERROR: Encountered Oracle error while executing CONNECT --!
Wed Jan 10 11:57:14 2001  ORA-01034: ORACLE not available
ORA-27101: shared memory realm does not exist
SVR4 Error: 2: No such file or directory
Wed Jan 10 11:57:14 2001 PFS-6016: Alert: Routine failed to connect to instance.
Wed Jan 10 11:57:14 2001 PFS-6003: Warning: Routine 1 exits. Attempt to stop routine 0.
Wed Jan 10 11:57:14 2001 PFS-6006: Alert: ORACLE instance is not available. Instance 
monitor exits.

Read the log in chronological order. The first alert message is:

PFS-6016: Alert: Routine failed to connect to instance.

The Oracle Real Application Clusters Guard error number is 6016, which means that the problem concerns the instance monitor. The messages before the alert contain an ORA-01034 error:

ORA-01034: ORACLE not available

You can conclude that the pack was halted because the Oracle instance or the database is down.
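
The reading procedure in this example (scan in chronological order, stop at the first alert, and look at the Oracle errors that precede it) can be sketched in Python as follows. This is an illustration only. The function name first_alert and the regular expressions are assumptions based on the message layout shown in the log excerpt above, not a documented log format.

```python
import re

# Matches entries such as "... PFS-6016: Alert: Routine failed to connect to instance."
PFS_ENTRY = re.compile(r"PFS-(\d+): (Info|Warning|Alert): (.*)")
ORA_ERROR = re.compile(r"ORA-\d{5}: .*")


def first_alert(log_text):
    """Return (pfs_number, message, ora_errors_seen_before_it), or None."""
    ora_errors = []
    for line in log_text.splitlines():
        ora = ORA_ERROR.search(line)
        if ora:
            ora_errors.append(ora.group(0).strip())
        entry = PFS_ENTRY.search(line)
        if entry and entry.group(2) == "Alert":
            return int(entry.group(1)), entry.group(3).strip(), ora_errors
    return None


SAMPLE = """\
Wed Jan 10 11:57:14 2001 PFS-6014: Info: Routine connecting to instance.
Wed Jan 10 11:57:14 2001  ERROR: Encountered Oracle error while executing CONNECT --!
Wed Jan 10 11:57:14 2001  ORA-01034: ORACLE not available
ORA-27101: shared memory realm does not exist
SVR4 Error: 2: No such file or directory
Wed Jan 10 11:57:14 2001 PFS-6016: Alert: Routine failed to connect to instance.
"""

number, message, ora_errors = first_alert(SAMPLE)
print(number, message)
print(ora_errors)
```

For the sample, the sketch reports message 6016 together with the ORA-01034 and ORA-27101 errors that were logged before it.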

Interpreting Debugging Output

If the standard Oracle Real Application Clusters Guard logging is not generating enough information, then the Oracle Real Application Clusters Guard debug option can be used to generate more extensive output. Enable the Oracle Real Application Clusters Guard debug option by setting the PFS_DEBUGGING parameter to $PFS_TRUE.

Use debugging for the following purposes:

The output is redirected to a separate trace file to prevent overloading the Oracle Real Application Clusters Guard log file. The debugging output contains the following information:

Troubleshooting Start-Up Problems

Use the PFSBOOT command to start the packs. The PFSBOOT command performs the following steps:

  1. Check the prerequisites for executing the PFSBOOT command. None of the following conditions can exist:

    • Packs are already running.

    • The instance is running outside of the packs.

    • A failover or restart is occurring.

  2. Start the packs.

If the PFSBOOT command fails, then check the following items:

Are there errors in the Oracle Real Application Clusters Guard logs?

Are there errors in the alert logs?

Is the cluster up and running?

Is the network operating properly?

Oracle Corporation recommends setting up the call-home function to alert the user when the PFSBOOT command fails during normal processing.

See Also:

"Setting Up the Call-Home Feature" 

The Oracle Real Application Clusters Guard logs should clearly describe why the PFSBOOT command failed. You may need to stop the database manually before re-issuing the PFSBOOT command. The PFSBOOT command may also fail if the packs are running in foreign mode or if the monitors do not start successfully.

This section contains the following examples:

Example: PFSBOOT Command Fails

When you enter the PFSBOOT command, the following message may appear at the command line:

Alert: pfsboot command failed.

Diagnosis

The following output appears in the Oracle Real Application Clusters Guard log on hostA (pfs_SALES_hostA.log):

Fri Jan 12 16:15:07 2001 PFS-5014: Processing command pfsboot.
Fri Jan 12 16:15:08 2001 PFS-5074: Alert: System is not clear. Pack PFS_SALES_hostA is 
running. Use PFSCTL PFSHALT first.
Fri Jan 12 16:15:09 2001 PFS-5080: Alert: pfsboot command failed

The first alert message is:

PFS-5074: Alert: System is not clear. Pack PFS_SALES_hostA is running. Use 
PFSCTL PFSHALT first.

The message number indicates that the problem is in the PFSCTL command line. The text of the message indicates that the PFS_SALES_hostA pack is already running. Enter the STATUS command to find out the exact state of the packs:

PFSCTL> status

The following output results:

Info: Pack PFS_SALES_hostA started.
hostB
Info: Pack PFS_SALES_hostB started.
Info: Local database instance is up.
Info: Remote database instance is up.
Info: Running primary role locally.
Info: Running secondary role on remote node.
Info: Both nodes running as part of the cluster.
Info: Local node part of the cluster.
Info: Remote node part of the cluster.
Info: No internal process is running locally.
Info: No internal process is running remotely.
status command succeeded.

Solution

The STATUS command shows that both packs are running. If you want to restart the packs, then:

  1. Halt both packs. Enter the PFSHALT command:

    PFSCTL> pfshalt
    
    
  2. Start both packs. Enter the PFSBOOT command:

    PFSCTL> pfsboot
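
If the STATUS output has been saved as text, the pack names that it reports as started can be extracted before deciding whether PFSHALT is needed. The following Python sketch assumes the "Info: Pack ... started." wording shown in the output above. The function name started_packs is hypothetical.

```python
import re

# Wording taken from the STATUS output shown in this example.
PACK_STARTED = re.compile(r"Info: Pack (\S+) started\.")


def started_packs(status_output):
    """Return the names of the packs that STATUS reports as started."""
    return PACK_STARTED.findall(status_output)


SAMPLE = """\
Info: Pack PFS_SALES_hostA started.
hostB
Info: Pack PFS_SALES_hostB started.
Info: Local database instance is up.
Info: Remote database instance is up.
status command succeeded.
"""

print(started_packs(SAMPLE))
```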
    

Example: PFSBOOT Command Fails

When you enter the PFSBOOT command, the following message may appear at the command line:

Alert: pfsboot command failed.

Diagnosis

The following output appears in the Oracle Real Application Clusters Guard log on hostA (pfs_SALES_hostA.log):

Mon Jan 15 10:02:57 2001 PFS-4019: Info: Attempt to send notification that instance role 
has changed.
Mon Jan 15 10:02:58 2001 PFS-5555: SALES hostA unknown planned_up 2001/01/15-10:02:57
Mon Jan 15 10:02:58 2001 PFS-2021: Info: Calling user provided role change notification 
script: /mnt1/oracle/admin/sales/pfs/user/pfs_SALES_notifyrole.sh
Mon Jan 15 10:02:59 2001 PFS-2012: Info: User role notification script succeeded
Mon Jan 15 10:03:08 2001 PFS-4005: Info: Pack PFS_SALES_hostA starting on home node.
Mon Jan 15 10:03:09 2001 PFS-4010: Info: Attempt to initialize all variables.
Mon Jan 15 10:03:10 2001 PFS-4011: Info: Attempt to enable IP address.
Mon Jan 15 10:03:11 2001 PFS-4012: Info: Attempt to acquire disk storage.
Mon Jan 15 10:03:11 2001 PFS-4013: Info: Attempt to start public listener monitor and 
public listener SALES_hostA_LSNR.
Mon Jan 15 10:03:12 2001 PFS-7001: Info: Attempt to start private listener monitor and 
private listener SALES_hostA_PRIVLSNR.
Mon Jan 15 10:03:13 2001 PFS-2020: Info: Start monitor avmlprog SALES_hostA_LSNR 12432
Mon Jan 15 10:03:14 2001 PFS-4014: Info: Attempt to start database instance.
Mon Jan 15 10:03:14 2001 PFS-2020: Info: Start monitor avmlprog SALES_hostA_PRIVLSNR 12540
Mon Jan 15 10:03:16 2001 PFS-1000: Alert: Attempt to start Oracle instance failed.
Mon Jan 15 10:03:21 2001 PFS-5050: Alert: PFSCTL BOOTONE failed.
.
.
.
Mon Jan 15 10:03:50 2001 PFS-5064: Alert: Attempt to start primary failed.
Mon Jan 15 10:03:51 2001 PFS-5080: Alert: pfsboot command failed.

The first alert message is:

PFS-1000: Alert: Attempt to start Oracle instance failed.

The message number indicates that the problem was reported from the Oracle Real Application Clusters Guard main layer. The text of the message reports a problem with starting the Oracle instance.

The alert log (alertSALES1.log) does not show an entry for instance startup.

Try to start the database manually outside of the packs. Enter the following commands:

$ sqlplus /nolog
SQL*Plus: Release 9.0.1.0.0 - Production on Mon Jan 15 10:26:11 2001
© Copyright 2001 Oracle Corporation. All rights reserved.

SQL> connect / as sysdba
Connected to an idle instance.

SQL> startup pfile=init_SALES1_hostA.ora
LRM-00101: unknown parameter name `service_name'
ORA-01078: failure in processing system parameters

The Oracle errors indicate that the initialization parameter file contains a parameter name, service_name, that the server does not recognize. The initialization parameter is named SERVICE_NAMES.
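
When the startup output is long, the offending parameter names can be pulled out of the LRM-00101 lines. The following Python sketch assumes the message wording shown in the SQL*Plus output above. The function name unknown_parameters is hypothetical.

```python
import re

# LRM-00101 quotes the offending name, for example: unknown parameter name `service_name'
UNKNOWN_PARAM = re.compile(r"LRM-00101: unknown parameter name \W?(\w+)")


def unknown_parameters(startup_output):
    """Return the parameter names reported as unknown by LRM-00101."""
    return UNKNOWN_PARAM.findall(startup_output)


SAMPLE = """\
LRM-00101: unknown parameter name `service_name'
ORA-01078: failure in processing system parameters
"""

print(unknown_parameters(SAMPLE))
```

Each name returned can then be compared against the spelling used in the initialization parameter file.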

Solution

Correct the problem with the initialization parameter. Restart the packs:

PFSCTL> pfsboot

Example: PFSBOOT Starts and Fails

When you enter the PFSBOOT command, the resulting output shows that the command succeeded:

PFSCTL> pfsboot
pfsboot command succeeded.

When you enter the STATUS command, the following output may result:

hostA
Info: Pack PFS_SALES_hostA started.
hostB
Info: Pack PFS_SALES_hostB started.
Info: Local database instance is up.
Info: Remote database instance is up.
Info: Running primary role locally.
Info: Running secondary role on remote node.
Info: Both nodes running as part of the cluster.
Info: Local node part of the cluster.
Info: Remote node part of the cluster.
Info: No internal process is running locally.
Info: No internal process is running remotely.
status command succeeded.

The output shows that although the PFSBOOT command started the instances, it shut down before starting other processes.

Diagnosis

If the packs start successfully and then shut down, then the following scenarios are possible:

Examine the Oracle Real Application Clusters Guard log, the database log, and the trace files for errors:

Mon Jan 15 14:37:15 2001 PFS-4019: Info: Attempt to send notification that instance role 
has changed.
Mon Jan 15 14:37:26 2001 PFS-4005: Info: Pack PFS_SALES_hostA starting on home node.
Mon Jan 15 14:37:27 2001 PFS-4010: Info: Attempt to initialize all variables.
Mon Jan 15 14:37:28 2001 PFS-4011: Info: Attempt to enable IP address.
Mon Jan 15 14:37:28 2001 PFS-4012: Info: Attempt to acquire disk storage.
Mon Jan 15 14:37:29 2001 PFS-4013: Info: Attempt to start public listener monitor and 
public listener SALES_hostA_LSNR.
Mon Jan 15 14:37:30 2001 PFS-7001: Info: Attempt to start private listener monitor and 
private listener SALES_hostA_PRIVLSNR.
Mon Jan 15 14:37:30 2001 PFS-2020: Info: Start monitor avmlprog SALES_hostA_LSNR 8964
Mon Jan 15 14:37:32 2001 PFS-4014: Info: Attempt to start database instance.
Mon Jan 15 14:37:32 2001 PFS-2020: Info: Start monitor avmlprog SALES_hostA_PRIVLSNR 9069
Mon Jan 15 14:37:57 2001 PFS-4032: Info: Check if ACTIVE_INSTANCE_COUNT is set to 1.
Mon Jan 15 14:37:58 2001 PFS-4015: Info: Attempt to start instance monitor.
Mon Jan 15 14:37:59 2001 PFS-4016: Info: Attempt to check INSTANCE_ROLE.
Mon Jan 15 14:38:00 2001 PFS-2020: Info: Start monitor avmuprog SALES 9557
Mon Jan 15 14:38:01 2001 PFS-1001: Info: INSTANCE_ROLE is primary_instance.
Mon Jan 15 14:38:02 2001 PFS-4017: Info: Attempt to start ORACLE_PING.
Mon Jan 15 14:38:03 2001 PFS-2020: Info: Start monitor avmpprog SALES 9745
Mon Jan 15 14:38:03 2001 PFS-4018: Info: Attempt to enable pack switching.
Mon Jan 15 14:38:04 2001 PFS-4019: Info: Attempt to send notification that instance role 
has changed.
Mon Jan 15 14:38:04 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:38:05 2001 PFS-5555: SALES hostA primary up 2001/01/15-14:38:04
Mon Jan 15 14:38:05 2001 PFS-2021: Info: Calling user provided role change notification 
script: /mnt1/oracle/admin/sales/pfs/user/pfs_SALES_notifyrole.sh
Mon Jan 15 14:38:06 2001 PFS-2012: Info: User role notification script succeeded
Mon Jan 15 14:38:06 2001 PFS-9900: Info: Attempt to start role change notification.
Mon Jan 15 14:38:08 2001 PFS-4020: Info: Attempt to start DBMS_JOBS.
Mon Jan 15 14:38:08 2001 PFS-2020: Info: Start monitor avmrprog SALES 10010
Mon Jan 15 14:38:09 2001 PFS-4004: Info: Run method on home node completed.
Mon Jan 15 14:38:13 2001 PFS-5002: PFSCTL BOOTONE succeeded.
Mon Jan 15 14:38:15 2001 PFS-4019: Info: Attempt to send notification that instance role 
has changed.
Mon Jan 15 14:38:15 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:38:26 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:38:36 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:38:47 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:38:58 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:39:09 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:39:10 2001 PFS-5002: PFSCTL BOOTONE succeeded.
Mon Jan 15 14:39:11 2001 PFS-5007: PFSCTL PFSBOOT succeeded.

Mon Jan 15 14:39:12 2001 PFS-3000: Info: Pack PFS_SALES_hostA started.
Mon Jan 15 14:39:14 2001 PFS-3000: Info: Pack PFS_SALES_hostB started.
Mon Jan 15 14:39:15 2001 PFS-3002: Info: Local database instance is up.
Mon Jan 15 14:39:17 2001 PFS-3004: Info: Remote database instance is up.
Mon Jan 15 14:39:19 2001 PFS-3006: Info: Running primary role locally.
Mon Jan 15 14:39:19 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:39:22 2001 PFS-3010: Info: Running secondary role on remote node.
Mon Jan 15 14:39:26 2001 PFS-3012: Info: Both nodes running as part of the cluster.
Mon Jan 15 14:39:29 2001 PFS-3013: Info: Local node part of the cluster.
Mon Jan 15 14:39:30 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:39:31 2001 PFS-3014: Info: Remote node part of the cluster.
Mon Jan 15 14:39:32 2001 PFS-3072: Info: No internal process is running locally.
Mon Jan 15 14:39:33 2001 PFS-3073: Info: No internal process is running remotely
Mon Jan 15 14:39:34 2001 PFS-5015: pfsboot command succeeded.
Mon Jan 15 14:39:41 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:39:51 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:40:02 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:40:13 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:40:23 2001 PFS-8005: Alert: MTS service or instance sales1 not registered 
with SALES_hostA_LSNR in 120 seconds. Exit. Mon Jan 15 14:40:24 2001 
Mon Jan 15 14:40:25 2001 PFS-2019: Info: PFS callhome with Oraping_monitor_exits now.
Mon Jan 15 14:40:25 2001 PFS-2019: Info: PFS callhome with Oraping_for_SALES_error_Will_
failover now.
Mon Jan 15 14:40:26 2001 PFS-2019: Info: PFS callhome with Failing_over_service_SALES now.
Mon Jan 15 14:40:30 2001 PFS-4007: Info: Pack PFS_SALES_hostA stopping on home node.
Mon Jan 15 14:40:31 2001 PFS-4019: Info: Attempt to send notification that instance role 
has changed.
Mon Jan 15 14:40:31 2001 PFS-5555: SALES hostA primary down 2001/01/15-14:40:31
Mon Jan 15 14:40:32 2001 PFS-2021: Info: Calling user provided role change notification 
script: /mnt1/oracle/admin/sales/pfs/user/pfs_SALES_notifyrole.sh
Mon Jan 15 14:40:32 2001 PFS-2012: Info: User role notification script succeeded
Mon Jan 15 14:40:33 2001 PFS-4028: Info: Attempt to halt instance monitor.
Mon Jan 15 14:40:34 2001 PFS-4029: Info: Attempt to halt ORACLE_PING.
Mon Jan 15 14:40:35 2001 PFS-3064: Info: Service PFS_SALES_Ping_hostA has already been 
stopped.
Mon Jan 15 14:40:35 2001 PFS-9902: Info: Attempt to stop role change notification
Mon Jan 15 14:40:36 2001 PFS-4027: Info: Attempt to archive, checkpoint, and dump database.
Mon Jan 15 14:40:45 2001 PFS-1012: Info: Local and remote ORACLE systemstates dumped to 
USER_DUMP_DEST.
Mon Jan 15 14:40:45 2001 PFS-4026: Info: Attempt to abort database.
Mon Jan 15 14:40:46 2001 PFS-4019: Info: Attempt to send notification that instance role 
has changed.
Mon Jan 15 14:40:47 2001 PFS-5555: SALES hostA primary cleanup 2001/01/15-14:40:46
Mon Jan 15 14:40:47 2001 PFS-2021: Info: Calling user provided role change notification 
script: /mnt1/oracle/admin/sales/pfs/user/pfs_SALES_notifyrole.sh
Mon Jan 15 14:40:48 2001 PFS-2012: Info: User role notification script succeeded
Mon Jan 15 14:40:49 2001 PFS-2003: Info: Attempt to start internal PFS process on primary 
instance.
Mon Jan 15 14:40:49 2001 PFS-4025: Info: Attempt to halt public listener monitor
and public listener SALES_hostA_LSNR.
Mon Jan 15 14:40:50 2001 PFS-7003: Info: Attempt to halt private listener monitor and 
private listener SALES_hostA_PRIVLSNR.
Mon Jan 15 14:40:51 2001 PFS-2017: Info: Start to clean up PFS processes.
Mon Jan 15 14:40:52 2001 PFS-4024: Info: Attempt to release disk storage.
Mon Jan 15 14:40:52 2001 PFS-2015: Info: Stop process ./avmlmon.sh_SALES_hostA_LSNR 
succeeded. 
Mon Jan 15 14:40:52 2001 PFS-4022: Info: Attempt to disable IP address.
Mon Jan 15 14:40:53 2001 PFS-2015: Info: Stop process SALES_hostA_LSNR succeeded
Mon Jan 15 14:40:53 2001 PFS-4030: Info: Halt method on home node completed.
Mon Jan 15 14:40:54 2001 PFS-2015: Info: Stop process ./avmlmon.sh_SALES_hostA_PRIVLSNR 
succeeded.
Mon Jan 15 14:40:54 2001 PFS-2015: Info: Stop process  SALES_hostA_PRIVLSNR succeeded.

The first warning is:

Mon Jan 15 14:38:04 2001 PFS-8002: Warning: Instance sales1 is not registered 
with SALES_hostA_LSNR.

The first alert is:

Mon Jan 15 14:40:23 2001 PFS-8005: Alert: MTS service or instance sales1 not 
registered with SALES_hostA_LSNR in 120 seconds. Exit. Mon Jan 15 14:40:24 2001 

The message numbers are in the 8000 range, so the problem has been reported from the heartbeat monitor. The message text indicates that there is a problem with service registration. The instance failed to register with the listener within 120 seconds.
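
To see how long the heartbeat monitor reported the missing registration before it exited, the timestamps of the first PFS-8002 warning and the PFS-8005 alert can be compared. The following Python sketch assumes the timestamp layout shown in the log above. The function name registration_wait_seconds is hypothetical, and the sample contains only a few of the log entries.

```python
import re
from datetime import datetime

# A log entry starts with a timestamp such as "Mon Jan 15 14:38:04 2001".
STAMPED = re.compile(r"^(\w{3} \w{3} +\d+ \d\d:\d\d:\d\d \d{4}) PFS-(\d+):")


def registration_wait_seconds(log_text):
    """Seconds from the first PFS-8002 warning to the PFS-8005 alert, or None."""
    first_warning = alert = None
    for line in log_text.splitlines():
        match = STAMPED.match(line)
        if not match:
            continue  # continuation line or entry without a PFS number
        when = datetime.strptime(match.group(1), "%a %b %d %H:%M:%S %Y")
        code = match.group(2)
        if code == "8002" and first_warning is None:
            first_warning = when
        elif code == "8005":
            alert = when
            break
    if first_warning is None or alert is None:
        return None
    return int((alert - first_warning).total_seconds())


SAMPLE = """\
Mon Jan 15 14:38:04 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:38:15 2001 PFS-8002: Warning: Instance sales1 is not registered with SALES_
hostA_LSNR.
Mon Jan 15 14:40:23 2001 PFS-8005: Alert: MTS service or instance sales1 not registered
with SALES_hostA_LSNR in 120 seconds. Exit.
"""

print(registration_wait_seconds(SAMPLE))  # 139
```

The sketch measures only the gap between the two logged entries. It does not assume how the monitor itself counts the 120 seconds mentioned in the alert.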

Check the environment variable and the initialization parameters that affect service registration:

Is the TNS_ADMIN environment variable set correctly in the $PFS_HOME/include/$ORACLE_SERVICE.env file?

Are the following initialization parameters set correctly?

SERVICE_NAMES
ACTIVE_INSTANCE_COUNT
INSTANCE_NAME

Does the LOCAL_LISTENER parameter (for dedicated connections) specify a valid alias in the $TNS_ADMIN/tnsnames.ora file?

Does the LISTENER attribute of the DISPATCHERS parameter (for multi-threaded server connections) specify a valid alias in the $TNS_ADMIN/tnsnames.ora file?

For example, in a dedicated configuration:

Parameter                 Value
ORACLE_SERVICE            SALES
Relocatable IP address    144.25.28.74
Port                      1524

If LOCAL_LISTENER is defined in the SALES_config.hostA.ded.pfs file as follows:

LOCAL_LISTENER=listener_SALES_hostA 

then listener_SALES_hostA must be resolved properly in the tnsnames.ora file:

listener_SALES_hostA=
     (ADDRESS=(PROTOCOL=TCP)(HOST=144.25.28.74)(PORT=1524))
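
The alias check described above can be sketched in Python as follows. This is a simplified illustration, not a full tnsnames.ora parser: it assumes that each alias starts in the first column of a line and that aliases are compared without regard to case. The function names tns_aliases and local_listener_resolves are hypothetical.

```python
import re

# An alias definition is assumed to start in column 1 and be followed by "=".
ALIAS_LINE = re.compile(r"^([A-Za-z][\w.-]*)\s*=")


def tns_aliases(tnsnames_text):
    """Return the lowercased alias names defined in the tnsnames.ora text."""
    aliases = set()
    for line in tnsnames_text.splitlines():
        match = ALIAS_LINE.match(line)
        if match:
            aliases.add(match.group(1).lower())
    return aliases


def local_listener_resolves(local_listener, tnsnames_text):
    """True if the LOCAL_LISTENER value names an alias that is defined."""
    return local_listener.strip().lower() in tns_aliases(tnsnames_text)


SAMPLE = """\
listener_SALES_hostA=
     (ADDRESS=(PROTOCOL=TCP)(HOST=144.25.28.74)(PORT=1524))
"""

print(local_listener_resolves("listener_SALES_hostA", SAMPLE))
```

The sketch confirms only that the alias exists. It does not verify that the address and port match the relocatable IP address and listener port.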

Solution

There are several causes of failed service registration. The best practice is to look for the simplest solutions first. For example, it is common for service registration to fail because the LOCAL_LISTENER parameter is not set correctly. Ensure that the value of the LOCAL_LISTENER parameter in the initialization parameter file (init.ora) matches the entry in the tnsnames.ora file.

Troubleshooting Command Line Problems

If you cannot invoke the PFSCTL command line, then check the following conditions:

Was Oracle Real Application Clusters Guard installed and deployed correctly?

The oracle user must install the Oracle Real Application Clusters Guard software. If another user installed the Oracle Real Application Clusters Guard software, then re-install the Oracle Real Application Clusters Guard software as oracle.

Are you logged on as the super user (root)?

If you are not logged on as root, then you will see output similar to the following:

      PFSCTL for hostA: Version 9.0.1- Production on  Jan 15 2001 16:49:59
        (c) Copyright 2001 , Oracle Corporation.  All rights reserved.
Welcome to PFSCTL. Type HELP for additional information.
pfsctl[38]: /home_oracle/901_sales/pfs/bin/PFSCTL.log: cannot create

ORACLE_SERVICE is set to SALES
DB_NAME is set to sales

PFSCTL>

Are the following environment variables set?

ORACLE_SERVICE
DB_NAME

If ORACLE_SERVICE is not set, then you will see output similar to the following:

       PFSCTL for hostA:Version 9.0.1- Production on  Jan 15 2001 16:47:30
       (c) Copyright 2001, Oracle Corporation. All rights reserved.
Welcome to PFSCTL. Type HELP for additional information.

Alert: ORACLE_SERVICE is not set. Set it and run PFSCTL again.

If DB_NAME is not set, then you will see output similar to the following:

       PFSCTL for hostA:Version 9.0.1- Production on  Jan 15 2001 16:47:30
       (c) Copyright 2001, Oracle Corporation. All rights reserved.
Welcome to PFSCTL. Type HELP for additional information.

Alert: DB_NAME is not set. Set it and run PFSCTL again.

Is $ORACLE_HOME/pfs/bin in the PATH variable?

If $ORACLE_HOME/pfs/bin is not in the PATH variable, then you will see output similar to the following:

# pfsctl
pfsctl: command not found

To correct this, either execute the PFSCTL command line utility from the $ORACLE_HOME/pfs/bin directory or add $ORACLE_HOME/pfs/bin to the PATH variable.
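
The checks in this section can be collected into one pre-flight test. The following Python sketch covers only the conditions listed above (root user, ORACLE_SERVICE, DB_NAME, and PATH). The function name pfsctl_environment_problems and the wording of the returned strings are hypothetical.

```python
def pfsctl_environment_problems(env, is_root):
    """Return a list of the conditions from this section that are not met.

    env is a mapping of environment variables; is_root says whether the
    current user is the super user.
    """
    problems = []
    if not is_root:
        problems.append("not logged on as root")
    for name in ("ORACLE_SERVICE", "DB_NAME"):
        if not env.get(name):
            problems.append(name + " is not set")
    oracle_home = env.get("ORACLE_HOME", "").rstrip("/")
    # PATH entries are separated by colons on UNIX systems.
    path_dirs = [d.rstrip("/") for d in env.get("PATH", "").split(":")]
    if not oracle_home or oracle_home + "/pfs/bin" not in path_dirs:
        problems.append("$ORACLE_HOME/pfs/bin is not in PATH")
    return problems
```

On a UNIX system the function could be called as pfsctl_environment_problems(dict(os.environ), os.geteuid() == 0) after importing the os module. An empty list means that none of the conditions above was detected.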

Troubleshooting Oracle Real Application Clusters Guard Monitors

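This section contains the following topics:

  • Troubleshooting the Heartbeat Monitor

  • Troubleshooting the Instance Monitor

  • Troubleshooting the Listener Monitor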

Troubleshooting the Heartbeat Monitor

If the heartbeat monitor is not operating properly, then check the following items:

Is the heartbeat monitor running?

The heartbeat monitor runs on the primary and secondary nodes. Check for its existence as follows:

$ ps -fu owner | grep avmpmon.sh

It should have a child process called avmping.

Are there error messages or messages about restarting the heartbeat monitor in the Oracle Real Application Clusters Guard logs?

For example, these messages may appear in pfs_SALES_hostA_ping.log:

Tue Jan 16 09:11:57 2001 PFS-8500: Info: Oracle Real Application Clusters 
Guard: oracle-ping v90100.
Tue Jan 16 09:11:57 2001 PFS-8501: Info: Rampup flag is set ON.
Tue Jan 16 09:11:57 2001 PFS-8503: Info: ORACLE_PING started.. Check 
PFS$ORAPING_HEARTBEAT view.
Tue Jan 16 09:11:57 2001  ERROR : Encountered Oracle error while executing 
SELECT--!
Tue Jan 16 09:11:57 2001  ORA-00942: table or view does not exist

Are the heartbeat monitor tables in the database?

Look for oraping_heartbeat and oraping_config, whose owner is OPS$owner. The tables are created by $ORACLE_HOME/pfs/admin/catpfs.sql.

Troubleshooting the Instance Monitor

If the instance monitor is not operating properly, then check the following items:

Is the instance monitor running?

The instance monitor runs on the primary and secondary nodes. Check for its existence by looking for the following message in the pfs_$ORACLE_SERVICE_host.log:

Tue Jan 16 09:28:31 2001 PFS-2020: Info: Start monitor avmuprog SALES 5312

There should be a child process for avmuprog (PID 5312) called avmumon. Check for its existence with the following command:

$ ps -fu PFS_owner | grep avmumon

Are the instance monitor tables in the database?

Look for pfs_up_$INSTANCE_NAME, owned by owner. There should be two tables, one for each instance name. They are created by $ORACLE_HOME/pfs/admin/catpfs.sql.

Are there error messages in the Oracle Real Application Clusters Guard log?

Troubleshooting the Listener Monitor

If the listener monitor is not operating properly, then check the following items:

Is the TNS_ADMIN environment variable set correctly?

Check the settings in the $PFS_HOME/include/$ORACLE_SERVICE.env and $PFS_HOME/include/PFS_$ORACLE_SERVICE_User.def files.

Are the listener names correct?

Check the listener.ora and tnsnames.ora files under the $TNS_ADMIN directory.

Are the relocatable IP addresses enabled?

Use the netstat UNIX command to check. For example, if your relocatable IP address is 139.185.141.55, make sure that you see an entry for it in the netstat output. Enter the following command:

$ netstat -rn

You should see output similar to the following:

Routing Table: IPv4
  Destination           Gateway           Flags  Ref   Use   Interface
-------------------- -------------------- ----- ----- ------ ---------
204.152.65.16        204.152.65.17         U        1    465  hme2
204.152.65.0         204.152.65.1          U        1    465  hme1
204.152.65.32        204.152.65.33         U        1    465  hme1:1
139.185.141.0        139.185.141.99        U        1   1200  hme0
139.185.141.0        139.185.141.106       U        1      0  hme0:6
139.185.141.0        139.185.141.105       U        1      0  hme0:5
139.185.141.0        139.185.141.55        U        1      0  hme0:7
224.0.0.0            139.185.141.99        U        1      0  hme0
default              139.185.141.1         UG       1   3117
127.0.0.1            127.0.0.1             UH       4 729224  lo0  

See Also: Table 6-3

Does the listener monitor exist?

For example, find the listener monitor PID in the pfs_$ORACLE_SERVICE_host.log:

Mon Feb 12 16:02:42 2001 PFS-2020: Info: Start monitor avmlprog sales_
hostA_LSNR 17878

17878 is the listener monitor PID.

Enter the following command to find out whether the listener monitor exists:

$ ps -ef | grep 17878

You should see output similar to the following:

root 17937 17878  0 16:02:42 ? 0:08 /bin/ksh ./avmlmon.sh sales_hostA_LSNR

Are there error messages in the Oracle Real Application Clusters Guard log?

For example, in the pfs_SALES_hostA.log:

Mon Jan 15 13:29:00 2001 PFS-4037: Alert: Attempt to start listener 
monitor failed.
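
The process check in the "Does the listener monitor exist?" item can be made stricter than a plain grep, which matches the PID anywhere on a line. The following Python sketch selects only the rows of captured ps -ef output whose parent PID equals the monitor PID. It assumes the column order UID, PID, PPID, C, STIME, TTY, TIME, CMD shown in the sample output above. The function name child_processes is hypothetical.

```python
def child_processes(ps_ef_output, parent_pid):
    """Return (pid, command) for each ps -ef row whose PPID equals parent_pid."""
    children = []
    for line in ps_ef_output.splitlines():
        fields = line.split(None, 7)  # UID PID PPID C STIME TTY TIME CMD
        if len(fields) < 8:
            continue  # too short to be a process row
        if not (fields[1].isdigit() and fields[2].isdigit()):
            continue  # header row
        if int(fields[2]) == parent_pid:
            children.append((int(fields[1]), fields[7]))
    return children


SAMPLE = """\
     UID   PID  PPID  C    STIME TTY      TIME CMD
root 17937 17878  0 16:02:42 ? 0:08 /bin/ksh ./avmlmon.sh sales_hostA_LSNR
"""

print(child_processes(SAMPLE, 17878))
```

If the start time of a process is printed as two words, the command column shifts by one field. The PID and PPID columns are not affected.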

Troubleshooting the System Outside of the Packs

The packs cannot solve underlying performance or stability problems in the system. If such problems exist, then you must solve them outside of the packs. To troubleshoot outside of the packs, follow these steps:

  1. Stop the packs. Enter the following command:

    PFSCTL> pfshalt
    
    
  2. Enable the relocatable IP addresses and storage groups. See Table 6-3.

  3. Start the listeners that listen on the relocatable IP addresses.

  4. Start the Oracle9i Real Application Clusters database.

  5. Ensure that the instances are registered with the listeners.

  6. Try to reproduce the problem to locate its source. Solve the problem.

  7. Shut down the listeners.

  8. Shut down the Oracle9i Real Application Clusters database.

  9. Disable the relocatable IP addresses and storage groups. See Table 6-3.

  10. Start the packs. Enter the following command:

    PFSCTL> pfsboot
    
Table 6-3 Enabling and Disabling IP Addresses and Storage Groups on Sun and HP Platforms

Action:        Enable IP address
HP Platform:   # cmmodnet -a -i IP_address IP_subnet
Sun Platform:  1. List the available network interface:

                    # ifconfig IF:N IP_address up

                  where IF is the interface (for example, hme0) and N is an integer greater than 0

               2. Enable the IP address on the new interface:

                    # ifconfig IF addif IP_address netmask mask up

Action:        Disable IP address
HP Platform:   # cmmodnet -r -i IP_address IP_subnet
Sun Platform:  # ifconfig IF removeif IP_address netmask mask down

Action:        Enable storage group
HP Platform:   # vgchange -a s volume_group
Sun Platform:  Done automatically by Sun Cluster Manager

Action:        Disable storage group
HP Platform:   # vgchange -a n volume_group
Sun Platform:  Done automatically by Sun Cluster Manager

Example: Enabling an IP Address on the Sun Platform

To enable 144.25.28.74 as a relocatable IP address, enter the following command:

# ifconfig hme0:1 144.25.28.74 up

Display the IP addresses by entering the following command:

# ifconfig -a

You should see output similar to the following:

lo0: flags=1000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4> mtu 8232 index 1
        inet 127.0.0.1 netmask ff000000
hme0: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
        inet 144.25.28.70 netmask fffffc00 broadcast 144.25.31.255
hme0:1: flags=1000862<BROADCAST,NOTRAILERS,RUNNING,MULTICAST,IPv4> mtu 1500 ind2
        inet 144.25.28.74 netmask fffffc00 broadcast 144.25.31.255
hme1: flags=1008863<UP,BROADCAST,NOTRAILERS,RUNNING,MULTICAST,PRIVATE,IPv4> mtu3
        inet 204.152.65.1 netmask fffffff0 broadcast 204.152.65.15
hme1:1: flags=1008843<UP,BROADCAST,RUNNING,MULTICAST,PRIVATE,IPv4> mtu 1500 ind3
        inet 204.152.65.33 netmask fffffff0 broadcast 204.152.65.47
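
To confirm that the relocatable IP address appears on the expected logical interface, captured ifconfig -a output can be reduced to a mapping of interface names to addresses. The following Python sketch assumes the two-line layout shown above (an interface line followed by an inet line). The function name interface_addresses is hypothetical, and the sample is a shortened copy of the output above.

```python
import re


def interface_addresses(ifconfig_output):
    """Map each interface name to the IPv4 address on its inet line."""
    addresses = {}
    current = None
    for line in ifconfig_output.splitlines():
        header = re.match(r"^(\S+): flags=", line)
        if header:
            current = header.group(1)  # for example "hme0" or "hme0:1"
            continue
        inet = re.match(r"^\s+inet (\d+\.\d+\.\d+\.\d+)", line)
        if inet and current:
            addresses[current] = inet.group(1)
    return addresses


SAMPLE = """\
lo0: flags=1000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4> mtu 8232 index 1
        inet 127.0.0.1 netmask ff000000
hme0: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
        inet 144.25.28.70 netmask fffffc00 broadcast 144.25.31.255
hme0:1: flags=1000862<BROADCAST,NOTRAILERS,RUNNING,MULTICAST,IPv4> mtu 1500 ind2
        inet 144.25.28.74 netmask fffffc00 broadcast 144.25.31.255
"""

print(interface_addresses(SAMPLE).get("hme0:1"))
```

The sketch reports only which address is configured on each interface. It does not interpret the flags field.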

Example: Enabling an IP Address on the HP Platform

To enable 195.1.1.150 as a relocatable IP address, enter the following command:

# cmmodnet -a -i 195.1.1.150 195.1.1.0

Display the IP address by entering the following command:

$ netstat -in

You should see output similar to the following:

Name     Mtu     Network          Address          Ipkts    Opkts
lan2     1500    192.1.1.0        192.1.1.3        81859    40987
lan5:1   1500    195.1.1.0        195.1.1.150      0        0
lan0     1500    139.185.141.0    139.185.151.34   22782    23614
lo0      4136    127.0.0.0        127.0.0.1        30084    30084
lan5     1500    195.1.1.0        195.1.1.3        81855    40984


Copyright © 1996-2001, Oracle Corporation.

All Rights Reserved.