24 5.1.6 Recovery Scenario 6 (Database Recovery)

5.1.6.1 Recovery Scenario 6: Case 1

For a partial outage with

Cloud Disaster Recovery Guide

  • Server having a corrupted database.
  • Replication channel from parent is inhibited because of upgrade activity.
  • Server is in a different release then that of its Active parent because of upgrade activity.
  • Verify that the Server Runtime backup files, performed at the start of the upgrade, are present in /var/TKLC/db/filemgmt area in the following format
    • Backup.DSR.HPC02-NO2.FullDBParts.NETWORK_OAMP.20140524_223507.UPG.tar.bz2
    • Backup.DSR.HPC02-NO2.FullRunEnv.NETWORK_OAMP.20140524_223507.UPG.tar.bz2

Note:

During recovery, the corrupted Database will get replaced by the sever Runtime backup. Any configuration done after taking the backup will not be visible post recovery.

This procedure performs recovery if database is corrupted in the system.

Check off () each step as it is completed. Boxes have been provided for this purpose under each step number.

If this procedure fails, contact My Oracle Support (MOS), and ask for assistance.

Table 24-1 Procedure 10

STEP # Procedure Description
1 Workarounds

Refer to Workarounds for Issues not fixed in this Release to understand/apply any workarounds required during this procedure.

2 NOAM VIP GUI: Set Failed Servers to OOS

Navigate to Main Menu -> Status & Manage -> HA

Figure 24-1 HA


HA

Select Edit

Set the Max Allowed HA Role drop down box to OOS for the failed servers.

Select Ok

Figure 24-2 Cancel


Cancel

3 Server in Question: Login

Establish an SSH session to the server in question. Login as admusr user.

4 Server in Question: Change runlevel to 3

Execute the following command to bring the system to runlevel 3:

Figure 24-3 init3


init3

5 Server in Question: Recover System
Execute the following command and follow the instructions appearing the console prompt:

Figure 24-4 Backout restore


Backout restore

6 Server in Question: Change runlevel to 4

Execute the following command to bring the system back to runlevel 4:

Figure 24-5 Init 6


Init 6

7 Server in Question: Verify the server

Execute the following command to verify if the processes are up and running:

Figure 24-6 getprocs


getprocs

8 NOAM VIP GUI: Set Failed Servers to Active

Figure 24-7 HA


HA

Click on Edit at the bottom of the screen.

For each failed server whose Max Allowed HA Role is set to OOS, set it to Active.

Press OK.

9 Backup and Archive All the Databases from the Recovered System

Execute DSR Database Backup to back up the Configuration databases:

5.1.6.2 Recovery Scenario 6: Case 2

For a partial outage with

  • Server having a corrupted database.
  • Replication channel is not inhibited.
  • Server has the same release as that of its Active parent.

This procedure performs recovery if database got corrupted in the system and system is in the state to get replicated

Check off () each step as it is completed. Boxes have been provided for this purpose under each step number.

If this procedure fails, contact My Oracle Support (MOS), and ask for assistance.

Table 24-2 Procedure 11

STEP # Procedure Description
1 Workarounds

Refer to Release to understand/apply any workarounds required during this procedure.

2 NOAM VIP GUI: Set Failed Servers to OOS

Navigate to Main Menu -> Status & Manage -> HA

Figure 24-8 HA


HA

Select Edit

Set the Max Allowed HA Role drop down box to OOS for the failed servers.

Select Ok

3 Server in Question: Login

Establish an SSH session to the server in question. Login as admusr user.

4 Server in Question: Take Server out of Service

Execute the following command to take the server out of service.

Figure 24-9 Clobber


Clobber

5 Server in Question: Take Server to DbUp State and Start the Application

Execute the following commands to take the server to Dbup and start the DSR application:

Figure 24-10 sudo-prod-start


sudo-prod-start

6 Server in Question: Verify the Server State

Execute the following commands to verify the processes are up and running:

Figure 24-11 get procs


get procs

Execute the following command to verify if replication channels are up and running:

Figure 24-12 irep stat


irep stat

Execute the following command to verify if merging channels are up and running:

Figure 24-13 inetmstat


inetmstat

7 NOAM VIP GUI: Restart DSR application

Navigate to Main Menu->Status & Manage->Server,

Figure 24-14 Server


Server

Select each recovered server and click on Restart.

Figure 24-15 Restart


Restart

8 NOAM VIP GUI: Set Failed Servers to Active

Figure 24-16 HA


HA

Click on Edit at the bottom of the screen

For each failed server whose Max Allowed HA Role is set to OOS, set it to Active

Press OK

9 Backup and Archive All the Databases from the Recovered System

Execute DSR Database Backup to back up the Configuration databases: