Working With Disaster Recovery Plans
A DR plan describes the operations that must be performed on the resources that are under the protection of the disaster recovery service. A DR plan is associated with a DR configuration, and is executed by an administrator either when a site-level incident is detected (failover), or when one of the sites must be taken offline (switchover). After a failover, when the affected system is back online, postfailover operations are performed to ensure that both systems are ready to run new DR operations.
About DR Operations and Default Plans
The native DR service provides plans with default steps for each type of operation. DR plan steps can be customized. The built-in plans are configured as follows:
- Switchover Plan
-
When a switchover is performed, there is no outage, so both peered systems are online. The goal is to move all resources covered in the DR configuration from the primary system (A) to the standby system (B). When completed, system B becomes the primary and system A the standby for the resources in question.
The plan starts with prechecks to ensure that both systems meet the requirements to allow compute instances to be stopped on the primary system and started again on the standby system. The prechecks include site mappings as well as other critical elements, such as tags, security lists, or network security groups. The role reversal precheck specifically ensures that the ZFS Storage Appliance in each rack is in the correct state.
When the prechecks are completed without errors, the DR configuration on the primary system (A) is frozen and its compute instances are stopped, so the role reversal can begin. Based on resource metadata exchanged between the peered systems, and replicated data on the standby ZFS Storage Appliance, the target system (B) is prepared to assume the primary role for the instances in the DR configuration. The replication process is reversed and ready to use the source system (A) as its standby as soon as the switchover is complete.
Using the replicated volumes, the compute instances in the DR configuration are launched on the standby system (B). An identical DR configuration is created on the standby system, with all source and target resources in the site mappings inverted. The metadata of the newly launched instances is stored in the DR configuration. On the primary system (A) a cleanup is performed: the DR configuration is disabled and its compute instances are terminated.
To complete the switchover, data replication from the new primary system (B) to the standby system (A) is started, the DR plans are moved to the new standby system (A), and the storage project and metadata associated with the original DR configuration are deleted from system A.
- Failover Plan
-
A failover is performed on the standby system, when one of the peered systems goes down. The goal is to recover all resources covered in the DR configuration on the standby system (B), allowing continuation of service. The failover steps are similar to the switchover plan, but none of the operations on the primary system (A) can be performed. The primary system cannot be cleaned up until it comes back online.
The plan starts with prechecks to ensure that the standby system and its ZFS Storage Appliance are in the correct state to bring up the resources covered in the DR configuration. When the prechecks are completed without errors, the role reversal begins.
Using the replicated metadata and resources, the compute instances in the DR configuration are launched on the standby system (B), which assumes the primary role. An identical DR configuration is created on system B, which has become the primary, with inverted site mappings and metadata collected from the newly launched instances. In preparation of the original primary system (A) coming back online, the replication process is reversed and ready to use system A as the standby.
When the original primary system (A) eventually comes online, the remaining steps to return the DR configuration to a correct working state are performed by executing the postfailover plan.
- Postfailover Plan
-
A postfailover plan is performed after a failover, when the system that experienced an outage comes back online, and the peer connection is restored. The goal is to clean up the DR configuration on the primary system that went down (A), and set it up as the standby for the new primary system (B).
There are no prechecks in a postfailover plan. System A is back online after an outage and needs to be cleaned up: the DR configuration is disabled and its compute instances are terminated. Data replication from the new primary system (B) to the standby system (A) is started, the DR plans are moved to the new standby system (A), and the storage project and metadata associated with the original DR configuration are deleted from system A.
To move resources that were originally hosted on system A back from system B, the administrator must perform a switchover from B to A for the relevant DR configuration(s).
Creating and Maintaining DR Plans
Each DR plan defines the steps to perform when a DR operation is executed: failover,
switchover, or postfailover. Each DR configuration has an associated DR plan file. This file
can contain multiple DR plans, each identified by a unique name assigned by the administrator
during creation. The DR plan file is stored in JSON format in the directory
/mnt/dr_metadata
on the standby rack.
Creating a DR Plan
The DR service provides default plans for each standard operation. You create them with a simple command to get started. If necessary, you can customize the plan steps afterward.
- Using the Service CLI
-
-
Look up the ID of the DR configuration for which you want to create a DR plan (
drGetConfigs
). -
Create a default DR plan with the
create DrPlan
command.Typically, a DR configuration has associated DR plans for each operation type. In this example, plans are created for failover, switchover, and postfailover.
PCA-ADMIN> create DrPlan drConfigId=6e797d8b-7245-4d49-8e68-bf67f2d53041 operation=SWITCHOVER planName=sw1 JobId: eae66f69-7b99-420e-b324-7d8964b2202b Data: DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1. Successfully started job for DR Plan Create for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041 PCA-ADMIN> create DrPlan drConfigId=6e797d8b-7245-4d49-8e68-bf67f2d53041 operation=FAILOVER planName=fo1 PCA-ADMIN> create DrPlan drConfigId=6e797d8b-7245-4d49-8e68-bf67f2d53041 operation=POSTFAILOVER planName=pfo1
-
Use the job ID to check the status of the operation you started.
PCA-ADMIN> show Job id=eae66f69-7b99-420e-b324-7d8964b2202b Data: Id = eae66f69-7b99-420e-b324-7d8964b2202b Type = Job Associated Work Request Id = ec0f39df-6256-4c4c-a839-0d00a8f326dc Done = true Name = CREATE_TYPE Progress Message = DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1. Successfully created [sw1] plan on STANDBY for DR operation [switchover] for config [6e797d8b-7245-4d49-8e68-bf67f2d53041] Run State = Succeeded Transcript = Created job CREATE_TYPE Username = admin WorkItemIds 1 = id:5ca6d187-e01a-40e2-bc97-3193a9a88742 type:WorkItem name:
-
To display a list of existing DR plans for a DR configuration, use the
drListPlan
command.PCA-ADMIN> drListPlan drConfigId=6e797d8b-7245-4d49-8e68-bf67f2d53041 Data: id Plan Name Operation -- --------- --------- 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1 sw1 switchover 6e797d8b-7245-4d49-8e68-bf67f2d53041::fo1 fo1 failover 6e797d8b-7245-4d49-8e68-bf67f2d53041::pfo1 pfo1 postfailover
-
To display the status and details of a DR plan, use the
show DrPlan
command.PCA-ADMIN> show DrPlan id=6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1 Data: Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1 Type = DrPlan Plan Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1 Plan Name = sw1 Config Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041 Operation = switchover Steps 1 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::precheck Steps 1 - Step Name = PRECHECK Steps 1 - Enabled = true Steps 1 - Last Status = norun Steps 1 - Command = None Steps 1 - Check Only = true Steps 2 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::role_reversal_precheck Steps 2 - Step Name = ROLE_REVERSAL_PRECHECK Steps 2 - Enabled = true Steps 2 - Last Status = norun Steps 2 - Command = None Steps 2 - Check Only = true Steps 3 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::stop_primary Steps 3 - Step Name = STOP_PRIMARY Steps 3 - Enabled = true Steps 3 - Last Status = norun Steps 3 - Command = None Steps 3 - Check Only = false Steps 4 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::role_reversal Steps 4 - Step Name = ROLE_REVERSAL Steps 4 - Enabled = true Steps 4 - Last Status = norun Steps 4 - Command = None Steps 4 - Check Only = false Steps 5 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::start_standby Steps 5 - Step Name = START_STANDBY Steps 5 - Enabled = true Steps 5 - Last Status = norun Steps 5 - Command = None Steps 5 - Check Only = false Steps 6 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::cleanup_primary Steps 6 - Step Name = CLEANUP_PRIMARY Steps 6 - Enabled = true Steps 6 - Last Status = norun Steps 6 - Command = None Steps 6 - Check Only = false Steps 7 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::post_config Steps 7 - Step Name = POST_CONFIG Steps 7 - Enabled = true Steps 7 - Last Status = norun Steps 7 - Command = None Steps 7 - Check Only = false
-
- Using the Service Web UI
-
-
Under Disaster Recovery Service, open the DR Configurations page. In the table, click the configuration to which you want to add a DR plan. The DR Configuration detail page appears.
-
In the Resources section, click Plans.
On the right hand side of the Plans box, click Add Plan.
-
In the Create DR Plan window, enter the following information in the respective fields:
-
Plan Name: Enter a name for this DR plan.
-
Operation: Select a standard operation: switchover, failover, postfailover.
-
Steps: Leave blank if you want the default steps for the selected operation. Otherwise, specify the steps to add to the DR plan. The options are: 'precheck', 'role_reversal_precheck', 'stop_primary', 'role_reversal', 'start_standby', 'cleanup_primary', 'post_config'.
-
Dr Plan Data File: Used for migrating first-generation DR plans. Leave blank.
-
Configurations To Migrate: Used for migrating first-generation DR plans. Leave blank.
-
-
Click Create DR Plan. A DR job is started. When it completes successfully, the DR plan appears in the resources table.
To track progress, under Disaster Recovery Service, select Jobs. The Jobs table reports the status of each job. Click a record in the table to display the job details.
-
Repeat these steps to add all the required DR plans.
-
Optionally, click a DR plan name to display its detail page.
-
Changing a DR Plan
A DR plan has a complex structure with many parameters, which makes it difficult to update from the command line in particular. Instead, you can change the individual steps that make up the DR plan. See Customizing the Steps in a DR Plan.
Deleting a DR Plan
You can remove a DR plan if it is no longer required. To delete all plans associated with a DR configuration at once, use the command drDeleteAllPlans.
- Using the Service CLI
-
-
Look up the ID of the DR plan you want to delete (
drListPlan
). -
Remove the selected DR plan with the
delete DrPlan
command.PCA-ADMIN> delete DrPlan id=6e797d8b-7245-4d49-8e68-bf67f2d53041::fo2 JobId: 603d480f-1e0f-4229-b596-aaaf8588e682 Data: DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::fo2. Successfully started job for DR Plan delete for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041
-
Use the job ID to check the status of the operation you started.
PCA-ADMIN> show Job id=603d480f-1e0f-4229-b596-aaaf8588e682 Data: Id = 603d480f-1e0f-4229-b596-aaaf8588e682 Type = Job Associated Work Request Id = 391a0799-235a-4b26-aa99-4b5dd14ba19a Done = true Name = DELETE_TYPE Progress Message = DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::fo2. Successfully deleted [fo2] plan on STANDBY for config [6e797d8b-7245-4d49-8e68-bf67f2d53041] Run State = Succeeded Transcript = Created job DELETE_TYPE Username = admin WorkItemIds 1 = id:86d0bd13-5f9c-4513-9404-60d8980b2243 type:WorkItem name:
-
- Using the Service Web UI
-
-
Under Disaster Recovery Service, open the DR Configurations page. In the table, click the configuration for which you want to delete a DR plan. The DR Configuration detail page appears.
-
In the Resources section, click Plans. All existing DR plans are displayed in the table.
-
In the Actions column, open the quick menu (3 dots) and click Delete.
Alternatively, click the DR plan name to display its detail page. In the top-right corner, click Delete.
-
When prompted, click Confirm. A DR job is started. When it completes successfully, the DR plan is removed from the table.
To track progress, under Disaster Recovery Service, select Jobs. The Jobs table reports the status of each job. Click a record in the table to display the job details.
-
Repeat until all obsolete DR plans have been removed.
-
Deleting All DR Plans
Instead of deleting DR plans one by one, you can delete all plans associated with a DR configuration at once.
- Using the Service CLI
-
-
Look up the ID of the DR configuration for which you want to delete all plans (
drGetConfigs
). -
Remove the selected DR plan with the
drdeleteallPlans
command.PCA-ADMIN> drdeleteallPlans drConfigId=6e797d8b-7245-4d49-8e68-bf67f2d53041 JobId: b595dd62-8046-4ca7-90a0-dcbbf084e663 Data: DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::all_plans. Successfully started job for DR Plan delete for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041
-
Use the job ID to check the status of the operation you started.
PCA-ADMIN> show Job id=b595dd62-8046-4ca7-90a0-dcbbf084e663 Data: Id = b595dd62-8046-4ca7-90a0-dcbbf084e663 Type = Job Associated Work Request Id = d7cfb184-4e38-469d-b189-bb809386f5d4 Done = true Name = DELETE_TYPE Progress Message = DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::all_plans. Successfully deleted [all_plans] plan on STANDBY for config [6e797d8b-7245-4d49-8e68-bf67f2d53041] Run State = Succeeded Transcript = Created job DELETE_TYPE Username = admin WorkItemIds 1 = id:6cfec1c9-4a72-492d-a414-38bf4df6cf3a type:WorkItem name:
-
- Using the Service Web UI
-
-
Under Disaster Recovery Service, open the DR Configurations page. In the table, click the configuration for which you want to delete a DR plan. The DR Configuration detail page appears.
-
In the Resources section, click Plans. All existing DR plans are displayed in the table.
Verify that all plans in the table should be deleted.
-
In the top-right corner, select Controls, then click Delete All Plans.
-
When prompted, click Confirm. A DR job is started. When it completes successfully, all DR plan are removed from the table.
To track progress, under Disaster Recovery Service, select Jobs. The Jobs table reports the status of each job. Click a record in the table to display the job details.
-
Customizing the Steps in a DR Plan
Customizing DR plan steps is the most convenient way to change a DR plan with default configuration. The administrator can enable or disable steps, configure steps to run in check-only mode, insert steps, and remove steps.
Note:
Custom steps and custom commands are not available in appliance software version 3.0.2-b1261765.
Changing the Properties of a DR Plan Step
You can enable or disable an existing DR plan step, and decide whether it should be performed in check-only mode.
- Using the Service CLI
-
-
Look up the ID of the DR plan step you want to change (
show DrPlan
). If you don't have the DR plan ID, look it up using the commandsdrGetConfigs
anddrListPlan
. -
Change the DR plan step with the
edit DrPlanStep
command. These are the properties you can change in order to affect how a DR plan is executed:-
enabled=[True|False]
– determines whether this step is performed or not -
checkOnly=[True|False]
– determines whether this step is performed in check-only mode -
command=[string]
– (NOT available) specifies a custom command to be run as part of this stepFor standard DR plan operations, the command parameter must be set to
None
.
PCA-ADMIN> edit DrPlanStep id=6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::precheck checkOnly=False JobId: d281141c-c388-490e-b038-239598488bc6 Data: DrPlanStep id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::precheck. Successfully started job for DR Plan Step update for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041
-
-
Use the job ID to check the status of the operation you started.
PCA-ADMIN> show Job id=d281141c-c388-490e-b038-239598488bc6 Data: Id = d281141c-c388-490e-b038-239598488bc6 Type = Job Associated Work Request Id = 8f40d0a3-b2ac-4742-bed8-70beb26d669d Done = true Name = MODIFY_TYPE Progress Message = DrPlanStep id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::precheck. Successfully updated plan step [precheck] in DR Plan [sw1] for config [6e797d8b-7245-4d49-8e68-bf67f2d53041] Transcript = Created job MODIFY_TYPE Username = admin WorkItemIds 1 = id:92a94a5e-4773-4ba8-ac4b-990496c5c2f9 type:WorkItem name:
-
To display the status and updated details of a DR plan step, use the
show DrPlanStep
command.PCA-ADMIN> show DrPlanStep id=6e797d8b-7245-4d49-8e68-bf67f2d53041::fo1::precheck Data: Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::fo1::precheck Type = DrPlanStep Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::fo1::precheck Step Name = PRECHECK Config Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041 Plan Name = fo1 Enabled = true Last Status = norun Command = None Check Only = false
-
- Using the Service Web UI
-
-
Under Disaster Recovery Service, open the DR Configurations page. In the table, click the configuration for which you want to modify a DR plan. The DR Configuration detail page appears.
-
In the Resources section, click Plans. Click the name of the plan for which you want to edit the steps. The DR Plan detail page appears. The Resources section displays all steps in the plan.
-
In the Actions column, open the quick menu (3 dots) for the step you want to change, and click Edit.
-
In the Update DR Plan Step window, edit the properties as needed:
-
Step Name: Do not change the selected step.
-
Enabled: Select Yes or No to determine whether this step is performed or not during plan execution.
-
Insert Type: Do not change. This field is used only for adding a new step.
-
Insert Location: Do not change. This field is used only for adding a new step.
-
Command: (NOT available.) Specify a custom command to be run as part of this step.
For standard DR plan operations, this field must remain empty. Custom commands only apply to custom DR plan steps.
-
Check Only: Select Yes or No to determine whether this step is performed or not in check-only mode.
-
-
Click Update DR Plan Step to apply your changes.
-
Inserting a DR Plan Step
Extra steps can be added to a DR plan.
- Using the Service CLI
-
-
Gather the information that you need to run the command:
-
DR configuration ID (
drGetConfigs
) -
DR plan name (
drListPlan
) -
name of the step before or after which the new step must be inserted (
show DrPlan
)
-
-
Add a step to the DR plan with the
create DrPlanStep
command.Note the mandatory and optional parameters. If the
insertType
andinsertLocation
parameters are not provided, the new step is added as the final step of the DR plan.Syntax (entered on a single line):
create DrPlanStep drConfigId=<DR_configuration_id> planName=<DR_plan_name> stepName=<DR_plan_new_step_name> [optional:] insertType=[BEFORE|AFTER] insertLocation=<DR_plan_existing_step_name> enabled=[True|False] checkOnly=[True|False] command=<execution_path>
Note:
Custom steps and custom commands are not available in appliance software version 3.0.2-b1261765.
-
Do not use the
command
parameter. -
Select a step name from this list:
precheck, role_reversal_precheck, stop_primary, role_reversal, start_standby, cleanup_primary, post_config
.
Example:
PCA-ADMIN> create DrPlanStep drConfigId=6e797d8b-7245-4d49-8e68-bf67f2d53041 planName=sw1 stepName=ROLE_REVERSAL insertType=BEFORE insertLocation=START_STANDBY JobId: 7a162b6c-1ddc-410c-b27a-0996fb2d26df Data: DrPlanStep id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::role_reversal. Successfully started job for DR Plan Step Create for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041
-
-
Use the job ID to check the status of the operation you started.
PCA-ADMIN> show Job id=7a162b6c-1ddc-410c-b27a-0996fb2d26df Data: Id = 7a162b6c-1ddc-410c-b27a-0996fb2d26df Type = Job Associated Work Request Id = a4c76ef9-f91d-402e-986f-9795738fb429 Done = true Name = CREATE_TYPE Progress Message = Successfully added step [role_reversal] in plan [sw1] for config [6e797d8b-7245-4d49-8e68-bf67f2d53041] Run State = Succeeded Transcript = Created job CREATE_TYPE Username = admin WorkItemIds 1 = id:5c376fae-f281-4f7a-984f-b996b19d367d type:WorkItem name:
-
Repeat the
create DrPlanStep
command to add all the required steps to the DR plan. -
To display the status and updated details of a DR plan, use the
show DrPlan
command.PCA-ADMIN> show DrPlan id=6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1 Data: Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1 Type = DrPlan Plan Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1 Plan Name = sw1 Config Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041 Operation = switchover Steps 1 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::precheck Steps 1 - Step Name = PRECHECK Steps 1 - Enabled = true Steps 1 - Last Status = norun Steps 1 - Command = None Steps 1 - Check Only = true Steps 2 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::role_reversal_precheck Steps 2 - Step Name = ROLE_REVERSAL_PRECHECK Steps 2 - Enabled = true Steps 2 - Last Status = norun Steps 2 - Command = None Steps 2 - Check Only = true Steps 3 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::stop_primary Steps 3 - Step Name = STOP_PRIMARY Steps 3 - Enabled = true Steps 3 - Last Status = norun Steps 3 - Command = None Steps 3 - Check Only = false Steps 4 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::role_reversal Steps 4 - Step Name = ROLE_REVERSAL Steps 4 - Enabled = true Steps 4 - Last Status = norun Steps 4 - Command = None Steps 4 - Check Only = false Steps 5 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::start_standby Steps 5 - Step Name = START_STANDBY Steps 5 - Enabled = true Steps 5 - Last Status = norun Steps 5 - Command = None Steps 5 - Check Only = false Steps 6 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::cleanup_primary Steps 6 - Step Name = CLEANUP_PRIMARY Steps 6 - Enabled = true Steps 6 - Last Status = norun Steps 6 - Command = None Steps 6 - Check Only = false Steps 7 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::post_config Steps 7 - Step Name = POST_CONFIG Steps 7 - Enabled = true Steps 7 - Last Status = norun Steps 7 - Command = None Steps 7 - Check Only = false
-
- Using the Service Web UI
-
-
Under Disaster Recovery Service, open the DR Configurations page. In the table, click the configuration for which you want to modify a DR plan. The DR Configuration detail page appears.
-
In the Resources section, click Plans. Click the name of the plan to which you want to add one or more steps. The DR Plan detail page appears. The Resources section displays all steps in the plan.
-
In the Resources section of the DR Plan detail page, on the right hand side of the Steps box, click Add Step.
-
In the Add DR Plan Step window, enter the following information in the respective fields:
-
Step Name: Select a standard step from the list.
-
Enabled: Select Yes or No to determine whether this step is performed or not during plan execution.
-
Insert Type: Select Before or After the insert location.
-
Insert Location: Select the existing step before or after which you want the new step to be inserted.
-
Command: (NOT available.) Specify a custom command to be run as part of this step.
For standard DR plan operations, this field must remain empty.
-
Check Only: Select Yes or No to determine whether this step is performed or not in check-only mode.
-
-
Click Add DR Plan Step to insert this new step in the selected location.
-
Repeat until all the required DR plan steps have been added.
-
Deleting a DR Plan Step
Steps can be removed from a DR plan.
- Using the Service CLI
-
-
Look up the ID of the DR plan step you want to delete (
show DrPlan
). If you don't have the DR plan ID, look it up using the commandsdrGetConfigs
anddrListPlan
. -
Delete the DR plan step with the
delete DrPlanStep
command.PCA-ADMIN> delete DrPlanStep id=6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::precheck JobId: c75a4c71-0525-40b7-9618-c2a4e8fcb051 Data: DrPlanStep id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::precheck. Successfully started job for DR Plan Step delete for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041
-
Use the job ID to check the status of the operation you started.
PCA-ADMIN> show Job id=c75a4c71-0525-40b7-9618-c2a4e8fcb051 Data: Id = c75a4c71-0525-40b7-9618-c2a4e8fcb051 Type = Job Associated Work Request Id = d7bd3873-e5a0-4437-b6c2-8ca0cce83953 Done = true Name = DELETE_TYPE Progress Message = DrPlanStep id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::precheck. Successfully deleted step [precheck] in plan [sw1] for config [6e797d8b-7245-4d49-8e68-bf67f2d53041] Run State = Succeeded Transcript = Created job DELETE_TYPE Username = admin WorkItemIds 1 = id:b462883b-6e6e-49b6-b455-1ceb54a5e2e3 type:WorkItem name:
-
Repeat the
delete DrPlanStep
command to remove all the required steps from the DR plan. -
To display the status and updated details of a DR plan, use the
show DrPlan
command.PCA-ADMIN> show DrPlan id=6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1 Data: Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1 Type = DrPlan Plan Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1 Plan Name = sw1 Config Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041 Operation = switchover Steps 1 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::stop_primary Steps 1 - Step Name = STOP_PRIMARY Steps 1 - Enabled = true Steps 1 - Last Status = norun Steps 1 - Command = None Steps 1 - Check Only = false Steps 2 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::role_reversal Steps 2 - Step Name = ROLE_REVERSAL Steps 2 - Enabled = true Steps 2 - Last Status = norun Steps 2 - Command = None Steps 2 - Check Only = false Steps 3 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::start_standby Steps 3 - Step Name = START_STANDBY Steps 3 - Enabled = true Steps 3 - Last Status = norun Steps 3 - Command = None Steps 3 - Check Only = false Steps 4 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::cleanup_primary Steps 4 - Step Name = CLEANUP_PRIMARY Steps 4 - Enabled = true Steps 4 - Last Status = norun Steps 4 - Command = None Steps 4 - Check Only = false Steps 5 - Step Id = 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1::post_config Steps 5 - Step Name = POST_CONFIG Steps 5 - Enabled = true Steps 5 - Last Status = norun Steps 5 - Command = None Steps 5 - Check Only = false
-
- Using the Service Web UI
-
-
Under Disaster Recovery Service, open the DR Configurations page. In the table, click the configuration for which you want to modify a DR plan. The DR Configuration detail page appears.
-
In the Resources section, click Plans. Click the name of the plan for which you want to edit the steps. The DR Plan detail page appears. The Resources section displays all steps in the plan.
-
In the Actions column, open the quick menu (3 dots) for the step you want to remove, and click Delete.
When prompted, click Confirm.
-
Repeat until all obsolete DR plan steps have been removed.
-
Executing a DR Plan
A DR plan can be executed from either the standby or the primary Private Cloud Appliance. However, in the case of a failover, the primary rack is down, so a failover plan is always executed from the standby system.
A switchover can be performed for the purpose of testing the disaster recovery setup, or when extensive maintenance is required on the primary system. To return both appliances to their normal working state after a failover, a postfailover plan is executed on each system when the primary is back online. The switchover plan has postfailover steps built in, so it does not require an additional run of the postfailover plan.
As a result of executing a DR plan, resources are moved between peered systems and the primary system changes. Those resources are not automatically moved back to their original host system. To move resources back to their original environment, you must perform another switchover for the relevant DR configuration(s).
Performing a Switchover
A switchover allows the administrator to move resources away from a system so it can be taken offline, for example in case of planned maintenance. A (second) switchover is also performed to move resources back to their original host system, after they were impacted by a failover or switchover.
- Using the Service CLI
-
-
Look up the ID of the switchover DR plan you want to execute. Use
drGetConfigs
to find the DR configuration, and display its associated DR plans usingdrListPlan
. -
From the primary or standby appliance, execute the switchover DR plan with the
drExecutePlan
command.Note:
To run the command in check-only mode, add the parameter
checkOnly=True
. Only the DR plan steps enabled for check-only mode will be performed.PCA-ADMIN> drExecutePlan planId=6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1 JobId: 92b4acc2-2dff-492c-9ba2-0a2ac058baa5 Data: DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1. Successfully started job for DR Plan Execute for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041, plan_name sw1
-
Use the job ID to check the status of the operation you started.
PCA-ADMIN> show Job id=92b4acc2-2dff-492c-9ba2-0a2ac058baa5 Data: Id = 92b4acc2-2dff-492c-9ba2-0a2ac058baa5 Type = Job Associated Work Request Id = c6cca56c-a1cc-421c-9ded-acf0e7cd9da2 Done = false Name = OPERATION-EXECUTE_DR_PLAN Progress Message = DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1. Successfully started job for DR Plan Execute for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041, plan_name sw1 Run State = Active Transcript = Created job OPERATION Username = admin WorkItemIds 1 = id:e06881fc-ea57-4835-bb86-e1244d3787c3 type:WorkItem name:
-
Ensure that the job completes successfully.
PCA-ADMIN> show Job id=92b4acc2-2dff-492c-9ba2-0a2ac058baa5 Data: Id = 92b4acc2-2dff-492c-9ba2-0a2ac058baa5 Type = Job Associated Work Request Id = c6cca56c-a1cc-421c-9ded-acf0e7cd9da2 Done = true Name = OPERATION-EXECUTE_DR_PLAN Progress Message = DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1. DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::sw1. drexecuteplan succeeded for config [6e797d8b-7245-4d49-8e68-bf67f2d53041] Operation: [switchover] plan_name: [sw1]. Response: [Successfully completed checks for switchover for DR config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041. Plan Execution Status: [precheck : pass , role_reversal_precheck : pass , stop_primary : norun , role_reversal : norun , start_standby : norun , cleanup_primary : norun , post_config : norun , ]] Run State = Succeeded Transcript = Created job OPERATION Username = admin WorkItemIds 1 = id:e06881fc-ea57-4835-bb86-e1244d3787c3 type:WorkItem name:
After successful completion, all instances included in the DR configuration have been recovered and are running on the standby appliance.
-
- Using the Service Web UI
-
-
Under Disaster Recovery Service, open the DR Configurations page. In the table, click the configuration for which you want to perform a switchover. The DR Configuration detail page appears.
-
In the Resources section, click Plans.
-
In the Actions column, open the quick menu (3 dots) for the switchover plan of your choice, and click Execute Plan.
Alternatively, click the DR plan name to display its detail page. In the top-right corner, click Execute Plan.
-
When prompted, choose whether to execute the full plan or a subset of the steps in check-only mode.
Click Confirm. A DR job is started. When it completes successfully, all steps in the switchover DR plan have been performed as expected.
To track progress, under Disaster Recovery Service, select Jobs. The Jobs table reports the status of each job. Click a record in the table to display the job details.
After successful completion, all instances included in the DR configuration have been recovered and are running on the standby appliance.
-
Performing a Failover
The native DR service does not provide automated failover. An administrator must confirm that the primary appliance is down, and execute the failover plan from the standby appliance. A failover is meant to allow continuation of service when the primary system experiences an outage.
Note:
When one appliance is down, the peer rack reports a fault with a name containing
"peerconnect" and the rack serial number. Use the Service CLI to check the fault list (list
fault <parameters>
) and display the details of the peer connection problem.
For example:
PCA-ADMIN> show fault id=57701191-5764-480b-826c-38c4b1970dde Data: Cause = 1742XC3024 : network is not in a CONNECTED state: CONNECTING Action = Please contact customer support for solution Health Exporter = peerconnect-checker Diagnosing Source = peer connect health checker Faulted Component Type = SOFTWARE Description = 1749XC302P-- 1742XC3024 : network is not in a CONNECTED state: CONNECTING Name = 1749XC302P--PCA-8000-UY--peerconnect
- Using the Service CLI
-
-
Look up the ID of the failover DR plan you need to execute. Use
drGetConfigs
to find the DR configuration, and display its associated DR plans usingdrListPlan
. -
From the standby appliance, execute the failover DR plan with the
drExecutePlan
command.Note:
To run the command in check-only mode, add the parameter
checkOnly=True
. Only the DR plan steps enabled for check-only mode will be performed.PCA-ADMIN> drExecutePlan planId=6e797d8b-7245-4d49-8e68-bf67f2d53041::fo1 JobId: 49521287-c148-4791-9626-13190fce3d1d Data: DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::fo1. Successfully started job for DR Plan Execute for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041, plan_name fo1
-
Use the job ID to check the status of the operation you started.
PCA-ADMIN> show Job id=49521287-c148-4791-9626-13190fce3d1d Data: Id = 49521287-c148-4791-9626-13190fce3d1d Type = Job Associated Work Request Id = c8e3b554-a3ef-4e9b-a52c-c9a518f70974 Done = false Name = OPERATION-EXECUTE_DR_PLAN Progress Message = DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::fo1. Successfully started job for DR Plan Execute for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041, plan_name fo1 Run State = Active Transcript = Created job OPERATION Username = admin WorkItemIds 1 = id:d7a09483-ef2e-4e03-81bb-fed5ee661428 type:WorkItem name:
-
Ensure that the job completes successfully.
PCA-ADMIN> show Job id=49521287-c148-4791-9626-13190fce3d1d Data: Id = 49521287-c148-4791-9626-13190fce3d1d Type = Job Associated Work Request Id = c8e3b554-a3ef-4e9b-a52c-c9a518f70974 Done = true Name = OPERATION-EXECUTE_DR_PLAN Progress Message = DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::fo1. DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::fo1. drexecuteplan succeeded for config [6e797d8b-7245-4d49-8e68-bf67f2d53041] Operation: [failover] plan_name: [fo1]. Response: [Successfully completed checks for failover for DR config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041. Plan Execution Status: [precheck : pass , role_reversal_precheck : pass , role_reversal : pass , start_standby : pass , ]] Run State = Succeeded Transcript = Created job OPERATION Username = admin WorkItemIds 1 = id:d7a09483-ef2e-4e03-81bb-fed5ee661428 type:WorkItem name:
After successful completion, all instances included in the DR configuration have been recovered and are running on the standby appliance.
-
- Using the Service Web UI
-
-
Under Disaster Recovery Service, open the DR Configurations page. In the table, click the configuration for which you want to perform a switchover. The DR Configuration detail page appears.
-
In the Resources section, click Plans.
-
In the Actions column, open the quick menu (3 dots) for the failover plan of your choice, and click Execute Plan.
Alternatively, click the DR plan name to display its detail page. In the top-right corner, click Execute Plan.
-
When prompted, choose whether to execute the full plan or a subset of the steps in check-only mode.
Click Confirm. A DR job is started. When it completes successfully, all steps in the switchover DR plan have been performed as expected.
To track progress, under Disaster Recovery Service, select Jobs. The Jobs table reports the status of each job. Click a record in the table to display the job details.
After successful completion, all instances included in the DR configuration have been recovered and are running on the standby appliance.
-
Performing Postfailover Operations
A postfailover is performed after a failover, when the system that experienced an outage comes back online. The plan can be executed from either of the peered systems. During postfailover, the DR configuration is cleaned up on the primary system that went down. The original standby system becomes the primary for the resources covered by the DR configuration, using the original primary as the new target for DR data replication.
- Using the Service CLI
-
-
After a failover, confirm that the primary appliance is back online and in healthy condition.
Ensure that the peering status is active and replication is enabled. Neither rack should report an active fault with a name containing "peerconnect". (Check with Service CLI command
list fault
.) -
Look up the ID of the postfailover DR plan you want to execute. Use
drGetConfigs
to find the DR configuration, and display its associated DR plans usingdrListPlan
. -
From the primary or standby appliance, execute the postfailover DR plan with the
drExecutePlan
command.Note:
For postfailover operations, the check-only mode does not apply.
PCA-ADMIN> drExecutePlan planId=6e797d8b-7245-4d49-8e68-bf67f2d53041::pfo1 JobId: 56d040ba-30a6-4bea-b924-78ebabed2626 Data: DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::pfo1. Successfully started job for DR Plan Execute for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041, plan_name pfo1
-
Use the job ID to check the status of the operation you started.
PCA-ADMIN> show Job id=56d040ba-30a6-4bea-b924-78ebabed2626 Data: Id = 56d040ba-30a6-4bea-b924-78ebabed2626 Type = Job Associated Work Request Id = b4ad564b-e385-4688-94ff-11bf5267d72e Done = false Name = OPERATION-EXECUTE_DR_PLAN Progress Message = DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::pfo1. Successfully started job for DR Plan Execute for config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041, plan_name pfo1 Run State = Active Transcript = Created job OPERATION Username = admin WorkItemIds 1 = id:2e4db010-239e-41a1-aa0d-cb97167c64fc type:WorkItem name:
-
Ensure that the job completes successfully.
PCA-ADMIN> show Job id=56d040ba-30a6-4bea-b924-78ebabed2626 Data: Id = 56d040ba-30a6-4bea-b924-78ebabed2626 Type = Job Associated Work Request Id = b4ad564b-e385-4688-94ff-11bf5267d72e Done = true Name = OPERATION-EXECUTE_DR_PLAN Progress Message = DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::pfo1. DrPlan id: 6e797d8b-7245-4d49-8e68-bf67f2d53041::pfo1. drexecuteplan succeeded for config [6e797d8b-7245-4d49-8e68-bf67f2d53041] Operation: [postfailover] plan_name: [pfo1]. Response: [Successfully completed checks for postfailover for DR config_id 6e797d8b-7245-4d49-8e68-bf67f2d53041. Plan Execution Status: [stop_primary : pass , cleanup_primary : pass , post_config : pass , ]] Run State = Succeeded Transcript = Created job OPERATION Username = admin WorkItemIds 1 = id:2e4db010-239e-41a1-aa0d-cb97167c64fc type:WorkItem name:
After successful completion, all instances impacted by the switchover or failover have been restored and are running on the appliance where they were hosted before.
-
- Using the Service Web UI
-
-
After a failover, confirm that the primary appliance is back online and in healthy condition.
Ensure that the peering status is active and replication is enabled. Neither rack should report an active fault with a name containing "peerconnect". (Display active faults in the Service Web UI.)
-
Under Disaster Recovery Service, open the DR Configurations page. In the table, click the configuration for which you want to perform postfailover operations. The DR Configuration detail page appears.
-
In the Resources section, click Plans.
-
In the Actions column, open the quick menu (3 dots) for the postfailover plan of your choice, and click Execute Plan.
Alternatively, click the DR plan name to display its detail page. In the top-right corner, click Execute Plan.
-
When prompted, click Confirm.
Note:
For postfailover operations, the check-only mode does not apply.
A DR job is started. When it completes successfully, all steps in the postfailover DR plan have been performed as expected.
To track progress, under Disaster Recovery Service, select Jobs. The Jobs table reports the status of each job. Click a record in the table to display the job details.
When the job has completed successfully, all instances impacted by the switchover or failover have been restored and are running on the appliance where they were hosted before.
-