5 Rolling Back cnDBTier
This chapter describes the procedures to roll back cnDBTier deployments.
Note:
- If cnDBTier is configured with a single replication channel, then rollback has to be done using single replication channel group. If cnDBTier is configured with multiple replication channel groups, then rollback must be done using multiple replication channel groups.
- db-backup-manager-svc is designed to automatically restart in case of errors. Therefore, when the backup-manager-svc encounters a temporary error during the rollback process, it may undergo several restarts. When cnDBTier reaches a stable state, the db-backup-manager-svc pod operates normally without any further restarts.
- You can roll back only one georedundant cnDBTier site at a time. If you want to roll back multiple georedundant sites, then complete the rollback of one georedundant site and then move to the next one.
- If you had performed the upgrade incrementally (from version n-2 to n-1, and then from n-1 to n), then rollback is not supported from n to n-2, or from n-1 to n-2.
- If you had modifyed the TLS certificates while upgrading the cnDBTier cluster from a TLS enabled version to another TLS enabled version, then before proceeding with the rollback procedure, perform the "Modifying cnDBTier Certificates to Establish TLS Between Georeplication Sites" procedure in Oracle Communications Cloud Native Core, cnDBTier User Guide to add the old certificates and retain the existing certificates.
- The
OCCNE_NAMESPACEvariable in the rollback procedures must be set to the cnDBTier namespace. Before running any command that contains theOCCNE_NAMESPACEvariable, ensure that you have set this variable to the cnDBTier namespace as stated in the following code block:export OCCNE_NAMESPACE=<namespace>where,
<namespace>is the cnDBTier namespace. - The namespace name
"occne-cndbtier"given in the rollback procedure is only an example. Ensure that you configure the namespace name according to your environment. - cnDBTier 24.3.1 supports Helm 3.12.3 and 3.13.2. Ensure that you upgrade Helm to a supported version.
5.1 Supported Rollback Paths
The following table provides the rollback paths that are supported by cnDBTier release 24.3.1.
Table 5-1 Supported Rollback Paths
| Source Release | Target Release |
|---|---|
| 24.3.1 | 24.3.x |
| 24.3.1 | 24.2.x |
| 24.3.1 | 24.1.x |
5.2 Prerequisites
- There must be no DDL objects created in MySQL after upgrading cnDBTier. If
there are any databases or tables, or any other DDL objects that were created after
upgrading the cluster, delete them before starting roll back.
- Run the following command to delete or drop a table created after
upgrade:
DROP TABLE table_created_after_upgrade; - Run the following command to delete or drop a database created after
upgrade:
DROP DATABASE database_created_after_upgrade;Note:
To delete other DDL objects, refer to the MySQL manual.
- Run the following command to delete or drop a table created after
upgrade:
5.3 Rolling Back cnDBTier
This section describes the procedure to roll back cnDBTier.
- Log in to the Bastion Host of the site being rolled back.
- List the history of the cnDBTier deployment:
- Run the following command to list the
namespace:
$ helm -n $OCCNE_NAMESPACE listSample output:NAME NAMESPACE REVISION UPDATED STATUS CHART APP VERSION mysql-cluster Cluster1 2 2024-02-19 20:11:30.592221615 +0000 UTC deployed occndbtier-24.3.1 24.3.1
- Run the following command to get the list of cnDBTier
deployments:
$ helm -n $OCCNE_NAMESPACE history mysql-clusterSample output:REVISION UPDATED STATUS CHART APP VERSION DESCRIPTION 1 Mon May 20 13:31:35 2024 superseded occndbtier-24.2.1 24.2.1 Install complete 2 Mon May 20 20:11:30 2024 deployed occndbtier-24.3.1 24.3.1 Upgrade complete
- Run the following command to list the
namespace:
- Before performing the rollback, run the Helm test on all the sites. There
are many scenarios where the Helm test can fail, including the health check failures
in various components of cnDBTier. If HELM test fails, diagnose and resolve the
actual issue and perform the HELM test again on the sites. Proceed further only when
the Helm test
succeeds.
$ helm test mysql-cluster --namespace ${OCCNE_NAMESPACE}Sample output:NAME: mysql-cluster LAST DEPLOYED: Mon May 20 10:22:58 2024 NAMESPACE: occne-cndbtier STATUS: deployed REVISION: 1 TEST SUITE: mysql-cluster-node-connection-test Last Started: Mon May 20 14:15:18 2024 Last Completed: Mon May 20 14:17:58 2024 Phase: Succeeded - If any of the following parameters were updated during an upgrade, then
restart the
ndbmtdpods with the--initialflag after the rollback:Note:
Perform this step only if you are upgrading from a previous version (24.2.x or 24.1.x).- DataDir
- FileSystemPath
- BackupDataDir
- FragmentLogFileSize
- InitFragmentLogFiles
- NoOfFragmentLogFiles
- NoOfFragmentLogParts
- FileSystemPathDD
- FileSystemPathDataFiles
- FileSystemPathUndoFiles
- IndexStatSaveSize
- IndexStatSaveScale
- IndexStatTriggerPct
- IndexStatTriggerScale
- IndexStatUpdateDelay
- RedoBuffer
Perform the following steps to instruct the data nodes to restart thendbmtdpods with the--initalflag after the rollback:Note:
If a setup is deployed with a prefix, then replace the pod name ofndbmtdaccordingly.- Run the following command to restart the
ndbmtdpods with--initalflag during rolling restart of data node, after the rollback:$ for x in {0..N}; do kubectl -n <cndbtier_namespace> exec -it <NDBMTD_STS_NAME_WITH_PREFIX>-$x -c db-backup-executor-svc -- bash -c "touch /var/occnedb/useinitial && chmod 777 /var/occnedb/useinitial"; done;For example, if a setup has fourndbmtdpods, use the following command:$ for x in {0..3}; do kubectl -n occne-cndbtier exec -it ndbmtd-$x -c db-backup-executor-svc -- bash -c "touch /var/occnedb/useinitial && chmod 777 /var/occnedb/useinitial"; done; - Verify if the
useiinitialfile is created for each data node in the/var/occnedbdirectory:$ for x in {0..N}; do kubectl -n <cndbtier_namespace> exec -it <NDBMTD_STS_NAME_WITH_PREFIX>-$x -c db-backup-executor-svc -- bash -c "echo ndbmtd-$x;ls -lrth /var/occnedb/ | grep -i useinitial "; done;For example:$ for x in {0..3}; do kubectl -n occne-cndbtier exec -it ndbmtd-$x -c db-backup-executor-svc -- bash -c "echo ndbmtd-$x;ls -lrth /var/occnedb/ | grep -i useinitial "; done;Sample output:ndbmtd-0 -rwxrwxrwx. 1 mysql mysql 0 Sep 23 11:56 useinitial ndbmtd-1 -rwxrwxrwx. 1 mysql mysql 0 Sep 23 11:56 useinitial ndbmtd-2 -rwxrwxrwx. 1 mysql mysql 0 Sep 23 11:56 useinitial ndbmtd-3 -rwxrwxrwx. 1 mysql mysql 0 Sep 23 11:56 useinitial
- If there is no upgrade service account configured for cnDBTier or if you are rolling
back to a previous cnDBTier, run the
pre-rollback-with-tde-to-non-tdescript:export OCCNE_NAMESPACE="occne-cndbtier" export NDB_MTD_PODS=$(kubectl -n ${OCCNE_NAMESPACE} get pods | grep 'ndbmtd' | awk '{print $1}') export NDB_MGMD_PODS=$(kubectl -n ${OCCNE_NAMESPACE} get pods | grep 'ndbmgmd' | awk '{print $1}') export NDB_STS_NAME=$(kubectl -n ${OCCNE_NAMESPACE} get sts | grep 'ndbmtd' | awk '{print $1}') occndbtier/files/hooks.sh --pre-rollback-with-tde-to-non-tde - If there is no upgrade service account configured for cnDBTier or if
you are rolling back to a previous cnDBTier release, then run the
pre-rollbackscript:export OCCNE_NAMESPACE="occne-cndbtier" export MYSQL_CONNECTIVITY_SERVICE="mysql-connectivity-service" export MYSQL_USERNAME="occneuser" export MYSQL_PASSWORD="<password for the user occneuser>" export DBTIER_REPLICATION_SVC_DATABASE="replication_info" export MYSQL_CMD="kubectl -n <namespace> exec <ndbmysqld-0/ndbappmysqld-0 pod name> -- mysql --binary-as-hex=0 --show-warnings" export NDB_MGMD_PODS="ndbmgmd-0 ndbmgmd-1" # To enable or disable password encryption , uncomment the below line and set the variable to true or false to enable or disable it. # export ENABLE_ENCRYPTION="<true/false>" occndbtier/files/hooks.sh --pre-rollback - Run the following command to roll back to the previous
version:
helm --namespace ${OCCNE_NAMESPACE} rollback $CNDBTIER_RELEASE_NAME $REVISIONWhere,$CNDBTIER_RELEASE_NAMEis the name of the cnDBTier deployment obtained from Step 3. For example,CNDBTIER_RELEASE_NAME="mysql-cluster"$REVISIONis the revision number of the release that you want to rollback to, obtained from Step 3. For example,REVISION=1.
- If there is no upgrade service account configured
for cnDBTier, run the
postrollbackscript. This script restarts the MGM pods and patchesupgradeStrategy.Note:
- Replace the values of the environment variables in the following sample with the values of your cluster.
- If prefix is used in pod names, then add the prefix to the
following environment variable for both the pod and statefulset names.
For
example:
export NDB_MGMD_PODS="<global.k8sResource.pod.prefix>-ndbmgmd-0 <global.k8sResource.pod.prefix>-ndbmgmd-1" export NDB_STS_NAME="<global.k8sResource.pod.prefix>-ndbmtd"
- Define the following variables before running the
postrollbackscript:export OCCNE_NAMESPACE="occne-cndbtier" export API_EMP_TRY_SLOTS_NODE_IDS="id=222\|id=223\|id=224\|id=225" #Ifvalues.global.ndbapp.ndb_cluster_connection_poolis greater than 1, then declare the following environment variable #export APP_CON_POOL_INGORE_NODE_IDS="id=100" export APP_CON_POOL_INGORE_NODE_IDS="id=100\|id=101\|id=102\|id=103" export NDB_MGMD_PODS="ndbmgmd-0 ndbmgmd-1" export NDB_MTD_PODS="ndbmtd-0 ndbmtd-1" export NDB_STS_NAME="ndbmtd" export API_STS_NAME="ndbmysqld" export APP_STS_NAME="ndbappmysqld" export NDBAPP_START_NODE_ID="70" export NDBAPP_REPLICA_MAX_COUNT="4" - Run the
postrollbackscript:occndbtier/files/hooks.sh --post-rollback
- Wait for the appropriate nodes to restart and reconnect to the cluster
and then run the following command to verify the cluster status from the management
pod. The pods that is restarted depends on the pods that are rolled
back.
$ kubectl -n ${OCCNE_NAMESPACE} exec -it ndbmgmd-0 -- ndb_mgm -e showSample output:Connected to Management Server at: localhost:1186 Cluster Configuration --------------------- [ndbd(NDB)] 2 node(s) id=1 @10.233.104.176 (mysql-8.4.2 ndb-8.4.2, Nodegroup: 0) id=2 @10.233.121.175 (mysql-8.4.2 ndb-8.4.2, Nodegroup: 0, *) [ndb_mgmd(MGM)] 2 node(s) id=49 @10.233.101.154 (mysql-8.4.2 ndb-8.4.2) id=50 @10.233.104.174 (mysql-8.4.2 ndb-8.4.2) [mysqld(API)] 8 node(s) id=56 @10.233.92.169 (mysql-8.4.2 ndb-8.4.2) id=57 @10.233.101.155 (mysql-8.4.2 ndb-8.4.2) id=70 @10.233.92.170 (mysql-8.4.2 ndb-8.4.2) id=71 @10.233.121.176 (mysql-8.4.2 ndb-8.4.2) id=222 (not connected, accepting connect from any host) id=223 (not connected, accepting connect from any host) id=224 (not connected, accepting connect from any host) id=225 (not connected, accepting connect from any host) - After rollback, run the Helm test on all the sites and verify if the
cnDBTier services are running
successfully:
$ helm test mysql-cluster --namespace ${OCCNE_NAMESPACE}Sample output:NAME: mysql-cluster LAST DEPLOYED: Mon May 20 10:22:58 2024 NAMESPACE: occne-cndbtier STATUS: deployed REVISION: 1 TEST SUITE: mysql-cluster-node-connection-test Last Started: Mon May 20 14:15:18 2024 Last Completed: Mon May 20 14:17:58 2024 Phase: Succeeded
5.4 Rolling Back cnDBTier from Non-TLS enabled to TLS Enabled Version
Note:
In this procedure, the cnDBTier sites are rolled back twice (in step 1 and step 3). Ensure that you follow this procedure as-is to roll back from a non-TLS version to a TLS enabled version.- Perform the Rolling Back cnDBTier procedure on all the cnDBTier sites one after the other to roll back cnDBTier to
a version where TLS mode is set to
"NONE". - After rolling back each site, run the following command on the site to
ensure that the replication is
UP:
$ kubectl -n <namespace> exec -it ndbmysqld-0 -- curl http://mysql-cluster-db-monitor-svc.<namespace>:8080/db-tier/status/replication/realtimewhere,
<namespace>is the namespace name of the of cnDBTier cluster.The value of replicationStatus in the output indicates if the local site is able to replicate data from that remote site:- "UP": Indicates that the local site is able to replicate data from that remote site.
- "DOWN": Indicates that the local site is not able to replicate data from the respective remote site.
For example, run the following command to check the georeplication status of cnDBTier cluster2 configured with other remote sites:$ kubectl -n cluster2 exec -it ndbmysqld-0 -- curl http://mysql-cluster-db-monitor-svc.cluster2:8080/db-tier/status/replication/realtimeSample output:[ { "localSiteName": "cluster2", "remoteSiteName": "cluster1", "replicationStatus": "UP", "secondsBehindRemote": 0, "replicationGroupDelay": [ { "replchannel_group_id": "1", "secondsBehindRemote": 0 } ] }, { "localSiteName": "cluster2", "remoteSiteName": "cluster3", "replicationStatus": "UP", "secondsBehindRemote": 0, "replicationGroupDelay": [ { "replchannel_group_id": "1", "secondsBehindRemote": 0 } ] }, { "localSiteName": "cluster2", "remoteSiteName": "cluster4", "replicationStatus": "UP", "secondsBehindRemote": 0, "replicationGroupDelay": [ { "replchannel_group_id": "1", "secondsBehindRemote": 0 } ] } ]In the sample output, the
replicationStatusis "UP" for thelocalSiteNamecluster2 forremotesiteNamecluster1, cluster3, and cluster4. This indicates that thelocalSiteNamecluster2 is able to replicate data fromremotesiteNamecluster1, cluster3, and cluster4. - Perform the Rolling Back cnDBTier procedure on all the cnDBTier sites one after the other to roll back cnDBTier to
a version where TLS mode is set to
"VERIFY_CA"or"VERIFY_IDENTITY". This configuration ensures that the clients use an encrypted connection and performs verification against the server CA certificate:VERIFY_CAinstructs the client to check if the server’s certificate is valid.VERIFY_IDENTITYinstructs the client to check if the server’s certificate is valid and to check if the host name used by the client, matches the identity in the server’s certificate.
- After rolling back each site, run the following command on the site to
ensure that the replication is
UP:
$ kubectl -n <namespace> exec -it ndbmysqld-0 -- curl http://mysql-cluster-db-monitor-svc.<namespace>:8080/db-tier/status/replication/realtimewhere,
<namespace>is the namespace name of the of cnDBTier cluster.The value ofreplicationStatusin the output indicates if the local site is able to replicate data from that remote site:- "UP": Indicates that the local site is able to replicate data from that remote site.
- "DOWN": Indicates that the local site is not able to replicate data from the respective remote site.
For example, see Step 2.
- Run the
hooks.shscript with the--add-ssltypeflag on all cnDBTier sites to set the type of"occnerepluser"as TLS:Note:
Update the values of the environment variables in the following code block as per your cluster.export OCCNE_NAMESPACE="occne-cndbtier" export MYSQL_CONNECTIVITY_SERVICE="mysql-connectivity-service" export MYSQL_USERNAME="occneuser" export MYSQL_PASSWORD="<password for the user occneuser>" export OCCNE_REPL_USER_NAME="occnerepluser" export MYSQL_CMD="kubectl -n <namespace> exec <ndbmysqld-0 pod name> -- mysql --binary-as-hex=0 --show-warnings" occndbtier/files/hooks.sh --add-ssltype
5.5 Rolling Back cnDBTier from TLS enabled to Non-TLS Enabled Version
Note:
In this procedure, the cnDBTier sites are rolled back twice (in step 2 and step 4). Ensure that you follow this procedure as-is to roll back from a TLS enabled version to a non-TLS version.- Run the
hooks.shscript with the-- remove-ssltypeflag on all cnDBTier sites to set the type of"occnerepluser"as Non-TLS:Note:
Update the values of the environment variables in the following code block as per your cluster.export OCCNE_NAMESPACE="occne-cndbtier" export MYSQL_CONNECTIVITY_SERVICE="mysql-connectivity-service" export MYSQL_USERNAME="occneuser" export MYSQL_PASSWORD="<password for the user occneuser>" export OCCNE_REPL_USER_NAME="occnerepluser" export MYSQL_CMD="kubectl -n <namespace> exec <ndbmysqld-0 pod name> -- mysql --binary-as-hex=0 --show-warnings" occndbtier/files/hooks.sh --remove-ssltype - Perform the Rolling Back cnDBTier procedure on all the cnDBTier sites one after the other to roll back cnDBTier to
a version where TLS mode is set to
"NONE". - After rolling back each site, run the following command on the site to
ensure that the replication is
UP:
$ kubectl -n <namespace> exec -it ndbmysqld-0 -- curl http://mysql-cluster-db-monitor-svc.<namespace>:8080/db-tier/status/replication/realtimewhere,
<namespace>is the namespace name of the of cnDBTier cluster.The value ofreplicationStatusin the output indicates if the local site is able to replicate data from that remote site:- "UP": Indicates that the local site is able to replicate data from that remote site.
- "DOWN": Indicates that the local site is not able to replicate data from the respective remote site.
For example, run the following command to check the georeplication status of cnDBTier cluster2 configured with other remote sites:$ kubectl -n cluster2 exec -it ndbmysqld-0 -- curl http://mysql-cluster-db-monitor-svc.cluster2:8080/db-tier/status/replication/realtimeSample output:[ { "localSiteName": "cluster2", "remoteSiteName": "cluster1", "replicationStatus": "UP", "secondsBehindRemote": 0, "replicationGroupDelay": [ { "replchannel_group_id": "1", "secondsBehindRemote": 0 } ] }, { "localSiteName": "cluster2", "remoteSiteName": "cluster3", "replicationStatus": "UP", "secondsBehindRemote": 0, "replicationGroupDelay": [ { "replchannel_group_id": "1", "secondsBehindRemote": 0 } ] }, { "localSiteName": "cluster2", "remoteSiteName": "cluster4", "replicationStatus": "UP", "secondsBehindRemote": 0, "replicationGroupDelay": [ { "replchannel_group_id": "1", "secondsBehindRemote": 0 } ] } ]In the sample output, the
replicationStatusis "UP" for thelocalSiteNamecluster2 forremotesiteNamecluster1, cluster3, and cluster4. This indicates that thelocalSiteNamecluster2 is able to replicate data fromremotesiteNamecluster1, cluster3, and cluster4. - Perform the Rolling Back cnDBTier procedure on all the cnDBTier sites one after the other to roll back cnDBTier to a version where TLS is disabled.
- After rolling back each site, run the following command on the site to
ensure that the replication is
UP:
$ kubectl -n <namespace> exec -it ndbmysqld-0 -- curl http://mysql-cluster-db-monitor-svc.<namespace>:8080/db-tier/status/replication/realtimewhere,
<namespace>is the namespace name of the of cnDBTier cluster.The value ofreplicationStatusin the output indicates if the local site is able to replicate data from that remote site:- "UP": Indicates that the local site is able to replicate data from that remote site.
- "DOWN": Indicates that the local site is not able to replicate data from the respective remote site.
For example, see Step 3.