Terminating and Replacing Worker Nodes

Find out how to terminate and replace a worker node in a Kubernetes cluster that you've created using Kubernetes Engine (OKE).

Note

You can only cycle nodes to terminate and replace worker nodes when using enhanced clusters. See Working with Enhanced Clusters and Basic Clusters.

You can cycle nodes to terminate and replace nodes with both virtual machine shapes and bare metal shapes.

You can cycle nodes to terminate and replace managed nodes.

Sometimes, terminating and replacing managed nodes is the best way to resolve an issue with the compute instances hosting the nodes. This is particularly true where an issue can be resolved simply by terminating an existing instance and replacing it with a new instance that has either the same properties, or different properties derived from changed node pool properties (such as a changed host OS or a changed compute shape). For example:

To address any configuration drift that might have occurred since the instance was originally launched.
To address any underlying hardware faults.

Using Kubernetes Engine, you can terminate and replace the compute instances hosting managed nodes in the following ways:

You can cycle the node pool containing the managed nodes, and select the Replace nodes option, as described in this section.
You can cycle and replace specific managed nodes, as described in this section.
You can delete a specific managed node. Provided that you do not indicate that you want node deletion to scale down the node pool, the node that you delete is replaced with a new node (see Deleting Worker Nodes).

When you cycle a managed node to terminate and replace it, Kubernetes Engine automatically cordons and drains the worker node before terminating it. The compute instance hosting the managed node is terminated and a new instance is created. Note that the new instance has a new OCID and network address.
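The order of operations described above can be sketched with a small in-memory model. This is only an illustration, not how Kubernetes Engine is implemented: the Node class, the cycle_node function, and the generated instance identifiers are all hypothetical stand-ins (a real instance would receive a new OCID and network address from the service).

```python
from dataclasses import dataclass, field
from itertools import count

# Hypothetical stand-in for the identifiers assigned to new instances.
_instance_ids = count(1)


@dataclass(eq=False)  # compare nodes by identity, not by field values
class Node:
    pods: list = field(default_factory=list)
    schedulable: bool = True
    instance_id: str = field(
        default_factory=lambda: f"instance-{next(_instance_ids)}"
    )


def cycle_node(pool: list, old: Node):
    """Cordon, drain, terminate, and replace one node in a toy node pool.

    Returns the replacement node and the list of evicted pods.
    """
    # Cordon: prevent new pods from being scheduled onto the node.
    old.schedulable = False
    # Drain: evict the pods that are running on the node.
    evicted, old.pods = old.pods, []
    # Terminate: remove the instance hosting the node.
    pool.remove(old)
    # Replace: a new instance is created, with a new identifier.
    new = Node()
    pool.append(new)
    return new, evicted


pool = [Node(pods=["web", "api"]), Node()]
old_node = pool[0]
new_node, evicted_pods = cycle_node(pool, old_node)
```

In this toy model the pool size is unchanged after the call, the old node is no longer in the pool, and the replacement carries a different identifier.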

If you cycle all the managed nodes in a node pool to terminate and replace them, any updates to node pool properties are applied to all of the worker nodes in the node pool once the new instances have a Running state. Note that if you cycle an individual managed node to terminate and replace it, any updates to node pool properties are applied only to the replacement node.
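As a rough illustration of the difference between cycling a whole node pool and cycling one node, the following sketch tracks which nodes pick up changed node pool properties. The cycle function, the node names, and the property values ("os-v1", "shape-a", and so on) are invented for this example and do not correspond to real images or shapes.

```python
def cycle(nodes: dict, pool_props: dict, targets: set) -> dict:
    """Return per-node properties after cycling the nodes named in `targets`.

    Replacement nodes are created from the current node pool properties;
    nodes that are not cycled keep the properties they were launched with.
    """
    return {
        name: dict(pool_props) if name in targets else dict(props)
        for name, props in nodes.items()
    }


# Nodes launched before the node pool properties were changed.
nodes = {
    name: {"image": "os-v1", "shape": "shape-a"}
    for name in ("node-1", "node-2", "node-3")
}
# Node pool properties after an update (hypothetical values).
pool_props = {"image": "os-v2", "shape": "shape-b"}

after_single = cycle(nodes, pool_props, {"node-2"})   # one node cycled
after_pool = cycle(nodes, pool_props, set(nodes))     # whole pool cycled
```

After cycling only node-2, the other two nodes still carry the old properties; after cycling the whole pool, every node matches the node pool properties.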

As well as enabling you to perform routine worker node maintenance, terminating and replacing managed nodes can be useful when you want to:

Update managed node properties (see Updating Worker Nodes in an Existing Node Pool by Terminating and Replacing Nodes).
Upgrade the Kubernetes version running on managed nodes (see Upgrading Managed Nodes by Terminating and Replacing Nodes).

Note the following considerations when cycling to terminate and replace worker nodes:

You can select a managed node pool and cycle it to terminate and replace all the managed nodes within it. You can also cycle individual managed nodes to terminate and replace them.
You cannot cycle self-managed nodes to terminate and replace them.

Balancing service availability and cost when terminating and replacing managed nodes in node pools

When you cycle all the managed nodes in a node pool to terminate and replace them, Kubernetes Engine uses the Cordon and drain settings specified for the node pool, and follows one or both of two strategies:

Create new (additional) nodes, and then remove existing nodes: Kubernetes Engine adds an additional node (or nodes) to the node pool with updated properties. When the additional node is active, Kubernetes Engine cordons an existing node, drains the node, and removes the node from the node pool. This strategy maintains service availability, but costs more.
Remove existing nodes, and then create new nodes: Kubernetes Engine cordons an existing node (or nodes) to make it unavailable, drains the node, and removes the node from the node pool. When the node has been removed, Kubernetes Engine adds a new node to the node pool to replace the node that has been removed. This strategy costs less, but might compromise service availability.

To tailor Kubernetes Engine behavior to meet your own requirements for service availability and cost, control and balance the two strategies by specifying values for maxSurge and maxUnavailable. For more information, see Balancing Service Availability and Cost When Cycling Managed Nodes in Node Pools.
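To make the trade-off between the two strategies concrete, here is a deliberately simplified simulation. It is a sketch under stated assumptions, not the algorithm Kubernetes Engine uses: the cycle_pool function is hypothetical, max_surge and max_unavailable are treated as absolute node counts, nodes become ready instantly, and the model rejects the case where both values are zero because no node could then be replaced.

```python
def cycle_pool(size: int, max_surge: int, max_unavailable: int):
    """Simulate cycling a node pool in rounds.

    Returns (peak_node_count, lowest_node_count) seen during the run.
    A higher peak means more cost; a lower minimum means less capacity.
    """
    if max_surge == 0 and max_unavailable == 0:
        raise ValueError("max_surge and max_unavailable cannot both be 0")
    old, new = size, 0
    peak = lowest = size
    while old > 0:
        # Strategy 1: create additional nodes first, up to max_surge.
        surge = min(max_surge, old)
        new += surge
        peak = max(peak, old + new)
        # Strategy 2: remove existing nodes. Each surge node covers one
        # removal; max_unavailable allows further removals without cover.
        removed = min(old, surge + max_unavailable)
        old -= removed
        lowest = min(lowest, old + new)
        # Create replacements for nodes removed without a surge node.
        new += removed - surge
    return peak, lowest


surge_only = cycle_pool(3, max_surge=1, max_unavailable=0)
unavailable_only = cycle_pool(3, max_surge=0, max_unavailable=1)
mixed = cycle_pool(4, max_surge=1, max_unavailable=1)
```

In this model, a three-node pool cycled with max_surge=1 and max_unavailable=0 briefly grows to four nodes and never drops below three, whereas max_surge=0 and max_unavailable=1 never exceeds three nodes but drops to two. Setting both to 1 on a four-node pool peaks at five nodes and dips to three, which is the kind of balance the two settings let you choose.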

Cordoning and draining when terminating and replacing nodes

When you select a node pool and specify that you want to terminate and replace its worker nodes, Kubernetes Engine automatically cordons, drains, and terminates the existing managed nodes. Kubernetes Engine uses the Cordon and drain options specified for the node pool.

When you select an individual managed node and specify that you want to terminate and replace it, you can specify Cordon and drain options. The Cordon and drain options you specify for the managed node override the Cordon and drain options specified for the node pool.
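The precedence rule can be expressed in a few lines. This sketch assumes that options supplied for an individual node replace the node pool's options as a whole, and that the node pool's options apply otherwise; the CordonDrainOptions class and its field names are hypothetical and are not taken from the Kubernetes Engine API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class CordonDrainOptions:
    # Hypothetical fields, used only to illustrate precedence.
    grace_minutes: int      # how long to wait for pods to be evicted
    force_terminate: bool   # terminate even if eviction did not finish


def effective_options(
    pool: CordonDrainOptions,
    node: Optional[CordonDrainOptions] = None,
) -> CordonDrainOptions:
    """Return the options used when terminating and replacing a node."""
    # Node-level options, when given, override the node pool's options.
    return pool if node is None else node


pool_options = CordonDrainOptions(grace_minutes=60, force_terminate=True)
node_options = CordonDrainOptions(grace_minutes=5, force_terminate=False)

for_pool_cycle = effective_options(pool_options)
for_single_node = effective_options(pool_options, node_options)
```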

For more information, see Cordoning and Draining Managed Nodes Before Shut Down or Termination.