How to Safely Drain a Node in Kubernetes

In this Kubernetes tutorial, you will learn how to properly drain a node using the kubectl drain command in order to prepare it for maintenance.

It is as simple as entering this command:

kubectl drain node_name

You can get the node names and other details with the kubectl get nodes command.
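
At a glance, the whole workflow looks something like the sketch below, where node_name is a placeholder for your own node:

kubectl get nodes                              # find the node you want to take down
kubectl cordon node_name                       # stop new pods from being scheduled on it
kubectl drain node_name --ignore-daemonsets    # evict the pods already running there
kubectl uncordon node_name                     # after maintenance, allow scheduling again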

However, there is more to draining nodes in Kubernetes, so let's take a detailed look at it.

Why do you need to drain nodes?

Kubernetes is designed to be fault tolerant of worker node failures.

A worker node can become unusable for different reasons: a hardware fault, a cloud provider problem, or network issues between the worker and master node. In those cases, the Kubernetes master handles the failure and reschedules the affected workloads on its own.

That doesn't mean it will always be the case, though. Sometimes you need to take a node out of service deliberately, for example for planned maintenance, and this is when you drain the node and remove all of its pods.
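
If you are unsure what state a node is in before draining it, kubectl describe gives a quick view of its conditions and recent events (node_name is a placeholder):

kubectl describe node node_name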

Draining is the process of safely evicting all the pods from a node so that the containers running in those pods terminate gracefully.
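
The graceful shutdown window for each pod comes from its terminationGracePeriodSeconds setting (30 seconds by default), which you can check before draining; pod_name below is a placeholder:

kubectl get pod pod_name -o jsonpath='{.spec.terminationGracePeriodSeconds}'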

How to properly drain nodes in Kubernetes

Let’s start with the practical demonstration.

Step 1: Mark the node as unschedulable (cordon)

To perform maintenance on a node, you should first mark it as unschedulable (cordon it) and then drain it.

First have a look at the currently running nodes:

root@kmaster-rj:~# kubectl get nodes
NAME          STATUS   ROLES    AGE   VERSION
kmaster-rj    Ready    master   44d   v1.18.8
kworker-rj1   Ready    <none>   44d   v1.18.8
kworker-rj2   Ready    <none>   44d   v1.18.8
root@kmaster-rj:~#

Look at the pods running on different nodes:

root@kmaster-rj:~# kubectl get pods -o wide
NAME                      READY   STATUS    RESTARTS   AGE     IP              NODE          NOMINATED NODE   READINESS GATES
my-dep-557548758d-gprnr   1/1     Running   1          4d23h   172.16.213.48   kworker-rj1   <none>           <none>
my-dep-557548758d-d2pmd   1/1     Running   1          4d15h   172.16.213.57   kworker-rj2   <none>           <none>
pod-delete-demo           1/1     Running   1          2d      172.16.213.56   kworker-rj1   <none>           <none>
root@kmaster-rj:~#

Now mark the node as unschedulable by running the following command:

root@kmaster-rj:~# kubectl cordon kworker-rj2
node/kworker-rj2 cordoned
root@kmaster-rj:~# 

List the nodes again:

root@kmaster-rj:~# kubectl get nodes
NAME          STATUS                     ROLES    AGE   VERSION
kmaster-rj    Ready                      master   44d   v1.18.8
kworker-rj1   Ready                      <none>   44d   v1.18.8
kworker-rj2   Ready,SchedulingDisabled   <none>   44d   v1.18.8
root@kmaster-rj:~#

Notice that the node kworker-rj2 is now marked as SchedulingDisabled.
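
Under the hood, cordoning simply sets spec.unschedulable to true on the node object, which you can confirm with, for example:

kubectl get node kworker-rj2 -o jsonpath='{.spec.unschedulable}'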

Cordoning alone does not evict the pods already running on that node. Verify the pod status:

root@kmaster-rj:~# kubectl get pods -o wide
NAME                      READY   STATUS    RESTARTS   AGE     IP              NODE          NOMINATED NODE   READINESS GATES
my-dep-557548758d-gprnr   1/1     Running   1          4d23h   172.16.213.48   kworker-rj1   <none>           <none>
my-dep-557548758d-d2pmd   1/1     Running   1          4d15h   172.16.213.57   kworker-rj2   <none>           <none>
pod-delete-demo           1/1     Running   1          2d      172.16.213.56   kworker-rj1   <none>           <none>
root@kmaster-rj:~#

You can see that the pod "my-dep-557548758d-d2pmd" is still running on the kworker-rj2 node.

Step 2: Drain the node to prepare for maintenance

Now drain the node to evict the remaining pods and prepare it for maintenance:

root@kmaster-rj:~# kubectl drain kworker-rj2 --grace-period=300 --ignore-daemonsets=true
node/kworker-rj2 already cordoned
WARNING: ignoring DaemonSet-managed Pods: kube-system/calico-node-fl8dl, kube-system/kube-proxy-95vdf
evicting pod default/my-dep-557548758d-d2pmd
pod/my-dep-557548758d-d2pmd evicted
node/kworker-rj2 evicted
root@kmaster-rj:~#

NOTE: kubectl drain cannot delete pods that are not managed by a ReplicationController, ReplicaSet, Job, DaemonSet, or StatefulSet. You need to add --force to override that, and doing so deletes those standalone pods permanently.
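
If you do need to evict such standalone pods and can accept losing them, a forced drain would look roughly like this:

kubectl drain kworker-rj2 --ignore-daemonsets --force --grace-period=300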

Now look at the pods:

root@kmaster-rj:~# kubectl get pods -o wide
NAME                      READY   STATUS    RESTARTS   AGE     IP              NODE          NOMINATED NODE   READINESS GATES
my-dep-557548758d-gprnr   1/1     Running   1          4d23h   172.16.213.48   kworker-rj1   <none>           <none>
my-dep-557548758d-dsanh   1/1     Running   0          27s     172.16.213.38   kworker-rj1   <none>           <none>
pod-delete-demo           1/1     Running   1          2d      172.16.213.56   kworker-rj1   <none>           <none>
root@kmaster-rj:~#

The pod that was running on the kworker-rj2 node was evicted from there, and a replacement pod was started on the kworker-rj1 node.
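
To double-check that only DaemonSet pods remain on the drained node, you can filter pods by node name, for example:

kubectl get pods --all-namespaces -o wide --field-selector spec.nodeName=kworker-rj2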

The node status remains the same:

root@kmaster-rj:~# kubectl get nodes
NAME          STATUS                     ROLES    AGE   VERSION
kmaster-rj    Ready                      master   44d   v1.18.8
kworker-rj1   Ready                      <none>   44d   v1.18.8
kworker-rj2   Ready,SchedulingDisabled   <none>   44d   v1.18.8
root@kmaster-rj:~#

Step 3: Uncordon the node after maintenance completes

Afterwards, run the following command to tell Kubernetes that it can resume scheduling new pods onto the node:

root@kmaster-rj:~# kubectl uncordon kworker-rj2
node/kworker-rj2 uncordoned

Verify the node status:

root@kmaster-rj:~# kubectl get nodes
NAME          STATUS   ROLES    AGE   VERSION
kmaster-rj    Ready    master   44d   v1.18.8
kworker-rj1   Ready    <none>   44d   v1.18.8
kworker-rj2   Ready    <none>   44d   v1.18.8

Node kworker-rj2 is now ready to accept new workloads again.
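
If you want to verify that the scheduler is placing pods on kworker-rj2 again, one option is to scale up the deployment from earlier (assuming it is named my-dep, as the pod names suggest) and watch where the new replica lands:

kubectl scale deployment my-dep --replicas=3
kubectl get pods -o wide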

I hope you found this quick tip about draining nodes in Kubernetes useful.
