Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Normal
Fix Version/s: 4.13.0, 4.12.0, 4.11.0, 4.14.0
Affects Version/s: 4.9
Component/s: Documentation
Labels:
- DevToolsDocs:Triaged
- migrated_from_bz

Severity:
Moderate
Regression:
None
Story Points:
3
Sprint:
OSDOCS Sprint 233, OSDOCS Sprint 234, OSDOCS Sprint 235, OSDOCS Sprint 237, OSDOCS Sprint 238, OSDOCS Sprint 236, OSDOCS Sprint 239, OSDOCS Sprint 241, OSDOCS Sprint 243
sprint_count:
9
Architecture:

Unspecified
Release Note Text:
N/A
Release Note Type:
Release Note Not Required
Internal Whiteboard:
Target Version:

4.6.z

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Document URL:
https://docs.openshift.com/container-platform/4.9/backup_and_restore/graceful-cluster-shutdown.html

Section Number and Name:
Shutting down the cluster, Point #2

Describe the issue:
We are currently advising the customers to run a for loop against all of the Openshift nodes and forcing a Shutdown without placing them in Schedulable at false and draining them.

Suggestions for improvement:

I would recommend the following approach, since it will place all of the nodes in Schedulable at false and will also drain all of the worker nodes:

#2 Mark the nodes unschedulable before performing the pod evacuation.
```
for node in $(oc get nodes -o jsonpath='

{.items[*].metadata.name}'); do echo ${node} ; oc adm cordon ${node} ; done
```

#3 Evacuate the pods using the following method:
```
for node in $(oc get nodes -l node-role.kubernetes.io/worker -o jsonpath='{.items[*].metadata.name}

'); do echo ${node} ; oc adm drain ${node} --delete-emptydir-data --ignore-daemonsets=true --timeout=15s ; done
```

#4 Shut down all of the nodes in the cluster. You can do this from your cloud provider’s web console, or run the following loop:

```
for node in $(oc get nodes -o jsonpath='

{.items[*].metadata.name}

'); do oc debug node/${node} – chroot /host shutdown -h 1 ; done
```

Additional information:

I think this approach will provide a better outcome, since it will ensure the nodes are set to unschedulable, therefore preventing any workload to be scheduled on them so when one node is being shutdown, Kubernetes won't try to schedule workload on a node that's about to be shutdown.
I also think this could have a positive outcome on ETCD.
Thanks

links to

openshift/openshift-docs#45621: WIP OCPBUGS-9229: Graceful Shutdown improvements

Assignee:: Neal Alhadeff (Inactive)

Reporter:: Filipe Santos

QA Contact:: Min Li

Doc Contact:: Latha Sreenivasa Murthy

Contributing Groups:: Red Hat Employee

Need Info From:: Sunil Choudhary

Votes:: 0 Vote for this issue

Watchers:: 8 Start watching this issue

Created:: 2022/04/18 5:16 PM

Updated:: 2023/11/01 7:09 PM

Resolved:: 2023/11/01 7:09 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates