- Bug
- Resolution: Unresolved
- Normal
- None
- 4.19.z
- Quality / Stability / Reliability
- False
Description of problem:
Unreachable nodes with restrictive PodDisruptionBudgets cannot be drained and remain stuck in deletion.
Version-Release number of selected component (if applicable):
4.19.z
How reproducible:
100%
Steps to Reproduce:
1. Create a ROSA HCP cluster with its default NodePool (2 workers).
2. Stop both workers in the AWS console.
3. Observe that the nodes are marked as unhealthy by MHC and remain stuck deleting permanently due to PDBs.
Actual results:
Worker nodes can't be replaced by MHC as they are stuck deleting permanently
Expected results:
Worker nodes should be successfully replaced by MHC.
Additional info:
This appears to be a CAPI bug. I believe it was introduced by the drain refactor in CAPI commit 3232abcf3, where CAPI moved from using kubectl's drain logic to its own implementation. A must-gather is attached in the comments (contains an HCP dump and a dataplane dump).
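To illustrate why the PDBs block eviction in this reproduction, here is a minimal sketch of the budget arithmetic a PodDisruptionBudget applies. It is a simplification for illustration only and does not model CAPI's drain code. The function name and the example workload (2 replicas, minAvailable of 1) are hypothetical and are not taken from the affected cluster.

```python
def disruptions_allowed(current_healthy: int, desired_healthy: int) -> int:
    """Number of evictions a PDB permits right now (never negative).

    Simplified model: the budget is the number of healthy pods above
    the minimum the PDB requires.
    """
    return max(0, current_healthy - desired_healthy)


# Hypothetical workload: 2 replicas spread over the 2 workers,
# with a PDB that requires at least 1 healthy pod.
DESIRED_HEALTHY = 1

# Both workers running: one pod may be evicted at a time.
budget_when_running = disruptions_allowed(2, DESIRED_HEALTHY)

# Both workers stopped: no pod is healthy, so the budget is zero
# and every eviction request for these pods is refused.
budget_when_stopped = disruptions_allowed(0, DESIRED_HEALTHY)

print(budget_when_running, budget_when_stopped)
```

With both workers stopped, the budget stays at zero until a replacement pod becomes healthy somewhere else. In this reproduction there is no healthy worker left to host one, so a drain that waits on eviction may never complete.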