-
Feature Request
-
Resolution: Unresolved
-
Normal
-
None
-
None
-
False
-
None
-
False
-
Not Selected
-
-
1. Proposed Title of this Feature Request:
Enhance Safeguards Against Unintended Node Auto-Removal in OpenShift Clusters
2. What is the Nature and Description of the Request?
The request is to introduce additional safeguards and logging mechanisms within OpenShift to prevent unintended node object removals triggered by cloud controller managers (CCM) or other external factors. Specifically, this includes:
Implementing confirmation or validation steps before node removal when initiated by external systems like vCenter or other cloud providers.
Enhancing diagnostic capabilities to capture detailed logs and events related to node lifecycle operations, particularly when nodes are marked as "nonexistent" by the CCM.
3. Why Does the Customer Need This? (Business Requirements)
Unintended node removals can disrupt workloads, cause cluster instability, and impact service availability.
Enhanced logging and safeguards will provide better visibility into node lifecycle events, enabling administrators to diagnose and resolve issues promptly.
Introducing safeguards ensures that OCP remains resilient to external factors like cloud provider miscommunications, reducing the risk of operational outages.
Preventing unintended behaviors strengthens customer confidence in OCP as a reliable and robust platform for critical applications.
4. List Any Affected Packages or Components:
Cloud Controller Manager (CCM)
Node Lifecycle Controller
Additional Information: https://issues.redhat.com/browse/OCPBUGS-42841