-
Bug
-
Resolution: Done
-
Major
-
None
-
None
Fix 2 issues
- (#253) When a timeout remediation CR is deleted, the operator currently fails to remove the associated NoExecute taints that were placed on the node during the remediation process. These taints are used to evict workloads and isolate the unhealthy node. Leaving these taints in place means the node is never fully returned to a schedulable state, leaving the node remediation incomplete even after the CR has been removed.
- (#249) SNR should allow removal of redundant remediations - a remediation of a node that no longer exists, since MDR deleted the node and triggered a provisioning of a new one.
Currently, there is a bug in the code that prevents SNR from reaching that logic; this PR fixes it. When SNR detects a remediation that is both deleted and marked as Timeout by NHC, it should remove the finalizer.