-
Story
-
Resolution: Unresolved
-
Minor
-
None
-
False
-
-
False
-
-
After having some concerns about supporting the off action for FAR, we have created a doc on how to address that, and we agreed that we can document how to add it manually.
Add some documentation on how to troubleshoot/investigate unhealthy nodes with some manual steps.
- This kind of investigation can be done by raising a "flag"/hold a "lock" prior to fencing and after the investigation is over, see examples below.
- Add a unique taint to a node for no scheduling (e.g., medik8s.io/fence-agents-remediation-investigation=begin:NoSchedule)
- Cordon the node (similar to NMO functionality but without draining the node)
- Hold a unique lease with the node name (different than the one NHC and NMO are trying using)
This task is for FAR but is related to other remediators as SNR.
- is triggered by
-
RHWA-285 FAR: Support off Action
-
- Closed
-