Type: Bug
Resolution: Not a Bug
Priority: Normal
Fix Version/s: None
Affects Version/s: 4.20.z
Component/s: Two Node Fencing
Labels:
None

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:
None
Story Points:
None
Severity:
Important
Regression:
None
Epic Link:
TNF UX

Target Backport Versions:
None
Target Version:
None
Release Blocker:
None
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Impact Score:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

As a developer of OCPBUGS, I need:

To ensure that we try to restore the fencing devices after they have failed repeatedly.
This involves either running the `pcs stonith cleanup` command on the nodes when we discover that the fence device will no longer be called, or preventing Pacemaker from giving up entirely.
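
The first option could be sketched roughly as follows. This is only an illustration in Python, not the operator's actual code: the `fail_counts` mapping, the function names, and the device names are hypothetical, and the sketch assumes that Pacemaker reports an exhausted device with a fail count of `INFINITY` (treated here as 1000000). How the fail counts are collected from the cluster is deliberately left out.

```python
from typing import List, Mapping, Union

# Assumption: Pacemaker's INFINITY score is treated as 1000000.
PACEMAKER_INFINITY = 1_000_000


def stuck_fence_devices(fail_counts: Mapping[str, Union[int, str]]) -> List[str]:
    """Return the fence devices whose fail count has reached INFINITY.

    `fail_counts` is a hypothetical mapping of fence device name to its
    fail count, given either as an integer or as the string "INFINITY".
    """
    stuck = []
    for device, count in fail_counts.items():
        if isinstance(count, str):
            is_infinite = count.strip().upper() == "INFINITY"
        else:
            is_infinite = count >= PACEMAKER_INFINITY
        if is_infinite:
            stuck.append(device)
    return sorted(stuck)


def cleanup_command(device: str) -> List[str]:
    """Build the argv for cleaning up a single fence device."""
    return ["pcs", "stonith", "cleanup", device]


if __name__ == "__main__":
    # Hypothetical device names, for illustration only.
    counts = {"fence_node_a": "INFINITY", "fence_node_b": 2}
    for device in stuck_fence_devices(counts):
        print(" ".join(cleanup_command(device)))
```

Returning the command as an argument list rather than a shell string keeps device names from being interpreted by a shell if the list is later passed to a process runner.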

Acceptance Criteria

We have a mechanism merged into cluster-etcd-operator that tries to clean up the fencing resources after pulling their status (if they are marked with infinite failures), or we update the fencing resource to prevent Pacemaker from giving up entirely.
The operator is updated to mark itself degraded if we are in this state.
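
As a rough illustration of the second criterion, the sketch below maps a list of stuck fence devices to a degraded condition. It is written in Python for brevity and is not the operator's implementation: the `Condition` shape, the `FencingDegraded` type, and the reason strings are all hypothetical names chosen for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Condition:
    """Hypothetical stand-in for an operator status condition."""
    type: str
    status: str
    reason: str
    message: str


def fencing_condition(stuck_devices: List[str]) -> Condition:
    """Report degraded while any fence device is stuck at infinite failures."""
    if stuck_devices:
        names = ", ".join(sorted(stuck_devices))
        return Condition(
            type="FencingDegraded",  # hypothetical condition type
            status="True",
            reason="FenceDeviceFailuresInfinite",  # hypothetical reason
            message=f"fence devices no longer being called: {names}",
        )
    return Condition(
        type="FencingDegraded",
        status="False",
        reason="AsExpected",
        message="all fence devices are available",
    )


if __name__ == "__main__":
    print(fencing_condition(["fence_node_a"]))
```

Keeping the decision in a pure function like this makes the "are we in this state" check easy to unit test separately from whatever code gathers the device status.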

Supporting Documents

Issue synthesized with help from the Gemini "Engineering Jira Buddy" gem.


ocpbugs-62067-walkthrough.txt
4 kB
2025/09/22 5:26 PM

Assignee: Unassigned

Reporter: Jeremy Poulin

Need Info From: None

Contributors: None

QA Contact: Douglas Hensel

Doc Contact: None

Votes: 0

Watchers: 1

Created: 2025/09/22 5:05 PM

Updated: 2025/10/28 4:29 PM

Resolved: 2025/10/28 4:29 PM