Story
Resolution: Unresolved
Normal
rhel-9.0.0
rhel-sst-high-availability
ssg_filesystems_storage_and_HA
13
False
Enhancement
Unspecified
Description of problem:
Requesting additional configurable failure-recovery options for Pacemaker-managed resources.
For example, a customer requested:
"RFE to add something like a retry and/or retry_attempts option for pacemaker
resource monitor operations."
Version-Release number of selected component (if applicable):
Latest 8.5 pacemaker
How reproducible:
Does not apply
Steps to Reproduce:
Does not apply
Actual results:
Currently, a monitor failure of a resource results in Pacemaker applying the configured "on-fail" action (restart, ignore, fence, etc.).
Expected results:
Pacemaker should provide more options for handling resource monitor failures, such as "retry X times before considering the resource monitor a failure".
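The requested behavior could be sketched roughly as follows. This is only an illustration of the semantics, not Pacemaker code: `monitor_with_retries`, `probe`, and `retry_attempts` are hypothetical names (the last borrowed from the customer's wording), and no such operation option exists in Pacemaker today.

```python
from typing import Callable


def monitor_with_retries(probe: Callable[[], bool], retry_attempts: int) -> bool:
    """Run a monitor probe, retrying up to `retry_attempts` extra times.

    Hypothetical sketch: `probe` stands in for one execution of a resource
    agent's monitor action and returns True when the resource is healthy.
    Returns True if any attempt succeeds; returns False only after the
    initial attempt and every retry have failed.
    """
    if retry_attempts < 0:
        raise ValueError("retry_attempts must be non-negative")
    for _ in range(1 + retry_attempts):
        if probe():
            return True
    return False


# A monitor that fails twice and then recovers is not reported as failed
# when two retries are allowed.
results = iter([False, False, True])
healthy = monitor_with_retries(lambda: next(results), retry_attempts=2)
print(healthy)
```

Under this sketch, the "on-fail" action would run only when the function returns False, so a transient monitor failure would no longer trigger recovery on its own.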
Additional info:
We spoke with engineering about this issue, and they noted that the following Bugzilla entries are related to this RFE:
- 1747559 – Allow operation failure timeouts to be configured per operation in Pacemaker
https://bugzilla.redhat.com/show_bug.cgi?id=1747559
- 1328448 – RFE: start-failure-is-fatal as per-resource parameter instead of global property
https://bugzilla.redhat.com/show_bug.cgi?id=1328448