-
Bug
-
Resolution: Unresolved
-
Undefined
-
rhel-9.6
-
None
-
resource-agents-4.16.0-44.el10
-
None
-
Moderate
-
OtherQA, ZStream
-
rhel-ha
-
13
-
26
-
3
-
False
-
False
-
-
No
-
None
-
Regression Exception
-
Requested
-
None
-
Unspecified Release Note Type - Unknown
-
Unspecified
-
Unspecified
-
Unspecified
-
None
Currently, if the etcd container managed by podman-etcd is abruptly terminated, the monitor operation returns OCF_NOT_RUNNING. This is received by Pacemaker as the resource was never running, which triggers an immediate, local restart of the agent.
This restart is too quick and uncoordinated with the peer node. The agent attempts to rejoin a cluster that hasn't yet recognized the failure,
leading to inconsistent state detection (e.g., seeing 2 active nodes) and causing the start operation to fail or deadlock.