Loading...

Linking RHIVOS CVEs to...

Migration: Automation ...

SWIFT: Generate New Ti...

SWIFT: POC Conversion

Sync from "Extern...

XML

Word

Printable

Type: Bug
Resolution: Done-Errata
Priority: Critical
Fix Version/s: rhel-9.4
Affects Version/s: rhel-9.2.0, rhel-9.3.0
Component/s: pacemaker
Labels:
- fixed_upstream

Fixed in Build:
pacemaker-2.1.7-4.el9
Regression:
Yes
Severity:
Important
Keywords:

ZStream, Regression

AssignedTeam:
rhel-ha
Sub-System Group:

ssg_filesystems_storage_and_HA

Dev Target Milestone:
22
Internal Target Milestone:
26
Story Points:
8
ACKs Check:

QE ack, Dev ack
Target Version:

rhel-9.4
Blocked:
False
Ready:
False
Blocked Reason:

Hide

None

Show
None
Product Documentation Required:
None
Products:

Red Hat Enterprise Linux
Sprint:
None
Release Blocker:
Approved Blocker

Preliminary Testing:
Pass
Testable Builds:
https://brewweb.engineering.redhat.com/brew/buildinfo?buildID=2882034
Errata Link:
https://errata.engineering.redhat.com/advisory/125612
Test Coverage:

RegressionOnly

Experience:
Architecture:

All
OS:
All
Target Upstream Version:
2.1.7

PX Impact Score:
PX Impact Range:
PX Priority Data:
SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Planning:
None

What were you trying to do that didn't work?

I tried to remove the stonith devices and stop the cluster, so I could setup sbd.

Please provide the package NVR for which bug is seen:

since pacemaker-2.1.6-7.el9.x86_64

How reproducible:

Sometimes, 50% chance

Steps to reproduce

setup two node cluster
check out which node is a DC

on a DC node: remove the stonith devices and stop the cluster (

pcs stonith delete fence-virt-252; pcs stonith delete fence-virt-253; pcs cluster stop --all

)

Expected results

Stonith devices are deleted, cluster stops.

Actual results

Cluster is stuck while stopping:

[root@virt-253 ~]# pcs stonith delete fence-virt-252; pcs stonith delete fence-virt-253; pcs cluster stop --all
Attempting to stop: fence-virt-252... Stopped
Attempting to stop: fence-virt-253... Stopped
virt-252: Stopping Cluster (pacemaker)...

[root@virt-253 ~]# pcs status --full
Cluster name: STSRHTS14392

WARNINGS:
No stonith devices and stonith-enabled is not false

Cluster Summary:
  * Stack: corosync (Pacemaker daemons are shutting down)
  * Current DC: virt-253 (2) (version 2.1.6-9.el9-6fdc9deea29) - MIXED-VERSION partition with quorum
  * Last updated: Fri Oct 13 13:16:22 2023 on virt-253
  * Last change:  Fri Oct 13 13:15:18 2023 by root via cibadmin on virt-252
  * 2 nodes configured
  * 0 resource instances configured

Node List:
  * Node virt-252 (1): pending, feature set <3.15.1
  * Node virt-253 (2): online, feature set 3.17.4

Full List of Resources:
  * No resources

Migration Summary:

Tickets:

PCSD Status:
  virt-252: Online
  virt-253: Online

Daemon Status:
  corosync: active/enabled
  pacemaker: inactive/enabled
  pcsd: active/enabled

After 15 minutes when cluster is stuck (`cluster-recheck-interval` I assume) cluster finally stops.

I created a crm_report from the incident and attached it. The cluster got stuck on the stop action around Oct 13 13:15

cluster-froze-when-stop.tar.bz2

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

cluster-froze-when-stop.tar.bz2
105 kB
2023/10/13 3:49 PM

is cloned by

RHEL-23082 Avoid "shutdown" node attribute persisting after shutdown [rhel-10]

In Progress

links to

After rebooting a controller node the `rabbitmq` services is not starting and we see the following in `pcs status`:

ClusterLabs T137

RHBA-2023:125612 pacemaker bug fix and enhancement update

Assignee:: Marketa Smazova

Reporter:: Marketa Smazova

Developer:: Kenneth Gaillot (Inactive)

QA Contact:: Marketa Smazova

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Created:: 2023/10/13 3:51 PM

Updated:: 2025/09/13 5:14 PM

Resolved:: 2024/04/30 9:32 AM

Dev Target end:: 2024/01/29

Target end:: 2024/02/26

Release Date:: 2024/04/30

Details

Description

What were you trying to do that didn't work?

Please provide the package NVR for which bug is seen:

How reproducible:

Steps to reproduce

Expected results

Actual results

Attachments

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates