Loading...

Linking RHIVOS CVEs to...

Migration: Automation ...

SWIFT: POC Conversion

Sync from "Extern...

XML

Word

Printable

Type: Bug
Resolution: Unresolved
Priority: Normal
Fix Version/s: None
Affects Version/s: rhel-9.6
Component/s: pcs
Labels:
None

Regression:
No
Severity:
Low

AssignedTeam:
rhel-ha

Story Points:
2
Blocked:
False
Ready:
False
Blocked Reason:

Hide

None

Show
None
Product Documentation Required:
None
Sprint:
None

Preliminary Testing:
None
Test Coverage:
None

ProdDocsReview-CCS:
Unspecified
ProdDocsReview-Dev:
Unspecified
ProdDocsReview-QE:
Unspecified

Experience:

PX Impact Score:
SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Planning:
None

What were you trying to do that didn't work?

Run regression test with simple setup & start cluster on upstream CI machines.

Probable reason formulated by pcs developer: pcs starts corosync + pcmk on all nodes and then checks whether the nodes have started. That's done by looking for each node status in crm_mon xml output. However, it looks like the check happens sooner than the started node actually appear in the xml. Probably caused by the nodes being slow to start.

Please provide the package NVR for which the bug is seen:

seen in pcs-0.11.9-99+git.57.ed947.el9

How reproducible is this bug?:

rarely, we couldn't reproduce the issue on our QA and engineering beaker instances, only occurrence is happening on upstream CI machines and not always.

Steps to reproduce

run pcs regression test pcs,cli,Setup on upstream CI machines. The test will call "pcs cluster start --all --wait" after setup.

Expected results

snippet from the command with --debug option:

--Debug Communication Output End--
node02: Starting Cluster...
node03: Starting Cluster...
Waiting for node(s) to start...
Sending HTTP Request to: https://node01:2224/remote/pacemaker_node_status
Data: None
Sending HTTP Request to: https://node03:2224/remote/pacemaker_node_status
Data: None
Sending HTTP Request to: https://node02:2224/remote/pacemaker_node_status
Data: None
Response Code: 400
--Debug Response Start--
Error: Node 'node03' does not appear to exist in configuration

Actual results

The command "pcs cluster start --all --wait" counts with slower machines.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

FAILED_rhel9-zstream_yes-upstream_stable-nodes_3-pcs_cli_Setup.tar.xz
22.92 MB
2025/04/01 3:18 PM

Assignee:: Tomas Jelinek

Reporter:: Michal Mazourek

Developer:: Tomas Jelinek

QA Contact:: Cluster QE

Votes:: 0 Vote for this issue

Watchers:: 7 Start watching this issue

Created:: 2025/04/01 3:17 PM

Updated:: 2025/10/07 2:07 PM

Stale Date:: 2026/03/31

Details

Description

What were you trying to do that didn't work?

Please provide the package NVR for which the bug is seen:

How reproducible is this bug?:

Steps to reproduce

Expected results

Actual results

Attachments

Attachments

Easy Agile Planning Poker

Activity

People

Dates