Loading...

XML

Word

Printable

Type: Bug
Resolution: Unresolved
Priority: Minor
Fix Version/s: None
Affects Version/s: rhel-9.4
Component/s: pacemaker
Labels:
None

Regression:
No
Severity:
Low

Pool Team:

rhel-sst-high-availability

Story Points:
5
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Product Documentation Required:
None
Sprint:
None

Preliminary Testing:
None
Test Coverage:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Planning:
None

What were you trying to do that didn't work? When removing a location constraint and reducing clone-max for a clone set, it's expected that a STOP action to be triggered only for the host. However, Pacemaker would issue a MOVE of the resource from one host to another and STOP the resource on another host. This was observed in the pacemaker.log file:

Initially the clone resource db2_mount_db2fs_sharedDb has 2 location constrains on 2 hosts kmbok-srv-3 and kmbok-srv-4 and clone-max set to 2.

1. First the location constrain for host kmbok-srv-3 was deleted:

Dec 04 13:05:16.842 kmbok-srv-1 pacemaker-based [2250947] (cib_process_request) info: Completed cib_delete operation for section //rsc_location[@id='pref-db2_mount_db2fs_sharedDb-runsOn-kmbok-srv-3']: OK (rc=0, origin=kmbok-srv-3/cibadmin/2, version=0.1785.0)

2. Then the clone-max value for the clone resource reduced (from 2) to 1:

Dec 04 13:05:16.902 kmbok-srv-1 pacemaker-based [2250947] (log_info) info: + /cib/configuration/resources/clone[@id='db2_mount_db2fs_sharedDb-clone']/meta_attributes[@id='db2_mount_db2fs_sharedDb-meta_attributes']/nvpair[@id='db2_mount_db2fs_sharedDb-meta_attributes-clone-max']: @value=1

3. Initially Pacemaker decided to only issue the STOP action only kmbok-srv-3:
Dec 04 13:05:16.932 kmbok-srv-1 pacemaker-schedulerd[2250951] (log_list_item) notice: Actions: Stop db2_mount_db2fs_sharedDb:0 ( kmbok-srv-3 ) due to node availability

3. But then Pacemaker decided to do something unexpected: Move the clone resource from kmbok-srv-3 to kmbok-srv-4, and issue a stop on kmbok-srv-4:

Dec 04 13:05:16.973 kmbok-srv-1 pacemaker-schedulerd[2250951] (log_list_item) notice: Actions: Move db2_mount_db2fs_sharedDb:0 ( kmbok-srv-3 -> kmbok-srv-4 )

Dec 04 13:05:16.973 kmbok-srv-1 pacemaker-schedulerd[2250951] (log_list_item) notice: Actions: Stop db2_mount_db2fs_sharedDb:1 ( kmbok-srv-4 ) due to node availability

What is the impact of this issue to you? The unexpected STOP and START for the other host caused issues with Db2 application since it attempted to unmount the filesystem unexpectedly.

Please provide the package NVR for which the bug is seen: 2.1.6-4

How reproducible is this bug?: Easy to reproduce

Steps to reproduce

Create a cluster with 4 nodes
Create a clone set with location constraint defined on 2 hosts, and clone-max set to 2. This is to control the location of the resource where it can be active on.
Remove location constraint on one host and reduce clone-max to 1.

Expected results: Expect that the STOP action to be issued for the host that the location constraint is removed.

Actual results: Pacemaker issued a MOVE of the resource from the removing host, and STOP action on the remaining host.

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

unexpected-move-action-for-clone.tar.bz2
24.16 MB
2024/12/04 9:57 PM

links to

ClusterLabs T963

Assignee:: Christopher Lumens

Reporter:: Lan Pham

Contributing Groups:: IBM Confidential Group

Developer:: Kenneth Gaillot

QA Contact:: Cluster QE

Votes:: 0 Vote for this issue

Watchers:: 6 Start watching this issue

Created:: 2024/12/04 9:56 PM

Updated:: 2024/12/23 11:23 PM

Details

Description

What is the impact of this issue to you? The unexpected STOP and START for the other host caused issues with Db2 application since it attempted to unmount the filesystem unexpectedly.

Please provide the package NVR for which the bug is seen: 2.1.6-4

How reproducible is this bug?: Easy to reproduce

Steps to reproduce

Expected results: Expect that the STOP action to be issued for the host that the location constraint is removed.

Actual results: Pacemaker issued a MOVE of the resource from the removing host, and STOP action on the remaining host.

Attachments

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates