Type: Bug
Resolution: Done-Errata
Priority: Normal
Fix Version/s: rhos-16.2.9
Affects Version/s: rhos-16.2.z
Component/s: tripleo-ansible
Labels:
None

Story Points:
2
Blocked:
False
Blocked Reason:
None
Ready:
False
Docs Approval:
?
Fixed in Build:
tripleo-ansible-0.8.1-2.20250429125109.123ce73.el8ost
AssignedTeam:
rhos-ops-day1day2-upgrades
Regression:
None
Release Note Text:

.Summary:
The Galera database cannot be backed up when more than one galera container is running on the server.

Cause -
When several galera containers are running on the server, the procedure cannot determine the correct ID of the container on which to execute the command.

Consequence -
The command is executed on the wrong container and fails.

Workaround -
Clean up all orphaned containers so that only the galera Pacemaker container is running before starting the backup.

Result -

Release Note Type:
Known Issue
Intelligence Requested:
Market:
Errata Link:
https://errata.engineering.redhat.com/advisory/149919

Severity:
Moderate

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

To Reproduce
Steps to reproduce the behavior:
The customer had an orphaned galera container on a controller node during the backup. As a result, the "Get the mysql container id when galera is enabled" play returned two container IDs. This blocked the backup process: the "Galera desync the MySQL node" play failed with the following output:

fatal: [controller02]: FAILED! => {"attempts": 300, "changed": true, "cmd": "set -o pipefail\npodman exec 8862d8d227b8\n1c9d9dd69dd3 bash -c \"mysql -p -u root \\\n-pPASSWORD --execute 'SET GLOBAL wsrep_desync = ON'\"\n", "delta": "0:00:00.145099", "end": "TS", "msg": "non-zero return code", "rc": 127, "start": "TS0", "stderr": "Error: must provide a non-empty command to start an exec session: invalid argument\n/bin/sh: line 2: 1c9d9dd69dd3: command not found", "stderr_lines": ["Error: must provide a non-empty command to start an exec session: invalid argument", "/bin/sh: line 2: 1c9d9dd69dd3: command not found"], "stdout": "", "stdout_lines": []}
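
To make the failure mode in the output above easier to see, here is a minimal sketch of how a two-line ID lookup result splits the shell command. The template shape is inferred from the "cmd" field in the log (with the password argument left out); `render_command` is a hypothetical helper for illustration and is not the tripleo-ansible code.

```python
# Sketch only: reproduces the shape of the "cmd" string from the log above.
# render_command is a hypothetical helper, not part of tripleo-ansible.

def render_command(container_id_stdout: str) -> str:
    """Interpolate the raw stdout of the ID lookup into the exec command."""
    return (
        "set -o pipefail\n"
        f"podman exec {container_id_stdout} bash -c "
        "\"mysql -u root --execute 'SET GLOBAL wsrep_desync = ON'\"\n"
    )


one_id = "8862d8d227b8"
two_ids = "8862d8d227b8\n1c9d9dd69dd3"  # the two IDs seen in the log

for label, stdout in (("one id", one_id), ("two ids", two_ids)):
    print(label)
    for line in render_command(stdout).splitlines():
        print("   ", line)
```

With two IDs, the shell receives `podman exec 8862d8d227b8` as a complete line with no command to run, and then tries to run `1c9d9dd69dd3` as a program. That matches the two stderr lines and the return code 127 in the log.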

Expected behavior
tripleo-ansible is expected either to fail gracefully when it cannot determine a single, consistent container ID, or to use a more precise search filter when determining the container ID.
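
As a rough illustration of both options, the sketch below filters the candidates first and then refuses to continue unless exactly one container remains. Everything in it is an assumption made for illustration: the `Container` record, the `galera-bundle` name prefix, the `select_galera_container` helper, and the sample IDs are hypothetical and do not come from tripleo-ansible or from the failing node.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class Container:
    """Hypothetical stand-in for one row of container listing output."""
    id: str
    name: str
    running: bool


class ContainerLookupError(RuntimeError):
    """Raised when the lookup does not resolve to exactly one container."""


def select_galera_container(
    containers: Iterable[Container], name_prefix: str = "galera-bundle"
) -> str:
    """Return the ID of the single running container matching name_prefix.

    The prefix is an assumed naming convention, not a confirmed one.
    """
    candidates = [
        c for c in containers if c.running and c.name.startswith(name_prefix)
    ]
    if len(candidates) != 1:
        found = ", ".join(c.id for c in candidates) or "none"
        raise ContainerLookupError(
            f"expected exactly one running '{name_prefix}' container, "
            f"found: {found}"
        )
    return candidates[0].id


# Made-up sample data: one running container and one stopped leftover.
sample = [
    Container("aaaaaaaaaaaa", "galera-bundle-podman-0", running=True),
    Container("bbbbbbbbbbbb", "galera-bundle-podman-0", running=False),
]
print(select_galera_container(sample))
```

If the orphaned container is itself still running, the filter alone does not disambiguate. In that case the check raises a clear error before any command is built, instead of producing the malformed command shown in the log.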

Bug impact
The ReaR procedure is blocked until the orphaned container is removed.

Known workaround
Clean up the orphaned container entries.

P.S. I understand that this is unlikely to get fixed, but I am reporting it in case I missed anything and we do want to fix it.

links to

RHBA-2025:149919 Red Hat OpenStack Platform 16.2.9 bug fix advisory

mentioned on

Merge request - Draft: B&R - Use --filter to stop galera containers

Assignee: Juan Payno

Reporter: Alex Stupnikov

QA Contact: Archana Singh

Team: rhos-dfg-upgrades

Votes: 0

Watchers: 6

Created: 2025/02/17 3:20 PM

Updated: 2025/09/13 3:11 AM

Resolved: 2025/07/16 1:02 PM
