Red Hat OpenStack Services on OpenShift
OSPRH-14472

Service reaches ready state with failing updates to deployments while old replicas are still healthy


    • openstack-operator-bundle-container-1.0.10-7
      .Failed service updates are now reflected accurately in the deployment status

      Before this update, when an update to a service configuration failed, the failure was not reflected in the condition status of the deployment. Instead, the `Ready` condition showed "True", because the new pods created by the update were not considered when checking deployment readiness. With this update, any new pods created during a configuration update are considered when assessing deployment readiness. If the rollout of the new pods fails, the deployment reflects that it is stuck in `Deployment in progress`.
    • Bug Fix
    • Done
    • Important
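
      The release note above says that new pods created during a configuration update are now considered when assessing deployment readiness. As a rough illustration only (not the actual openstack-operator code), a readiness check along those lines could look like the following Go sketch; the helper name IsDeploymentReady is hypothetical, and the input is a plain k8s.io/api/apps/v1 Deployment.

        package readiness

        import (
            appsv1 "k8s.io/api/apps/v1"
        )

        // IsDeploymentReady is a hypothetical helper illustrating the check the
        // release note describes: the Deployment only counts as ready when the
        // pods created from the current pod template are up, not merely when the
        // old ReplicaSet still satisfies the minimum-available threshold.
        func IsDeploymentReady(d *appsv1.Deployment) bool {
            wanted := int32(1)
            if d.Spec.Replicas != nil {
                wanted = *d.Spec.Replicas
            }

            // The controller must have observed the latest spec, otherwise the
            // status fields below may still describe the previous generation.
            if d.Status.ObservedGeneration < d.Generation {
                return false
            }

            // UpdatedReplicas counts pods from the newest ReplicaSet, ReadyReplicas
            // counts pods passing their readiness probes. With a CrashLoopBackOff
            // pod from the new template, UpdatedReplicas stays below wanted, so the
            // deployment is reported as still in progress instead of ready.
            return d.Status.UpdatedReplicas == wanted &&
                d.Status.ReadyReplicas == wanted &&
                d.Status.AvailableReplicas == wanted
        }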

      To Reproduce

      Steps to reproduce the behavior:

      1) Deploy a ctlplane, e.g. with

      OPENSTACK_IMG=quay.io/openstack-k8s-operators/openstack-operator-index:87ab1f1fa16743cad640f994f459ef14c5d2b9ca

      2) Update or add a configuration that makes the service pod fail to start, e.g. by updating to

      OPENSTACK_IMG=quay.io/openstack-k8s-operators/openstack-operator-index:a3ed3f47c7e695b766c0c9e86148fd262e464629

      3) The services reconcile and the new keystone pod fails because of https://github.com/openstack-k8s-operators/keystone-operator/pull/541 , but the service is still reported as ready because the old deployment pods are still healthy and can serve requests.

      Identified with keystone, but I think others are affected, too.

      Expected behavior

      • the keystone service should not reach the ready state and should reflect that the deployment is not able to bring up the new pod

      Bug impact

      • the service continues to run with the old deployment until the underlying problem is fixed.
      • if the user only validates the condition state, it does not reflect that the new service pod failed to come up

      Additional context

      The behavior is:

      • when there is an update to the service, the deployment gets updated and a rolling restart happens; when the new pod fails to start, it ends up in CrashLoopBackOff, like this:
        keystone-7fbb9c97b-4j7kk                                       1/1     Running            0               20m
        keystone-cbd787c54-8h9kj                                       0/1     CrashLoopBackOff   5 (70s ago)     4m12s
      • but the deployment status is OK, because the minimum number of available replicas is still satisfied for the deployment (see the sketch after this list):
        keystone                                       1/1     1            1           20m
      • and keystoneapi is happy:
        $ oc get keystoneapi
        NAME       NETWORKATTACHMENTS   STATUS   MESSAGE
        keystone                        True     Setup complete
      • as a result, the ctlplane is happy, too:
        $ oc get osctlplane
        NAME                                 STATUS   MESSAGE
        openstack-galera-network-isolation   True     Setup complete
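
      This matches how Deployment conditions work in Kubernetes: the `Available` condition is driven by the minimum-available calculation, which the old ReplicaSet still satisfies, while a stuck rollout only surfaces in the `Progressing` condition, which turns False with reason `ProgressDeadlineExceeded` once `progressDeadlineSeconds` (600s by default) elapses. The following sketch, which assumes client-go access and that the keystone Deployment lives in the `openstack` namespace, simply prints those two conditions.

        package main

        import (
            "context"
            "fmt"
            "log"

            appsv1 "k8s.io/api/apps/v1"
            metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
            "k8s.io/client-go/kubernetes"
            "k8s.io/client-go/tools/clientcmd"
        )

        func main() {
            // Kubeconfig path and the "openstack" namespace are assumptions for
            // this example.
            cfg, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
            if err != nil {
                log.Fatal(err)
            }
            client, err := kubernetes.NewForConfig(cfg)
            if err != nil {
                log.Fatal(err)
            }

            dep, err := client.AppsV1().Deployments("openstack").Get(
                context.TODO(), "keystone", metav1.GetOptions{})
            if err != nil {
                log.Fatal(err)
            }

            // During the stuck rollout described above, Available stays True
            // (MinimumReplicasAvailable) while Progressing eventually becomes
            // False with reason ProgressDeadlineExceeded.
            for _, c := range dep.Status.Conditions {
                if c.Type == appsv1.DeploymentAvailable || c.Type == appsv1.DeploymentProgressing {
                    fmt.Printf("%s=%s reason=%s message=%q\n", c.Type, c.Status, c.Reason, c.Message)
                }
            }
        }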

              rhn-support-mschuppe Martin Schuppert
              rhos-dfg-ospk8s