Loading...

XML

Word

Printable

Type: Bug
Resolution: Unresolved
Priority: Major
Fix Version/s: CNV v4.16.z
Affects Version/s: None
Component/s: CNV Install, Upgrade and Operators
Labels:
None

Activity Type:
Quality / Stability / Reliability
Story Points:
0.42
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Component Fix Version(s):
None
Market:

Sprint:
CNV I/U Operators Sprint 285

Regression:
None

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

PX Impact Score:

The test `TestUpgradeIUO::test_alerts_fired_during_upgrade` is consistently failing in the CNV 4.16 Z-stream upgrade job across multiple builds.

Failing test

tests/install_upgrade_operators/product_upgrade/test_upgrade_iuo.py::TestUpgradeIUO::test_alerts_fired_during_upgrade

Error

AssertionError: Following alerts were fired during upgrade:
[{'labels': {'alertname': 'OutdatedVirtualMachineInstanceWorkloads', 'severity': 'warning', 'operator_health_impact': 'none', 'namespace': 'openshift-cnv', 'pod': 'virt-controller-...', 'job': 'kubevirt-prometheus-metrics', ...},
  'annotations': {'summary': 'Some running VMIs are still active in outdated pods after KubeVirt control plane update has completed.',
                   'runbook_url': 'https://github.com/openshift/runbooks/blob/master/alerts/openshift-virtualization-operator/OutdatedVirtualMachineInstanceWorkloads.md'},
  'state': 'pending'}]

Root cause (observed)
During CNV Z-stream upgrade (4.16.z -> 4.16.30), the `OutdatedVirtualMachineInstanceWorkloads` alert fires with state `pending`. The alert means that some VMI workloads are still running in outdated virt-controller pods after the KubeVirt control plane update completes. The test asserts that no such alert should be fired during upgrade.

Affected builds (CNV 4.16 Z-stream, both identical failures)

Build #244 (2026-02-20) - Results: 40 Passed, 1 Failed, 2 Skipped
Build #245 (2026-02-27) - Results: 40 Passed, 1 Failed, 2 Skipped

Environment

CNV version: 4.16.30 (hco-bundle-registry-container-v4.16.30.rhel9-42)
OCP version: 4.16.57
IIB: registry-proxy.engineering.redhat.com/rh-osbs/iib:1092425
Cluster: cnv416z-upg.rhos-psi.cnv-qe.rhood.us
Storage: ocs-storagecluster-ceph-rbd-virtualization
Errata: 157271 (SHIPPED_LIVE)

Impact
This is a T2 upgrade test. Two consecutive weekly upgrade runs for CNV 4.16 have 1 failure each with the same test. The test result is consistent across different cluster instances (different cluster IDs), confirming this is not a flake but a reproducible issue.

Next steps

Investigate whether the OutdatedVirtualMachineInstanceWorkloads alert is expected to fire during Z-stream upgrade (and the test expectation is wrong)
Or confirm this is a product bug where VMI workloads are not migrated to updated pods in time
Check if this is a known issue for CNV 4.16.30
Runbook: https://github.com/openshift/runbooks/blob/master/alerts/openshift-virtualization-operator/OutdatedVirtualMachineInstanceWorkloads.md

Assignee:: Ohad Revah

Reporter:: Miroslav Sedlak

QA Contact:: Ohad Revah

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: 2026/03/03 4:41 PM

Updated:: 2026/03/10 10:17 AM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates