Loading...

XML

Word

Printable

Type: Bug
Resolution: Unresolved
Priority: Normal
Fix Version/s: CNV v4.23.0
Affects Version/s: None
Component/s: CNV Virt-Node
Labels:
None

Activity Type:
Quality / Stability / Reliability
Story Points:
0.42
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Component Fix Version(s):
None
Market:

Regression:
None

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

PX Impact Score:

Description of problem:

If the cluster has came into a state in which live migrations that have been triggered by kubevirt-workload-update are constantly failing, it produce a situation in which a target virt-launcher pod is created every 5 minutes for every VM in the cluster.
This results in thousands of virt-launcher pods with "Error" state lying around the cluster, overloading etcd and might cause the cluster to become less responsive in short time.

Version-Release number of selected component (if applicable):

all versions

How reproducible:

if VMIMs are failing, 100%

Steps to Reproduce:

1. reproduce bug https://issues.redhat.com/browse/RHEL-131697 (for example)
2. observe that a new virt-launcher target pod is created every 5 minutes and then got failed, over and over again for every VM in the cluster.
3.

Actual results:

thousands of Errored virt-launcher pods reside on the cluster in 1 day

Expected results:

there should be an exponential backoff mechanism in such case.

Additional info:

If there is a permanent issue, retries shouldn't be executed in constant intervals of 5 minutes.
Instead the next retry should be twice as long as the previous one.
Until an upper limit of, for example, 4 hours.

clones

CNV-74856 [Tracker] Live migration after workload update fails with operation failed: guest CPU doesn't match specification: missing features: pdcm

Assignee:: Stuart Gott

Reporter:: Oren Cohen

Contributors:: Dan Kenigsberg

QA Contact:: Denys Shchedrivyi

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Created:: 2025/12/17 11:56 AM

Updated:: 2025/12/17 1:27 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates