Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Undefined
Fix Version/s: None
Affects Version/s: 4.19.0
Component/s: Image Registry
Labels:
- trt-incident

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
Critical
Regression:
None

Target Backport Versions:
None
Target Version:
None
Release Blocker:
Proposed
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Three single node aggregated jobs have failed now with 5072 failures, caused by not enough results to aggregate. Roughly half the jobs fail to install because the image registry is complaining about:

{Operator degraded (ImagePrunerJobFailed): ImagePrunerDegraded: Job has reached the specified backoff limit Operator degraded (ImagePrunerJobFailed): ImagePrunerDegraded: Job has reached the specified backoff limit}

Examples:

https://prow.ci.openshift.org/view/gs/test-platform-results/logs/aggregated-aws-ovn-single-node-upgrade-4.19-micro-release-openshift-release-analysis-aggregator/1866988278972944384

https://prow.ci.openshift.org/view/gs/test-platform-results/logs/aggregated-aws-ovn-single-node-upgrade-4.19-micro-release-openshift-release-analysis-aggregator/1867042320000487424

Or an example of the sub-job with the failure:

https://prow.ci.openshift.org/view/gs/test-platform-results/logs/periodic-ci-openshift-release-master-ci-4.19-e2e-aws-upgrade-ovn-single-node/1866988251177291776

The closest I was able to get to a root cause was here

I1212 00:08:31.988786      67 envvar.go:172] "Feature gate default state" feature="WatchListClient" enabled=false
I1212 00:08:31.988805      67 envvar.go:172] "Feature gate default state" feature="InformerResourceVersion" enabled=false
Error from server (Timeout): the server was unable to return a response in the time allotted, but may still be processing the request (get pods)

Marked critical as this is a payload blocker for 4.19 at this point.

Assignee:: Flavian Missi

Reporter:: Devan Goodwin

Need Info From:: None

Contributors:: None

QA Contact:: XiuJuan Wang

Doc Contact:: None

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2024/12/12 1:23 PM

Updated:: 2025/07/17 1:33 PM

Resolved:: 2024/12/13 2:01 PM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates