OpenShift Bugs / OCPBUGS-43966

High snapshot rate on redhat-operators; OLM operator install hangs. RPC DeadlineExceeded while listing bundles.


    • Incidents & Support
    • Important
    • Customer Escalated
    • In Progress
    • Bug Fix
      Previously, the catalog-operator captured catalog snapshots every 5 minutes. In clusters with many namespaces and subscriptions, and with the larger catalogsources available in 4.15 and 4.16, snapshots would start failing and the failures would cascade across catalogsources (causing CPU load spikes), making it effectively impossible to upgrade or install operators. With this change, the cache lifetime is 30 minutes, which allows plenty of time for attempts to resolve without putting undue load on the catalogsource pods.
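      For illustration only, here is a minimal sketch of the TTL-gated snapshot cache behavior the note above describes, assuming a simple per-catalog cache keyed by catalog name. The names (snapshotTTL, snapshotCache, the list callback) are hypothetical and are not the actual catalog-operator code; the point is that repeated resolutions within the 30-minute window reuse the cached snapshot instead of re-listing bundles from every catalogsource pod.

{code:go}
package main

import (
	"fmt"
	"sync"
	"time"
)

// snapshotTTL illustrates the change described above: a catalog snapshot is
// reused for 30 minutes instead of being retaken every 5 minutes.
const snapshotTTL = 30 * time.Minute

// catalogSnapshot is a hypothetical cached view of a CatalogSource's bundles.
type catalogSnapshot struct {
	bundles []string // placeholder for the resolved bundle data
	expiry  time.Time
}

func (s *catalogSnapshot) expired(now time.Time) bool {
	return s == nil || now.After(s.expiry)
}

// snapshotCache keeps one snapshot per catalog and only re-runs the expensive
// bundle-listing call once the cached copy has aged out.
type snapshotCache struct {
	mu    sync.Mutex
	byKey map[string]*catalogSnapshot
	list  func(catalog string) ([]string, error) // stand-in for the costly gRPC listing
}

func (c *snapshotCache) get(catalog string) ([]string, error) {
	c.mu.Lock()
	defer c.mu.Unlock()

	if snap := c.byKey[catalog]; !snap.expired(time.Now()) {
		return snap.bundles, nil // fresh enough: no extra load on the catalog pod
	}
	bundles, err := c.list(catalog)
	if err != nil {
		return nil, err
	}
	c.byKey[catalog] = &catalogSnapshot{bundles: bundles, expiry: time.Now().Add(snapshotTTL)}
	return bundles, nil
}

func main() {
	cache := &snapshotCache{
		byKey: map[string]*catalogSnapshot{},
		list: func(catalog string) ([]string, error) {
			fmt.Println("listing bundles from", catalog) // would be the expensive RPC
			return []string{"example-operator.v1.0.0"}, nil
		},
	}
	// Repeated resolutions within the 30-minute window hit the cache;
	// the catalog is only listed once.
	for i := 0; i < 3; i++ {
		bundles, _ := cache.get("redhat-operators")
		fmt.Println(bundles)
	}
}
{code}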

      When trying to install an operator, the following is logged:

      "Warning alert: CatalogSource health unknown. This operator cannot be updated. The health of CatalogSource "redhat-operators" is unknown. It may have been disabled or removed from the cluster."

      1. The underlying error in the logs is {{msg="error encountered while listing bundles: rpc error: code = DeadlineExceeded desc = context deadline exceeded" catalog="{redhat-operators openshift-marketplace}"}} (see the sketch after this list).
      2. As discussed, we could not reproduce this locally; we attempted multiple times to simulate the relevant gRPC connection and the exact API call, and those attempts succeeded for us.
      3. The suspected cause is therefore a network issue on the customer's cluster. We need full cooperation from a qualified cluster/network professional on the customer side who knows their exact configuration, and a detailed network dump/analysis of what actually happened at the point in time when OLM got this timeout.
      4. We cannot proceed with the investigation based on the information we currently have.
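      For reference, a minimal, self-contained sketch of how the DeadlineExceeded in item 1 arises: the client bounds the bundle-listing call with a context deadline, and a catalog pod or network path that cannot answer in time surfaces as codes.DeadlineExceeded. The listBundles function and the timeout values below are placeholders, not the actual OLM code.

{code:go}
package main

import (
	"context"
	"errors"
	"fmt"
	"time"

	"google.golang.org/grpc/codes"
	"google.golang.org/grpc/status"
)

// listBundles is a hypothetical stand-in for the bundle-listing RPC that the
// catalog-operator issues against a CatalogSource pod. Here it simply blocks
// until the simulated "server" answers or the caller's deadline expires.
func listBundles(ctx context.Context) error {
	select {
	case <-time.After(5 * time.Second): // simulated slow catalog pod / stalled network path
		return nil
	case <-ctx.Done():
		return ctx.Err() // context.DeadlineExceeded once the deadline passes
	}
}

func main() {
	// Bound the listing call with a deadline, as a gRPC client would.
	ctx, cancel := context.WithTimeout(context.Background(), 1*time.Second)
	defer cancel()

	if err := listBundles(ctx); err != nil {
		// A deadline hit on a real gRPC call is reported as codes.DeadlineExceeded,
		// which is the code quoted in the catalog-operator log line above.
		if errors.Is(err, context.DeadlineExceeded) || status.Code(err) == codes.DeadlineExceeded {
			fmt.Printf("error encountered while listing bundles: %v\n", err)
		}
	}
}
{code}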

              rh-ee-jkeister Jordan Keister
              rhn-support-jshivers Jacob Shivers
              Xia Zhao
              Votes: 3
              Watchers: 29
