Loading...

XML

Word

Printable

Type: Bug
Resolution: Done-Errata
Priority: Major
Fix Version/s: 4.14.0
Affects Version/s: 4.12, 4.11
Component/s: Networking / openshift-sdn
Labels:
- CNO
- ocpve

Severity:
Moderate
Regression:
None
Story Points:
3
Sprint:
OCP VE Sprint 225, OCP VE Sprint 226, OCP VE Sprint 227, OCP VE Sprint 228, OCP VE Sprint 229, OCP VE Sprint 230, OCP VE Sprint 231, OCP VE Sprint 232, OCP VE Sprint 233, OCP VE Sprint 234, OCP VE Sprint 235
sprint_count:
11
Release Blocker:
Rejected
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Release Note Text:
N/A
Release Note Type:
Release Note Not Required
Target Version:

4.14.0

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

I haven't gone back to pin down all affected versions, but I wouldn't be surprised if we've had this exposure for a while. On a 4.12.0-ec.2 cluster, we have:

cluster:usage:resources:sum{resource="podnetworkconnectivitychecks.controlplane.operator.openshift.io"}

currently clocking in around 67983. I've gathered a dump with:

$ oc --as system:admin -n openshift-network-diagnostics get podnetworkconnectivitychecks.controlplane.operator.openshift.io | gzip >checks.gz

And many, many of these reference nodes which no longer exist (the cluster is aggressively autoscaled, with nodes coming and going all the time). We should fix garbage collection on this resource, to avoid consuming excessive amounts of memory in the Kube API server and etcd as they attempt to list the large resource set.

is blocked by

SDN-3636 Kube 1.26 rebase for CNO

Closed

is cloned by

OCPBUGS-17721 [release-4.13] Node churn leaks PodNetworkConnectivityChecks

Closed

is depended on by

OCPBUGS-17721 [release-4.13] Node churn leaks PodNetworkConnectivityChecks

Closed

links to

openshift/cluster-network-operator#1566: OCPBUGS-1341: Set owner reference for pod network connectivity check

openshift/cluster-network-operator#1649: OCPBUGS-1341: Enhance check controller to remove old check objects

openshift/library-go#1430: OCPBUGS-1341: Remove stale pod network connectivity checks

RHEA-2023:5006 rpm

(2 links to)

Assignee:: Periyasamy Palanisamy

Reporter:: W. Trevor King

QA Contact:: Mike Fiedler

Votes:: 0 Vote for this issue

Watchers:: 11 Start watching this issue

Created:: 2022/09/14 9:26 PM

Updated:: 2024/05/22 6:40 AM

Resolved:: 2023/10/31 1:37 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates