Type: Bug
Resolution: Unresolved
Priority: Major
Target Version: 4.16.z
Description of problem:
Some OpenShift Dedicated clusters were found to be leaking veth* interfaces.
These clusters run 4.16 with OpenShift SDN.
On one worker node of such a cluster, there are 5000+ veth interfaces while only 70 containers are running:
$ ip link | grep veth | wc -l
5433
$ crictl ps | wc -l
70
We found this issue because the node-exporter pod was consuming high CPU; the high CPU turned out to be caused by node-exporter having to read information for this large number of veth* interfaces. Ref OCPBUGS-44100.
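For diagnosis, the rough sketch below lists host veth interfaces whose peer is not eth0 of any pod sandbox on the node. This is only a heuristic, not a verified procedure: it assumes it is run from a root shell on the affected node, that crictl, jq and nsenter are available there, and that the jq field path into the crictl inspectp output matches the installed CRI-O version (that path is an assumption).

#!/bin/bash
# Collect the host-side ifindex of every veth peer still referenced by a pod sandbox.
declare -A in_use
for pod in $(crictl pods -q); do
  # Assumed field path; adjust to the actual 'crictl inspectp' JSON layout if needed.
  netns=$(crictl inspectp "$pod" 2>/dev/null \
    | jq -r '.info.runtimeSpec.linux.namespaces[] | select(.type=="network") | .path' 2>/dev/null)
  [ -n "$netns" ] || continue
  # "eth0@ifN" inside the pod netns: N is the ifindex of the host-side veth peer.
  peer=$(nsenter --net="$netns" ip -o link show eth0 2>/dev/null \
    | sed -n 's/.*eth0@if\([0-9]*\).*/\1/p')
  [ -n "$peer" ] && in_use[$peer]=1
done
# Any host veth whose ifindex is not referenced by a running pod is a leak candidate.
ip -o link show type veth | while read -r idx name _; do
  idx=${idx%:}
  name=${name%%@*}
  [ -z "${in_use[$idx]}" ] && echo "possibly leaked: $name (ifindex $idx)"
done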
Version-Release number of selected component (if applicable):
4.16
How reproducible:
The issue only happens on some clusters, and only on some nodes of those clusters. Replacing or rebooting the node works around the issue, but the issue may recur on some of the replaced/rebooted nodes. A sketch of the workaround commands follows.
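For reference, a typical node-level workaround sequence (a sketch only; <node> is a placeholder and the drain flags depend on the workloads running on the node):

oc adm cordon <node>
oc adm drain <node> --ignore-daemonsets --delete-emptydir-data
oc debug node/<node> -- chroot /host systemctl reboot
oc adm uncordon <node>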
Steps to Reproduce:
We haven't figured out what triggers the problem or how to reproduce it.
Actual results:
The number of veth* interfaces on a node keeps growing.
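A simple way to confirm the growth on a suspect node (a rough sketch; the 10-minute sampling interval is arbitrary):

while true; do
  # Log a timestamped sample of veth count vs. running container count.
  echo "$(date -Is) veths=$(ip -o link show type veth | wc -l) containers=$(crictl ps -q | wc -l)"
  sleep 600
done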
Expected results:
The number of veth* interfaces is similar to the number of running containers.
Additional info:
Affected Platforms:
Is it an SD issue? Yes, it impacts some of the OSD clusters.
Must-gather and sosreport will be attached in the next comment.
Related ticket: OHSS-38706.