Network Observability / NETOBSERV-583

eBPF collector CrashLoops with OOMKills under modest load

    • Type: Bug
    • Resolution: Done
    • Priority: Major
    • Components: eBPF, FLP, Operator
    • Category: Quality / Stability / Reliability
    • Severity: Critical
    • Sprints: NetObserv - Sprint 224, NetObserv - Sprint 225, NetObserv - Sprint 226, NetObserv - Sprint 227

      With the out-of-the-box (OOTB) FlowCollector CRD, the eBPF flow collector pods repeatedly CrashLoop with reason OOMKilled under very modest network load.

      Steps to reproduce:

      1. Create an AWS cluster with 9 m5.2xlarge workers.
      2. Install NetObserv with its default 100Mi memory limit in the FlowCollector CRD (see the excerpt after these steps).
      3. Run the hey-ho app (https://github.com/jotak/hey-ho) with 10 projects, 10 deployments, and 1 replica.
      4. Run oc get pods -w against the eBPF agent namespace and watch the netobserv-ebpf-agent pods CrashLoop.

      The network traffic per node is 20K-300K flows/minute and roughly 200Mb/s, spread across 1-2 pods per node.

      We should remove the memory limit for the collector unless we know a correct limit for our target flow and traffic rates. The OOTB default should not crash this easily. An interim workaround is sketched below.

              Assignee: Joel Takvorian (jtakvori)
              Reporter: Mike Fiedler (mifiedle@redhat.com)
