Network Observability / NETOBSERV-983

eBPF pod requires more memory than the default limit

    • Sprints: NetObserv - Sprint 236, NetObserv - Sprint 237, NetObserv - Sprint 238, NetObserv - Sprint 239
    • Severity: Low

      Over several rounds of testing NetObserv on our Baremetal cluster, we have observed that one or more eBPF pods will be OOMKilled and go into a CrashLoopBackOff state: https://docs.google.com/document/d/1DOfV17DEuqI0YSW6oOLc_XQlze-ZFS5FtqYf39n179E/edit?usp=sharing

      This can be mitigated by increasing the eBPF memory limit from the default 800Mi. We have seen success with a limit of 2000Mi, though the required value is likely tied to the amount of traffic we are generating.
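      The mitigation above can be applied through the FlowCollector resource. A minimal sketch, assuming the `spec.agent.ebpf.resources` field path exposed by the NetObserv operator (the `apiVersion` may differ depending on the installed operator version; verify against your cluster's CRD):

      ```yaml
      apiVersion: flows.netobserv.io/v1beta2
      kind: FlowCollector
      metadata:
        name: cluster
      spec:
        agent:
          type: eBPF
          ebpf:
            resources:
              limits:
                # Raised from the default 800Mi; 2000Mi worked in our
                # Baremetal testing, but tune to your traffic volume.
                memory: 2000Mi
      ```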

      Marking this as low severity since it does not appear to result in dropped flows, but we should either increase the default eBPF memory limit or find a way to balance this load across the other eBPF pods.

            mmahmoud@redhat.com Mohamed Mahmoud
            nweinber1 Nathan Weinberg