Type: Bug
Resolution: Done
Priority: Major
Fix Version/s: netobserv-1.4
Affects Version/s: netobserv-1.3, netobserv-1.2
Component/s: FLP, Loki
Labels:
- perfscale_65_nodes

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:
None
Story Points:
None
Severity:
Important

Target Version:
None
Release Blocker:
None
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

In recent NetObserv 1.2 performance testing, we have witnessed the following behavior:

During load spikes, certain eBPF pods are OOMKilled and enter the CrashLoopBackOff state. Notably, these have been observed to be the pods co-located on nodes with LokiStack resources, which have high memory usage.
- This behavior was observed with both small and medium-sized LokiStacks, and with both the default eBPF memory limit of 800Mi and an increased limit of 1000Mi.
Flows continue to be processed, but some are dropped during these spikes.

After the load spikes end, the operator recovers: the eBPF pods recover and flows are written again.

Opening this bug to track the behavior and gather more data.
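
As a starting point for gathering that data, here is a minimal sketch that lists containers whose last termination reason was OOMKilled, together with the node they ran on, so the hits can be compared against the nodes hosting LokiStack pods. It relies on the standard Kubernetes pod status fields (`status.containerStatuses[].lastState.terminated.reason`). The helper names `find_oomkilled` and `list_pods` are hypothetical, and it assumes `kubectl` is on the PATH and logged in to the cluster.

```python
import json
import subprocess


def find_oomkilled(pod_list):
    """Return (pod, node, container, restart_count) tuples for containers
    whose last termination reason was OOMKilled.

    pod_list is the parsed JSON of `kubectl get pods -o json`.
    """
    hits = []
    for pod in pod_list.get("items", []):
        name = pod.get("metadata", {}).get("name", "")
        node = pod.get("spec", {}).get("nodeName", "")
        for cs in pod.get("status", {}).get("containerStatuses", []):
            # lastState.terminated is absent if the container never restarted.
            terminated = cs.get("lastState", {}).get("terminated") or {}
            if terminated.get("reason") == "OOMKilled":
                hits.append(
                    (name, node, cs.get("name", ""), cs.get("restartCount", 0))
                )
    return hits


def list_pods(namespace):
    """Fetch the pod list for a namespace as a dict.

    Assumption: kubectl is installed and authenticated against the cluster.
    """
    out = subprocess.run(
        ["kubectl", "get", "pods", "-n", namespace, "-o", "json"],
        check=True,
        capture_output=True,
        text=True,
    ).stdout
    return json.loads(out)
```

For example, `find_oomkilled(list_pods("netobserv-privileged"))` would report the affected agent pods, assuming the eBPF agents run in the `netobserv-privileged` namespace; adjust the namespace to match the deployment.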

Discussions relating to this bug:


- image-2023-04-07-15-19-35-241.png (30 kB, 2023/04/07 10:19 PM)
- image-2023-04-07-15-20-00-557.png (126 kB, 2023/04/07 10:20 PM)
- image-2023-04-07-15-20-42-156.png (229 kB, 2023/04/07 10:20 PM)

relates to

NETOBSERV-717 Loki per_stream_rate_limit

Closed

split from

NETOBSERV-902 QE: Run performance tests for 1.2 release

Closed

NETOBSERV-1068 Run performance tests for 1.3 release

Closed

links to

netobserv/netobserv-ebpf-agent#119: NETOBSERV-975: avoid preallocating huge chunk of memory by default

openshift/openshift-docs#63934: Network Observability 1.4 Release Notes

RHSA-2023:116729 Network Observability 1.4.0 for OpenShift


Assignee: Julien Pinsonneau

Reporter: Nathan Weinberg

Need Info From: None

Contributors: None

Architect: None

QA Contact: Nathan Weinberg

Doc Contact: None

Votes: 0

Watchers: 6

Created: 2023/04/05 10:34 PM

Updated: 2025/07/29 5:35 PM

Resolved: 2023/08/10 5:18 PM
