Type: Bug
Resolution: Done
Priority: Major
Labels: netobserv-1.5-candidate
Sprint: NetObserv - Sprint 248
Severity: Important
NOTE: This is a different 429 error from the one described in NETOBSERV-975
Description of problem:
When running our large-scale PerfScale scenario with NetObserv 1.5, we are seeing a large number of dropped flows due to a Loki 429 error
Steps to Reproduce:
1. Deploy an OCP 4.14 cluster and scale it to 120 nodes.
2. Install NetObserv 1.5, the Loki Operator with a 1x.medium LokiStack, and the AMQ Streams Operator.
3. Run the cluster-density-v2 workload with a variable of 480 (see the sketch after this list).
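For reference, a minimal sketch of how step 3 is typically driven, assuming the kube-burner "ocp" wrapper is used and that "a variable of 480" maps to the workload's iterations parameter (the exact PerfScale invocation and flags are not recorded in this report):

  kube-burner ocp cluster-density-v2 --iterations=480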
Actual results:
Flows are dropped due to the following error (seen on various FLP pods):

  time=2024-01-23T19:31:21Z level=info component=client error=server returned HTTP status 429 Too Many Requests (429): Maximum active stream limit exceeded, reduce the number of active streams (reduce labels or reduce label values), or contact your Loki administrator to see if the limit can be increased, user: 'network' fields.level=warn fields.msg=error sending batch, will retry host=lokistack-gateway-http.netobserv.svc:8080 module=export/loki status=429
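This message means Loki's per-tenant active stream limit was hit, i.e. the label combinations written by FLP created more concurrently active streams than the 'network' tenant allows. As the error itself suggests, the options are to reduce label cardinality or raise the limit. A minimal workaround sketch, assuming the LokiStack is named "lokistack" in the "netobserv" namespace (consistent with the gateway host in the log above) and that the Loki Operator exposes maxGlobalStreamsPerTenant under spec.limits.global.ingestion; the value 25000 is illustrative only, not the fix adopted for this bug:

  oc patch lokistack lokistack -n netobserv --type=merge \
    -p '{"spec":{"limits":{"global":{"ingestion":{"maxGlobalStreamsPerTenant":25000}}}}}'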
Expected results:
No flows should be dropped
Additional Info:
This was seen in performance runs 0dc5303c-301d-4d1a-8c4c-0d7ef100b5dc and 911c279c-5c58-49b9-82ac-a61508262c44. The environment details from the latter are below, along with an attached must-gather; additional data from those runs can be found here.
OCP: 4.14.0-0.nightly-2024-01-18-061723
NetObserv operator: v1.5.0
Loki: v5.8.2
eBPF-agent: v1.5.0-76
FLP: v1.5.0-76
ConsolePlugin: v1.5.0-76
must-gather: https://drive.google.com/file/d/1kTxe4dElC_FJ5ipL_QINngNaNag_IuRU/view?usp=drive_link