Loading...

XML

Word

Printable

Type: Bug
Resolution: Duplicate
Priority: Undefined
Fix Version/s: None
Affects Version/s: 4.12.z
Component/s: Monitoring
Labels:
None

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
None
Regression:
No

Target Backport Versions:
None
Target Version:
None
Release Blocker:
None
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Test Coverage:

-

PX Priority Data:
PX Impact Score:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

After launching Thanos Querier, I noticed that Thanos consumes a lot of memory.
If memory limits were set to thanos, this situation led to OOM of thanos pods.
If memory limits were not set to thanos, In this situation thanos pods consumed all the memory of the node and eventually led to OOM of the node on which thanos pods were deployed.

Example explanation:
- If memory limits were applied to thanos are 2Gi:

  - It took too much time when made an API performance query from monitoring dashboard for a time range 15 min to 6 hours.
  - While when increased the time rage to 12 hours or more it was getting timed out and thanos pods were getting OOM kill.
  - When reduced the time range the pods were automatically getting stable until the time range was increased back to 12 hours or more.


- Tried increasing the memory limit for thanos to 4Gi:
  - Now Thanos pods were getting OOM killed when time rage was increased to 2 day or above.
  - Even for 1 day time range the console was facing issue loading the data.

Version-Release number of selected component (if applicable):

OCP: 4.12.11
Thanos: 0.28.1

How reproducible:

Steps to Reproduce:

1.
2.
3.

Actual results:

Thanos pod consume all the memory and are getting OOM killed.

Expected results:

Thanos pods should not consume too much memory

Additional info:

duplicates

OCPBUGS-3986 PromQL queries of the ""API Performance" dasboard can overload Thanos queriers

Closed

relates to

OCPBUGS-3986 PromQL queries of the ""API Performance" dasboard can overload Thanos queriers

Closed

Assignee:: Haoyu Sun

Reporter:: Aman Dev Verma

Need Info From:: None

Contributors:: None

QA Contact:: Junqi Zhao

Doc Contact:: None

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: 2023/10/26 4:44 PM

Updated:: 2025/07/25 5:31 AM

Resolved:: 2023/10/31 10:53 AM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates

Hide