Loading...

XML

Word

Printable

Type: Feature Request
Resolution: Won't Do
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: Monitoring
Labels:

Target Version:
None
Activity Type:
Product / Portfolio Work
Status Summary:
None
Blocked:
False
Blocked Reason:
None
Products:
None
Hierarchy Progress Bar:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Review Complete:
None
PX Impact Score:
PX Impact Range:
PX Priority Data:
PX Technical Impact:
PX Technical Impact Notes:
None
PX Scheduling Request:
None

1. Proposed title of this feature request
Provide ability to horizontally scale Prometheus

2. What is the nature and description of the request?
Today a Prometheus instance scrapes all the endpoints, which limits the number of endpoints/series that can be collected. This request is to provide a way of addressing this limitation so that resources, especially memory, required by a single Prometheus instance stay reasonable.

3. Why does the customer need this? (List the business requirements here)
Running a big cluster (300 nodes made of big bare-metal servers) to support lots of jobs getting created at the same time Prometheus is currently configured with 500GB and still gets OOM killed time to time when job pods get in crashloopback for whatever reason.

4. List any affected packages or components.
Monitoring/Prometheus

links to

Prometheus OOM investigation

Assignee:: Roger Florén

Reporter:: Frederic Giloux (Inactive)

Need Info From:: None

Votes:: 2 Vote for this issue

Watchers:: 19 Start watching this issue

Created:: 2020/11/24 11:45 AM

Updated:: 2025/09/13 6:25 PM

Resolved:: 2023/03/27 9:43 AM

Target start:: None

Target end:: None

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates