Type: Feature
Resolution: Unresolved
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: PM Monitoring
Labels:
- pm_ack+

Blocked: False
Blocked Reason: None
Ready: False
Color Status: Not Selected

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

Proposed title of this feature request

Clarify scalability expectations and support

What is the nature and description of the request?

We allow customers to create several kinds of resources (e.g. PrometheusRules in user namespaces). We have seen customers create such a large number of resources that the monitoring stack hits scalability limits. For example, when our ruler deployment has to evaluate several thousand alerting rules with expensive expressions (say, aggregating the last 6 hours of samples), evaluation simply takes too long.
We will of course do our best to support customers by suggesting strategies to reduce this load, or other workarounds. However, some customers are unable (or, rarely, unwilling) to consider alternative approaches.
It would be great if we could clarify that our system allows configurations that put significant pressure on individual components. Unfortunately, we won't be able to resolve every customer's needs in that respect.
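
To make the "takes too long" case concrete, here is a rough back-of-the-envelope sketch. It assumes the rules in a group are evaluated one after another and that each evaluation pass has to finish within the group's interval. The `RuleGroup` type, the helper names, and all numbers (2000 rules, 0.25 s per rule, 30 s interval) are hypothetical and only illustrate the arithmetic; they are not measurements from our stack.

```python
from dataclasses import dataclass


@dataclass
class RuleGroup:
    """Hypothetical description of one rule group (not a real API type)."""
    name: str
    rule_count: int
    avg_rule_seconds: float  # assumed average cost of evaluating one rule
    interval_seconds: float  # how often the group is scheduled


def estimated_duration(group: RuleGroup) -> float:
    """Estimate one evaluation pass, assuming rules run sequentially."""
    return group.rule_count * group.avg_rule_seconds


def fits_interval(group: RuleGroup) -> bool:
    """True if the estimated pass finishes before the next one is due."""
    return estimated_duration(group) <= group.interval_seconds


# Illustrative numbers only: 2000 expensive rules at 0.25 s each, every 30 s.
heavy = RuleGroup("tenant-a", rule_count=2000, avg_rule_seconds=0.25,
                  interval_seconds=30.0)
print(estimated_duration(heavy), fits_interval(heavy))
```

Under these assumed numbers a single pass would need 500 s against a 30 s interval, so splitting rules across groups or cheapening the expressions are the kind of workarounds we would suggest.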

We should also add to and improve our alerts on the functioning of our stack. Customers should not be able to hit scalability limits without getting an alert on the way.
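
One possible shape for such an alert, sketched as plain Python rather than as an actual rule definition: compare each rule group's last evaluation duration with its interval, warn when it gets close, and go critical when it exceeds it. Prometheus exposes these values as `prometheus_rule_group_last_duration_seconds` and `prometheus_rule_group_interval_seconds`; whether our ruler deployment exposes the same series, and the 0.8 warning ratio, are assumptions here. `GroupSample` and `saturation_alerts` are hypothetical names.

```python
from typing import NamedTuple


class GroupSample(NamedTuple):
    """Hypothetical snapshot of one rule group's evaluation metrics."""
    group: str
    last_duration_seconds: float
    interval_seconds: float


def saturation_alerts(samples, warn_ratio=0.8):
    """Return (group, severity) for groups nearing or exceeding their interval.

    warn_ratio is an assumed threshold, not an agreed-upon value.
    """
    alerts = []
    for s in samples:
        ratio = s.last_duration_seconds / s.interval_seconds
        if ratio >= 1.0:
            # Evaluation takes longer than the interval: passes get missed.
            alerts.append((s.group, "critical"))
        elif ratio >= warn_ratio:
            # Still keeping up, but with little headroom left.
            alerts.append((s.group, "warning"))
    return alerts


samples = [
    GroupSample("small", 10.0, 30.0),
    GroupSample("busy", 27.0, 30.0),
    GroupSample("overloaded", 45.0, 30.0),
]
print(saturation_alerts(samples))
```

The warning tier is the point of the request: the customer gets a signal while evaluation still keeps up, before the limit is actually reached.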

Why does the customer need this? (List the business requirements)

A clearer understanding of what they can expect from our stack.

List any affected packages or components.

Alerting rules, Documentation

is related to

MON-2851 Insight into scalability bottlenecks

Closed

Assignee: Roger Florén

Reporter: Jan Fajerski

Votes: 0

Watchers: 4

Created: 2022/10/06 2:26 PM

Updated: 2024/05/03 3:25 PM
