-
Bug
-
Resolution: Duplicate
-
Major
-
None
-
None
-
None
-
None
-
5
-
False
-
-
False
-
Yes
-
MK - Sprint 232
WHAT
Looking at the metrics of the `prometheus-kafka-prometheus-0` pod in the `managed-application-services-observability` namespace on the production OSD instance, the memory and CPU limits appear to be lower than what the system actually requires.
WHY
While reviewing the Single AZ cluster and its requirements for the additional nodes, it was observed that most of the pods in the `managed-application-services-observability` namespace do not define explicit CPU/memory limits, and even where limits are specified they appear to understate the actual need. In a tightly packed system like Single AZ, other pods can be scheduled onto this node and claim all of the CPU and memory, leaving no resources for the observability workloads.
HOW
Correct the CPU/memory requirements for the pods in question. We can look at the usage over time and take the historical maximum plus a buffer as the new limit. Make sure that the limits are reasonable for the node size.
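As a rough illustration of the approach above, the sketch below queries the Prometheus HTTP API for the pod's historical maximum memory and CPU usage and applies a buffer to propose new limits. The Prometheus URL, lookback window, and buffer factor are assumptions and would need to be adjusted for the actual environment; the metric names are the standard cAdvisor ones.

```python
#!/usr/bin/env python3
"""Sketch: derive proposed resource limits from historical usage.

Assumes the in-cluster Prometheus is reachable at PROM_URL (e.g. via
`kubectl port-forward`); LOOKBACK and BUFFER are illustrative values.
"""
import requests

PROM_URL = "http://localhost:9090"  # assumed port-forwarded Prometheus
NAMESPACE = "managed-application-services-observability"
POD = "prometheus-kafka-prometheus-0"
LOOKBACK = "30d"   # historical window to inspect (assumption)
BUFFER = 1.25      # 25% headroom on top of the observed maximum (assumption)

QUERIES = {
    # Peak working-set memory over the lookback window, in bytes.
    "memory_bytes": (
        f'max_over_time(container_memory_working_set_bytes{{'
        f'namespace="{NAMESPACE}",pod="{POD}",container!=""}}[{LOOKBACK}])'
    ),
    # Peak CPU usage over the lookback window, in cores (subquery over 5m rates).
    "cpu_cores": (
        f'max_over_time(rate(container_cpu_usage_seconds_total{{'
        f'namespace="{NAMESPACE}",pod="{POD}",container!=""}}[5m])[{LOOKBACK}:5m])'
    ),
}

for name, query in QUERIES.items():
    resp = requests.get(f"{PROM_URL}/api/v1/query", params={"query": query})
    resp.raise_for_status()
    for result in resp.json()["data"]["result"]:
        container = result["metric"].get("container", "<pod>")
        observed_max = float(result["value"][1])
        proposed = observed_max * BUFFER
        print(f"{container} {name}: observed max {observed_max:.2f}, "
              f"proposed limit {proposed:.2f}")
```

The proposed values would then be sanity-checked against the node size before being applied to the component's resource settings.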
- is duplicated by: MGDSTRM-9546 Ensure that all Observability Stack components have resource settings (Backlog)