Loading...

XML

Word

Printable

Type: Task
Resolution: Obsolete
Priority: Major
Fix Version/s: None
Affects Version/s: None
Component/s: prometheus-adapter
Labels:
- groomed
- upstream

Blocked:
False
Ready:
False
Epic Link:
MON-1277
Docs QE Status:
NEW
QE Status:
NEW
Release Note Text:
Undefined
Market:

Sprint:
Monitoring - Sprint 204

SFDC Cases Links:
SFDC Cases Counter:
SFDC Cases Open:

Since 4.8, prometheus-adapter is now highly available in OpenShift, but only one of the two instances of prometheus-adapter is responding to requests from the apiserver. This is caused by the kube-aggregator not loadbalancing requests between the replicas behind the API service.

This has the following impacts:

availability problems - if the active prometheus-adapter has a failure it can take some time for apiserver to notice this.
wastes of resources - requires allocating resources needed to serve API for both instances, but only one is doing the job.

FWIW, I tried adding Service load balancing, but proxying doesn't seem to go through it as I would have expected and queries are still rooted to the same instance, so this shouldn't solve the issue.

Upstream issue in metrics-server: https://github.com/kubernetes-sigs/metrics-server/issues/764

Since we are not responsible for the kube-aggregator, it might be a good idea to discuss this with the apiserver team.

Assignee:: Damien Grisonnet

Reporter:: Damien Grisonnet

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Created:: 2021/06/29 10:28 AM

Updated:: 2022/03/16 4:29 PM

Resolved:: 2021/07/20 12:45 PM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates