Type: Story
Resolution: Done
Priority: Critical
Fix Version/s: None
Affects Version/s: None
Labels: None
Activity Type: None
Blocked: False
Blocked Reason: None
Ready: False
Epic Link: None
Story Points: None
Target Version: None
Release Blocker: None
Sprint: None

Hive controllers should settle down to a low roar in steady state.

When a config change happens, we expect a spike, which should last a relatively short time, then settle back to steady state.

If we can figure out what "steady state" and such spikes look like, we should alert when a spike lasts longer than we expect. That can point to bugs in controllers, such as the MachinePool ownedLabels/ownedTaints thrash from ITN-2024-00101 / HIVE-2541.
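To make the "spike lasts longer than we expect" idea concrete, here is a minimal sketch that scans timestamped queue-depth samples and reports spikes that outlast an allowed duration. The names `find_long_spikes`, `steady_state_max`, and `max_spike_seconds` are hypothetical, and the thresholds are placeholders; real values would have to come from observing what steady state actually looks like.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Tuple


@dataclass
class Spike:
    start: float  # timestamp (seconds) of the first sample above the threshold
    end: float    # timestamp (seconds) of the last sample above the threshold

    @property
    def duration(self) -> float:
        return self.end - self.start


def find_long_spikes(
    samples: Iterable[Tuple[float, int]],
    steady_state_max: int,      # assumption: depth at or below this is "steady state"
    max_spike_seconds: float,   # assumption: how long a spike is allowed to last
) -> List[Spike]:
    """Return spikes whose duration exceeds max_spike_seconds.

    samples is an iterable of (timestamp_seconds, queue_depth) in time order.
    """
    long_spikes: List[Spike] = []
    start: Optional[float] = None
    last: Optional[float] = None

    for ts, depth in samples:
        if depth > steady_state_max:
            # Inside a spike: remember where it began and the latest sample seen.
            if start is None:
                start = ts
            last = ts
        elif start is not None:
            # Spike just ended: keep it only if it ran too long.
            spike = Spike(start, last)
            if spike.duration > max_spike_seconds:
                long_spikes.append(spike)
            start = None

    # A spike still in progress at the end of the data also counts.
    if start is not None:
        spike = Spike(start, last)
        if spike.duration > max_spike_seconds:
            long_spikes.append(spike)

    return long_spikes
```

In practice this kind of rule would more likely live in the alerting system than in application code; the sketch only shows the logic of "above threshold for too long".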

Ideally this metric would track the time between when a request is queued and when it is serviced. That's Hard™. But we should be able to track basic controller things like queue depth. One problem we saw was that these upstream controller metrics didn't seem to be available via hive!
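As an illustration of the two measurements mentioned above (queue-to-service time and queue depth), here is a toy sketch of a queue that records both. `TimedQueue` is a hypothetical class written for this example; it is not Hive's or the upstream controller library's work queue, and it ignores concurrency, deduplication, and retries.

```python
import time
from collections import deque
from typing import Callable, Deque, Hashable, List, Tuple


class TimedQueue:
    """FIFO queue that records how long each item waited before being serviced."""

    def __init__(self, clock: Callable[[], float] = time.monotonic) -> None:
        self._clock = clock  # injectable so the behaviour can be tested deterministically
        self._items: Deque[Tuple[Hashable, float]] = deque()
        self.wait_times: List[float] = []  # seconds between add() and get(), per item

    def add(self, key: Hashable) -> None:
        # Stamp the item with the time it was queued.
        self._items.append((key, self._clock()))

    def get(self) -> Hashable:
        # Service the oldest item and record how long it sat in the queue.
        key, queued_at = self._items.popleft()
        self.wait_times.append(self._clock() - queued_at)
        return key

    def depth(self) -> int:
        # Number of items currently waiting.
        return len(self._items)
```

A real controller would export `depth()` as a gauge and the wait times as a histogram rather than keeping them in a list.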

depends on

ACM-13152 Separate MachinePool controller and make it shardable (Review)

HIVE-2537 Separate MachinePool controller and make it shardable (Closed)

links to

https://gitlab.cee.redhat.com/service/app-interface/-/merge_requests/112654

openshift/hive#2355: Remove redundant import of generic admission server cmd

Assignee: Suhani Mehta
Reporter: Eric Fried
Need Info From: None
Contributors: None
QA Contact: None
Doc Contact: None
Votes: 0
Watchers: 7
Created: 2024/06/18 6:09 PM
Updated: 2024/08/07 11:41 PM
Resolved: 2024/07/25 4:26 PM
