Type: Spike
Resolution: Done
Priority: Undefined
Fix Version/s: None
Affects Version/s: None
Labels:
None

Blocked:
False
Blocked Reason:
None
Ready:
False
Epic Link:
MCO state reporting
Feature Link:
OCPSTRAT-845 - [Tech Preview] Proper MCO State Reporting
Intelligence Requested:
Market:

Sprint:
MCO Sprint 232, MCO Sprint 233
Cost of Delay:
0
WSJF:
0

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

Background:

Right now it is really difficult to figure out which node is the "problem node" in certain states, because the MachineConfigPool only shows node counts, not which specific nodes are affected.

You can tell that a node has a problem, but it's hard to tell which node it is without inspecting the conditions on every node (and if it's something that doesn't surface in `oc get node`, like DiskPressure, woe betide you).
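To illustrate the manual inspection described above, here is a minimal sketch that scans node conditions and reports only the nodes with something wrong. It assumes the input has the shape of `oc get nodes -o json` (a list of nodes, each with `status.conditions`), and the sample data and function names are hypothetical, not part of the MCO.

```python
# Healthy status for the standard Kubernetes node conditions.
HEALTHY_STATUS = {
    "Ready": "True",
    "MemoryPressure": "False",
    "DiskPressure": "False",
    "PIDPressure": "False",
    "NetworkUnavailable": "False",
}


def problem_conditions(node):
    """Return the condition types on one node that are not in their healthy state."""
    conditions = node.get("status", {}).get("conditions", [])
    return [
        c["type"]
        for c in conditions
        if c.get("type") in HEALTHY_STATUS
        and c.get("status") != HEALTHY_STATUS[c["type"]]
    ]


def problem_nodes(node_list):
    """Map node name -> unhealthy condition types, omitting healthy nodes."""
    result = {}
    for node in node_list.get("items", []):
        bad = problem_conditions(node)
        if bad:
            result[node["metadata"]["name"]] = bad
    return result


# Hypothetical sample in the shape of `oc get nodes -o json`.
SAMPLE_NODE_LIST = {
    "items": [
        {
            "metadata": {"name": "worker-0"},
            "status": {"conditions": [
                {"type": "Ready", "status": "True"},
                {"type": "DiskPressure", "status": "False"},
            ]},
        },
        {
            "metadata": {"name": "worker-1"},
            "status": {"conditions": [
                {"type": "Ready", "status": "True"},
                {"type": "DiskPressure", "status": "True"},
            ]},
        },
    ]
}

if __name__ == "__main__":
    for name, bad in problem_nodes(SAMPLE_NODE_LIST).items():
        print(f"{name}: {', '.join(bad)}")
```

Note that worker-1 is still Ready here, which is why a condition like DiskPressure is easy to miss in the default node listing.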

Some customers have a very large number of nodes, so we have to be mindful of etcd space and avoid making our pool output noisy.

I think it makes sense to be conservative and only show nodes with "problems", but we need to be mindful that if we consider "not updated yet" to be a problem, the list will reset every time we roll out a new MachineConfig.
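One possible shape for such a conservative list, as a rough sketch only: skip nodes that are merely waiting for or undergoing an update, and cap the number of entries so large clusters do not bloat the pool object. The state names, the `NodeState` type, the output field names, and the `max_entries` cap are all hypothetical and are not existing MCO API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeState:
    name: str
    state: str          # hypothetical state label for illustration
    reason: str = ""


# States that are expected during a normal rollout and are NOT treated as
# problems, so the list does not reset on every new MachineConfig.
EXPECTED_STATES = {"updated", "pending-update", "updating"}


def summarize_problem_nodes(nodes, max_entries=5):
    """Return a bounded summary of nodes in unexpected states."""
    problems = sorted(
        (n for n in nodes if n.state not in EXPECTED_STATES),
        key=lambda n: n.name,
    )
    shown = problems[:max_entries]
    return {
        "problemNodes": [
            {"name": n.name, "state": n.state, "reason": n.reason} for n in shown
        ],
        # Count of problem nodes left out of the list to keep it small.
        "omittedProblemNodes": len(problems) - len(shown),
    }


if __name__ == "__main__":
    example = [
        NodeState("worker-3", "degraded", "disk full"),
        NodeState("worker-1", "pending-update"),
        NodeState("worker-2", "degraded", "drain failed"),
        NodeState("master-0", "updated"),
    ]
    print(summarize_problem_nodes(example, max_entries=1))
```

The overflow counter is one way to signal that more nodes are affected without storing every entry; whether the data lives on the pool or in a separate CRD is left open here.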

Some Options:

Add node states to the pool CRD
New CRD containing pool states
Add some sugar to `oc` to make state easier to retrieve

Goal:

Figure out an efficient/non-noisy/non-annoying way to clearly show which nodes are in bad states

Done When:

We can write the story to do the actual work of adding the node state display

Assignee: Charles Doern

Reporter: John Kyros

Votes: 0

Watchers: 3

Created: 2023/01/24 3:26 AM

Updated: 2023/03/27 12:10 AM

Resolved: 2023/03/27 12:10 AM
