Story
Resolution: Unresolved
OSDOCS-2427 landed docs recommending that cluster admins manually check for firing critical alerts before updating. I'm not particularly excited about such a blanket policy, because non-OCP components can create their own critical alerts. But I don't mind surfacing firing critical alerts as an "in case you're interested..." thing, and letting customer admins decide what to do with that additional context.

One strategy that OTA-1272 would unlock is a conditional update risk in every release for the presence of firing critical alerts. This could live at the Cincinnati level, but that might confuse folks who expect conditionalEdges to cover risks where we actually understand some kind of cause-and-effect chain between the measured state and how it might negatively impact a cluster that chooses to update anyway. We could instead inject the risk in the CVO, although that removes our ability to automatically apply the risk to existing releases, and it removes our ability to dynamically tune the risk post-release if we decide it is too annoying. One potential middle ground would be to:
- Have a well-known risk name (GenericAlerts?).
- Have the CVO inject a first-guess rule like {{group(ALERTS{severity="critical"}) ...}} so Cincinnati doesn't have to (unless we want to backfill something for older releases via Cincinnati).
- If we decide the CVO's baked-in rule is broken, we could fix it (in new releases) and have Cincinnati serve a better GenericAlerts rule to existing releases.
- If a CVO saw a GenericAlerts rule from Cincinnati for an update, it would prefer the matcher Cincinnati was recommending over its baked-into-the-CVO rule (see the sketch after this list).
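For concreteness, here is a minimal Go sketch of that precedence. The types and names (riskRule, effectiveRules) are hypothetical stand-ins, not the real openshift/api or CVO structures, and the tuned namespace-scoped matcher in the example is just an illustration of the kind of post-release adjustment Cincinnati could serve:

{code:go}
package main

import "fmt"

// riskRule is a hypothetical stand-in for a conditional update risk's
// matching rule: a well-known name plus a PromQL expression.
type riskRule struct {
	Name   string // well-known risk name, e.g. "GenericAlerts"
	PromQL string // matcher evaluated against the cluster's monitoring stack
}

// bakedInRules is the CVO's first-guess rule, compiled into the release.
var bakedInRules = []riskRule{
	{
		Name:   "GenericAlerts",
		PromQL: `group(ALERTS{severity="critical"})`,
	},
}

// effectiveRules prefers a rule served by Cincinnati over the CVO's
// baked-in rule of the same name, so a broken or too-noisy matcher can
// be tuned post-release without shipping a new CVO.
func effectiveRules(cincinnati []riskRule) []riskRule {
	fromCincinnati := make(map[string]riskRule, len(cincinnati))
	for _, r := range cincinnati {
		fromCincinnati[r.Name] = r
	}
	rules := make([]riskRule, 0, len(bakedInRules))
	for _, baked := range bakedInRules {
		if served, ok := fromCincinnati[baked.Name]; ok {
			rules = append(rules, served) // Cincinnati's matcher wins
			continue
		}
		rules = append(rules, baked) // fall back to the baked-in guess
	}
	return rules
}

func main() {
	// Hypothetical: Cincinnati serving a tuned matcher to an existing
	// release, scoping the check to OCP components to address the
	// "non-OCP components fire their own critical alerts" concern.
	served := []riskRule{{
		Name:   "GenericAlerts",
		PromQL: `group(ALERTS{severity="critical",namespace=~"openshift-.*"})`,
	}}
	for _, r := range effectiveRules(served) {
		fmt.Printf("%s: %s\n", r.Name, r.PromQL)
	}
}
{code}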
Thoughts?