Loading...

XML

Word

Printable

Type: Feature
Resolution: Won't Do
Priority: Blocker
Fix Version/s: 2021-M2
Affects Version/s: None
Component/s: Camel-K
Labels:
- Enhancement
- mcs
- rhmi

Regression Test:
Todo
Target Release:

Camel-K-M4
Git Pull Request:
https://github.com/apache/camel-k/pull/1839

SFDC Cases Links:
SFDC Cases Counter:
SFDC Cases Open:

What

Create standard operating procedures (SOPs) for addressing breaches of SLOs

Why

So SRE can investigate the cause of an Alert without needing extensive Service specific knowledge, and ultimately get the Service back into a good state before the SLO is breached

How

A SOP, in the context of RHMI Monitoring & Alerting, is a document that has a clear set of steps to troubleshoot why an Alert might be firing, and how to fix the problem. SOPs should assume the reader has a high level of OpenShift & Kubernetes knowledge, but doesn’t have much, if any, service specific knowledge. Any service specific terms or concepts relevant to the Alert should be clearly defined and explained how they are relevant to the firing Alert. The SOP should specify how to verify the issue is fixed after taking remedial action.
An example SOP can be seen in the Appendix.

Futher Information:

Write SOPs

relates to

ENTESB-13661 Camel K operator Level 4

Closed

Assignee:: Antonin Stefanutti

Reporter:: David Ffrench

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2020/07/16 4:09 PM

Updated:: 2024/10/14 1:40 PM

Resolved:: 2021/03/23 10:20 AM

Details

Description

What

Why

How

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates