-
Feature
-
Resolution: Duplicate
-
Normal
-
None
-
None
-
False
-
None
-
False
-
200
-
0.5
-
50% (Low)
-
3
-
16.67
User Story:
As an OpenShift user I'd like to be able to swiftly identify actions I need to take in order to aleviate cluster problems. Alert storms and overload of information make it hard to navigate to the root cause of issues of my clusters. As an OpenShift user I'd liek to be presented with the most significant alert or event that is causing the storm or an event that is a potential root cause of multiple alerts.
Goals:
inecas@redhat.com presented a prototype for statistical/ML reduction of alert noise based on alert grouping/clustering. The demo is available here: https://drive.google.com/file/d/1b_MffzMCquVbVHdCGlFnxxf4fJvLPD4L/view and https://drive.google.com/file/d/1jIKIXhhZtmeU4d8s9FeZrhJ_K9RXATBo/view
Crawl: Identify integration of above work with OCP Observability team and define integration story.
Walk: TBD (Customer facing!)