Type: Story
Resolution: Duplicate
Priority: Major
As SREP engineers, we don't want to be notified of alerts for clusters in limited support. The usual approach of silencing a cluster altogether doesn't work for alerts that fire from a Hive instance but concern a cluster that the operator manages. One example is the alert for ClusterSync failures.
We need a way to silence or prevent these alerts. The current proposal is to create a metric that indicates whether a cluster is in limited support, which could then be used in the alert expression.
The discussion is still open: if there is another way to silence limited support (LS) clusters, I'm happy to discuss it. Either way, this remains an important requirement for this kind of alert.
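To make the proposal concrete, here is a minimal sketch of the filtering that such a metric would enable. It is a model of the idea only, not Hive code: the `Sample` type, the `suppress_limited_support` helper, and the `namespaced_name` join label are all assumptions made for illustration, and the limited-support metric does not exist yet. In an actual alert rule, the same effect would presumably come from PromQL's `unless on (...)` operator applied between the failure metric and the new metric.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Sample:
    """One time series sample: a label set plus its current value (hypothetical shape)."""
    labels: Dict[str, str]
    value: float


def suppress_limited_support(
    failing: List[Sample],
    limited_support: List[Sample],
    join_label: str = "namespaced_name",  # assumed label shared by both metrics
) -> List[Sample]:
    """Keep only failing samples whose cluster is not flagged as limited support.

    Models `failing unless on (join_label) (limited_support == 1)`.
    """
    # Clusters for which the hypothetical limited-support metric reports 1.
    flagged = {
        s.labels.get(join_label, "")
        for s in limited_support
        if s.value == 1
    }
    return [s for s in failing if s.labels.get(join_label, "") not in flagged]


# Two clusters are failing to sync; only cluster-a is in limited support.
failing = [
    Sample({"namespaced_name": "ns-a/cluster-a"}, 7200),
    Sample({"namespaced_name": "ns-b/cluster-b"}, 3600),
]
limited = [
    Sample({"namespaced_name": "ns-a/cluster-a"}, 1),
    Sample({"namespaced_name": "ns-b/cluster-b"}, 0),
]

alerting = suppress_limited_support(failing, limited)
print([s.labels["namespaced_name"] for s in alerting])  # ['ns-b/cluster-b']
```

Doing the suppression in the alert expression means the alert never fires for a limited support cluster, so it does not depend on a separate silence being created and maintained for each cluster.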
Done Criteria:
- A way to prevent alerts associated with a cluster in limited support from firing is identified or implemented
Issue Links:
- depends on
  - HIVE-2344 Enhancement: Redesign metrics (Closed)
- is blocked by
  - HIVE-2608 Implement MetricsConfig redesign (Closed)
- relates to
  - HIVE-2642 Extend support for minDuration and additional labels to all metrics that can support them (To Do)
  - HIVE-2659 Add clusterDeploymentLabelSelector support (To Do)
  - HIVE-1857 Identify individual clusters failing cluster sync (Closed)
  - HIVE-2286 Identify a reason in the ClusterSyncFailingSeconds metric (Closed)
- links to