-
Feature Request
-
Resolution: Done
-
Minor
-
None
-
None
-
False
-
False
-
Undefined
-
-
-
-
-
-
1. Proposed title of this feature request
Have additional information with alert etcdMembersDown
2. What is the nature and description of the request?
This request is to provide additional information when the alert etcdMembersDown is triggered.
The current alert definition is missing any information on the etcd members that went down. Ideally this would be provided as part of the alert. As it would require a change in the alert another option may be to provide a quick way for getting to this information in the description. Something like run "$ oc get pods -n openshift-etcd | grep -v etcd-quorum-guard | grep etcd" and check the values in the status, age or restarts columns.
3. Why does the customer need this? (List the business requirements here)
This is a critical alert. In case of quorum loss no write nor read can be performed. Customers need to react quickly and that would help them with that.
4. List any affected packages or components.
This single alert definition.
- is incorporated by
-
OCPSTRAT-454 improve etcd dashboard, alerts & metrics
- Closed
- is related to
-
OCPPLAN-6250 [etcd Spike] Increase the overall quality for OpenShift's OOTB alerting rules
- Closed