- Task
- Resolution: Done
- Major
OTA 245, OTA 246, OTA 247
During the incident, only one high-severity alert, PEHighLatency, triggered. AppSRE was not paged; SRE-P was paged via client-side alerts.
We need to create SLO burn-rate alerts that match the SLO document.
Compare our alert settings with the client-side alerts, and ensure that when the service is unavailable, AppSRE and the Cincinnati team are paged before SRE-P is paged.
RCA document: link
Definition of done:
- Create SLO burn-rate alerts matching the SLO document
- Ensure the alerts have an executable runbook and a working Grafana dashboard
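
For reference, a minimal sketch of the burn-rate check such an alert would encode. This is not the actual rule: the SLO target used in the example is hypothetical (the real value comes from the SLO document), the function names are illustrative, and the 14.4 threshold is the commonly used fast-burn paging value for a 30-day window (2% of the error budget spent in 1 hour), assumed here rather than taken from this ticket.

```python
def burn_rate(error_ratio: float, slo_target: float) -> float:
    """Return how many times faster than allowed the error budget is being spent.

    error_ratio: fraction of failed requests in the window (0.0 to 1.0).
    slo_target:  availability objective, e.g. 0.999 (hypothetical value).
    """
    error_budget = 1.0 - slo_target
    return error_ratio / error_budget


def should_page(long_window_error_ratio: float,
                short_window_error_ratio: float,
                slo_target: float,
                threshold: float = 14.4) -> bool:
    """Page only if both windows exceed the burn-rate threshold.

    The long window (e.g. 1h) shows the burn is significant; the short
    window (e.g. 5m) shows it is still happening, so the alert resets
    quickly once the service recovers.
    """
    return (burn_rate(long_window_error_ratio, slo_target) > threshold
            and burn_rate(short_window_error_ratio, slo_target) > threshold)


if __name__ == "__main__":
    # Hypothetical 99.9% SLO with 2% of requests failing in both windows.
    print(should_page(0.02, 0.02, slo_target=0.999))
    # Same long-window errors, but the short window has recovered.
    print(should_page(0.02, 0.001, slo_target=0.999))
```

Requiring both windows keeps the page from firing on a short blip and from lingering after recovery, which matters if AppSRE and the Cincinnati team are to be paged ahead of the client-side alerts.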
- is related to: OTA-769 create alert for conditions that caused 2022-08-25 OSUS incident (Closed)