-
Story
-
Resolution: Done
-
Normal
-
None
-
Future Sustainability
-
2
-
False
-
-
False
-
None
-
Unset
-
None
-
-
Alert on a high number of org-id lookup errors
We can/should expect some failures
We need to create an alert that watches for an increase in the number of org-id lookup failures. Cloud-Connector has a metric (cloud_connector_auth_gateway_account_lookup_status_code_counter [1]) that records the response from the auth-gateway (the auth-gateway looks up the org-id for cloud-connector).
The alert should probably watch for an increase in the number of counts where the `status_code` is any 500 http code.
[1] https://github.com/RedHatInsights/cloud-connector/blob/master/internal/controller/metrics.go#L37