Some suggestions from the support team regarding monitoring that would be helpful for troubleshooting customer issues. These can each be useful for both support and customers.
- backend-worker (doesn't seem to be any way for us to monitor this currently and in fact this would have value for the customer in case they needed to scale up replicas)
- system-sidekiq (currently there is only the UI and rails console access, we could publish the sidekiq API and monitor those endpoints. Again this would bring value to customer also)
- There is an internal API on backend which system uses and it could be useful to monitor failed requests which would indicate data inconsistency (failure to synchronise data from system to backend)
- Zync makes outbound requests to an OpenID Connect provider server which can result in failures (this one is actually more useful for customers to monitor than for support to use for debugging)
|Add monitoring for the internal API endpoints||Closed||Unassigned|