Details
Task
Resolution: Done
Major
None
None
Description
Document SOPs for resolving issues with the ElastiCache Redis instance used for Quay's database model cache. Add Prometheus alerting rules to catch these issues early.
Possible scenarios include:
- High Redis CPU
- Drastic drop in cache hit rate
- Drastic drop in Redis network bytes out (meaning the cache is not responding)
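
The three scenarios above could be checked with threshold logic along the following lines. This is only a minimal Python sketch of the conditions an alerting rule might encode, not the actual Prometheus rules. The field names, the alert names, and the thresholds (90% CPU, 50% relative drop) are all assumptions chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class RedisSample:
    """Cache metrics over one sample window (field names are hypothetical)."""
    cpu_percent: float        # Redis CPU utilization, 0-100
    cache_hits: float         # cache hits during the window
    cache_misses: float       # cache misses during the window
    network_bytes_out: float  # bytes sent by Redis during the window


def hit_rate(sample):
    """Fraction of lookups served from the cache; 0.0 if there were no lookups."""
    total = sample.cache_hits + sample.cache_misses
    return sample.cache_hits / total if total > 0 else 0.0


def relative_drop(baseline, current):
    """Fractional decrease from baseline to current, clamped to be >= 0."""
    if baseline <= 0:
        return 0.0
    return max(0.0, (baseline - current) / baseline)


def evaluate(baseline, current, cpu_limit=90.0, drop_limit=0.5):
    """Return the names of the (hypothetical) alerts that would fire.

    cpu_limit and drop_limit are assumed values, not tuned thresholds.
    """
    alerts = []
    if current.cpu_percent > cpu_limit:
        alerts.append("HighRedisCPU")
    if relative_drop(hit_rate(baseline), hit_rate(current)) > drop_limit:
        alerts.append("CacheHitRateDrop")
    if relative_drop(baseline.network_bytes_out,
                     current.network_bytes_out) > drop_limit:
        alerts.append("NetworkBytesOutDrop")
    return alerts
```

Here `baseline` would be a sample from a known-healthy period and `current` the most recent window. Comparing against a baseline rather than a fixed value is one way to express a "drastic drop". A real rule would need thresholds tuned to Quay's observed traffic.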