Uploaded image for project: 'Project Quay'
  1. Project Quay
  2. PROJQUAY-9224

Update on-call runbooks with specific troubleshooting steps for `HTTP 500` authentication failures.

XMLWordPrintable

    • Icon: Task Task
    • Resolution: Can't Do
    • Icon: Normal Normal
    • None
    • None
    • quay.io
    • False
    • Hide

      None

      Show
      None
    • False

      *Issue:* The on-call engineer had to manually diagnose that a Kinesis key rotation was the cause of `HTTP 500` errors on authentication requests, as this information was not available in any runbook.
      *Corrective Action:* Update the primary on-call runbook to include a troubleshooting guide for authentication failures. This guide should list potential causes like credential rotation or logging failures, along with commands and dashboard links to investigate each one.
      *Result:* This will decrease the time it takes for an on-call engineer to diagnose and resolve common critical errors.

              Unassigned Unassigned
              doconnor@redhat.com Dave O'Connor
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated:
                Resolved: