Uploaded image for project: 'Hybrid Cloud Console'
  1. Hybrid Cloud Console
  2. RHCLOUD-43429

Update Kessel Operations dashboard to track operational KPIs

XMLWordPrintable

    • Product / Portfolio Work
    • 3
    • False
    • Hide

      None

      Show
      None
    • False
    • None
    • Unset
    • None

      https://docs.google.com/spreadsheets/d/143Nt09pGJERwpEEyP-LpnVMlU32cI-rosMZLFg2s2so/edit?gid=0#gid=0

      Dashboard: https://grafana.stage.devshift.net/d/kessel-ops/kessel-operations-dashboard

      • [DONE] I can see the performance of all Kessel APIs (requests, errors)
      • [DONE] I can see the performance of all Kessel APIs (latencies)
      • [TODO] Health of services (RBAC, Relations, Inventory)
      • [TODO] I can determine the health of Kessel operators (resource usage, uptime)
        • Operators: Clowder, SpiceDB
          Two gauges per operator: Reconciler health, all reconciler resources are working correctly
      • [TODO] Events
        • Gauge exposed by Inventory. Total attempts to enqueue vs successes.
        • Should cover external SP events like HBI outbox

              anatale.openshift Antony Natale
              rh-ee-tcreller Tyler Creller
              Votes:
              0 Vote for this issue
              Watchers:
              4 Start watching this issue

                Created:
                Updated:
                Resolved: