Uploaded image for project: 'OpenShift Hive'
  1. OpenShift Hive
  2. HIVE-2618

Improve monitoring and operation through metrics and dashboards

XMLWordPrintable

    • Icon: Epic Epic
    • Resolution: Unresolved
    • Icon: Major Major
    • None
    • None
    • Metrics enhanced
    • False
    • None
    • False
    • Not Selected
    • To Do
    • 29% To Do, 14% In Progress, 57% Done
    • M

      Epic Goal

      • Improve the configurability, visibility and usefulness of the metrics that Hive exports and expose them in the team dashboards to improve our ability to respond to incidents and escalations

      Why is this important?

      • Improves the ability of SREs in diagnosing OSD/ROSA issues
      • Reduces the time investment in escalation troubleshooting
      • Reduces the amount of alerts
      • Gives better signal on the maturity of new features
      • Informs future priorities on where we should invest in improvements

      Acceptance Criteria

      • Document new metrics
      • Present new dashboards
      • Incorporate improvements in SOPs

      Previous Work (Optional):

      1. HIVE-2344 contains the design for what needs to be implemented here

      Open questions::

      Done Checklist

      • CI - CI is running, tests are automated and merged.
      • Release Enablement <link to Feature Enablement Presentation>
      • DEV - Upstream code and tests merged: <link to meaningful PR or GitHub Issue>
      • DEV - Upstream documentation merged: <link to meaningful PR or GitHub Issue>
      • DEV - Downstream build attached to advisory: <link to errata>
      • QE - Test plans in Polarion: <link or reference to Polarion>
      • QE - Automated tests merged: <link or reference to automated tests>
      • DOC - Downstream documentation merged: <link to meaningful PR>

              sumehta Suhani Mehta
              asegurap1@redhat.com Antoni Segura Puimedon
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: