Uploaded image for project: 'Red Hat Advanced Cluster Management'
  1. Red Hat Advanced Cluster Management
  2. ACM-2163

Service monitoring and alerting for ACM

XMLWordPrintable

    • Service monitoring and alerting for ACM pillers
    • False
    • None
    • False
    • Green
    • Done
    • ACM-1585 - Integrate RHACM into Service Delivery
    • ACM-1585Integrate RHACM into Service Delivery

      Epic Goal

      • Define SLIs for components that will be used by Service Delivery.
      • Code instrumentation for agreed upon SLIs, expose metrics.
      • Define alerting rules for SLIs.
      • Determine starting SLO based on aggregation of our SLIs.

      Why is this important?

      • Meet SLA requirements that will be established as part of SD.
      • Service monitoring and alerting will be essential for quick RCA and resolution for service disruptions across environments.

      Scenarios

      1. ACM health pre and post install/upgrade
      2. Hypershift Addon and its installer components
      3. Policy controllers, but not the actual policy being applied
      4. Foundation components
      •  

      Acceptance Criteria

      • CI - MUST be running successfully with tests automated
      • Release Technical Enablement - Provide necessary release enablement details and documents.

      Dependencies (internal and external)

      1. Hypershift-addon
        1. agent addon
        2. agent Hypershift-operator
        3. agent External DNS
      1. Policy
        1. All controllers
        2. agents
      2. Foundation
        1. Hub side controllers
        2. Agent side controllers

      Previous Work (Optional):

      1. Server Foundation F2F 2022 discussion
      2. Hypershift addon document

      Open questions::

      1. Are there a set of signals SLI's that service devivery requires or suggests?
      2. How many of the signals can be just rules? (no code change required)

      Done Checklist

      • CI - CI is running, tests are automated and merged.
      • Release Enablement <link to Feature Enablement Presentation>
      • DEV - Upstream code and tests merged: <link to meaningful PR or GitHub Issue>
      • DEV - Upstream documentation merged: <link to meaningful PR or GitHub Issue>
      • DEV - Downstream build{}

              jpacker@redhat.com Joshua Packer
              showeimer Sho Weimer
              Juliana Hsu Juliana Hsu (Inactive)
              David Huynh, Derek Ho
              Song Lai Song Lai
              Joydeep Banerjee Joydeep Banerjee
              Bradd Weidenbenner Bradd Weidenbenner
              Votes:
              0 Vote for this issue
              Watchers:
              7 Start watching this issue

                Created:
                Updated:
                Resolved: