Uploaded image for project: 'Red Hat Advanced Cluster Management'
  1. Red Hat Advanced Cluster Management
  2. ACM-2163

Service monitoring and alerting for ACM

XMLWordPrintable

    • Service monitoring and alerting for ACM pillers
    • False
    • None
    • False
    • Green
    • Done
    • ACM-1585 - Integrate RHACM into Service Delivery
    • ACM-1585Integrate RHACM into Service Delivery
    • 0
    • 0% 0%

      Epic Goal

      • Define SLIs for components that will be used by Service Delivery.
      • Code instrumentation for agreed upon SLIs, expose metrics.
      • Define alerting rules for SLIs.
      • Determine starting SLO based on aggregation of our SLIs.

      Why is this important?

      • Meet SLA requirements that will be established as part of SD.
      • Service monitoring and alerting will be essential for quick RCA and resolution for service disruptions across environments.

      Scenarios

      1. ACM health pre and post install/upgrade
      2. Hypershift Addon and its installer components
      3. Policy controllers, but not the actual policy being applied
      4. Foundation components
      •  

      Acceptance Criteria

      • CI - MUST be running successfully with tests automated
      • Release Technical Enablement - Provide necessary release enablement details and documents.

      Dependencies (internal and external)

      1. Hypershift-addon
        1. agent addon
        2. agent Hypershift-operator
        3. agent External DNS
      1. Policy
        1. All controllers
        2. agents
      2. Foundation
        1. Hub side controllers
        2. Agent side controllers

      Previous Work (Optional):

      1. Server Foundation F2F 2022 discussion
      2. Hypershift addon document

      Open questions::

      1. Are there a set of signals SLI's that service devivery requires or suggests?
      2. How many of the signals can be just rules? (no code change required)

      Done Checklist

      • CI - CI is running, tests are automated and merged.
      • Release Enablement <link to Feature Enablement Presentation>
      • DEV - Upstream code and tests merged: <link to meaningful PR or GitHub Issue>
      • DEV - Upstream documentation merged: <link to meaningful PR or GitHub Issue>
      • DEV - Downstream build{}

            jpacker@redhat.com Joshua Packer
            showeimer Sho Weimer
            Juliana Hsu Juliana Hsu (Inactive)
            David Huynh, Derek Ho
            Song Lai Song Lai
            Joydeep Banerjee Joydeep Banerjee
            Bradd Weidenbenner Bradd Weidenbenner
            Votes:
            0 Vote for this issue
            Watchers:
            7 Start watching this issue

              Created:
              Updated:
              Resolved: