  1. Red Hat Advanced Cluster Management
  2. ACM-2817

Alerts for ServerFoundation addon


    • 5
    • False
    • None
    • False
    • SLOs, SLIs, and alerts can be queried with PromQL in PromLens: https://promlens.stage.devshift.net/
    • Simulate a failure condition (what to fail depends on the SLO/SLI definition) and confirm that it is reflected accurately in the SLO/SLI value.
    • Simulate failure conditions and confirm that the alerts pick them up.
    • Confirm that the subset of metrics indicated by dev is visible in the ACM Grafana explorer (creating dashboards is out of scope).
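
To illustrate the failure-simulation checks, here is a minimal Python sketch of how a simulated failure should show up in an availability-style SLI and trip an alert condition. Everything in it is an assumption for illustration: the function names, the probe data, and the 0.99 target are hypothetical and are not taken from the Foundation SLI/SLO document. The real SLIs would be computed in PromQL.

```python
from typing import Sequence


def availability_sli(samples: Sequence[bool]) -> float:
    """Return the fraction of samples that were healthy (True)."""
    if not samples:
        raise ValueError("need at least one sample")
    return sum(samples) / len(samples)


def breaches_slo(sli: float, slo_target: float) -> bool:
    """An alert should fire when the measured SLI falls below the SLO target."""
    return sli < slo_target


# Hypothetical data: 100 health probes of an addon, all healthy.
healthy = [True] * 100
# Simulated failure: the last 5 probes fail.
degraded = [True] * 95 + [False] * 5

SLO_TARGET = 0.99  # assumed target, not taken from the SLO document

baseline_sli = availability_sli(healthy)
degraded_sli = availability_sli(degraded)

print(baseline_sli, breaches_slo(baseline_sli, SLO_TARGET))
print(degraded_sli, breaches_slo(degraded_sli, SLO_TARGET))
```

The check passes when the SLI drops by the amount the simulated failure implies and the alert condition flips from false to true.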


    • Observability Sprint 2023-04
    • No

      Value:

      For Managed Services deployments of Hypershift clusters, we need to create SLOs/SLIs and alerts for Registration, Work Agent, addon status, etc., because they are in the critical path of hosted cluster creation.

      Server Foundation metrics:
      https://docs.google.com/document/d/1p6PeBKLsAvAeiiF0BSdncgVq5znBMO4qN3MrT-dVFpA/edit#heading=h.z74yx3s79gh5

      Foundation SLI/SLO/Alerts doc (leyan@redhat.com)
      https://docs.google.com/document/d/1HYtzD3pn7d5-11Q_3IFpcRKzlkbw2yHqvZJy8pV6Vw4/edit

      Definition of Done for Engineering Story Owner (Checklist)

      • ...

      Development Complete

      • [ ] The code is complete. PR accepted in repo -
      • [ ] The SLO/SLI can be seen using PromLens: https://promlens.stage.devshift.net/
      • [ ] Add the metrics to the ACM allow-list, excluding those that would cause a time-series explosion.
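
One way to decide which metrics are safe for the allow-list is to compare each metric's active series count against a budget. The sketch below is only an illustration of that idea: the metric names, the counts, and the 10,000-series threshold are hypothetical, and it does not reflect how the ACM allow-list is actually configured.

```python
from typing import Dict, List


def build_allow_list(series_counts: Dict[str, int], max_series: int) -> List[str]:
    """Keep metrics whose active series count is at or below max_series."""
    return sorted(
        name for name, count in series_counts.items() if count <= max_series
    )


# Hypothetical metric names and series counts, for illustration only.
observed = {
    "example_addon_status": 40,
    "example_agent_up": 12,
    "example_request_duration_bucket": 250_000,  # high cardinality
}

allow_list = build_allow_list(observed, max_series=10_000)
print(allow_list)
```

A fixed threshold is the simplest possible rule. In practice the cutoff would depend on how many managed clusters report each metric.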

              dbennett@redhat.com Disaiah Bennett
              jbanerje@redhat.com Joydeep Banerjee
              Xiang Yin Xiang Yin