  1. Red Hat Advanced Cluster Management
  2. ACM-18082

ACM alerts never fire due to high threshold on `rate`


    • Quality / Stability / Reliability
    • 1
    • False
    • False
    • MCO Core Sprint 46, MCO Core Sprint 47
    • Low
    • None

      Similar to ACM-18001, several alerts take the `rate` of a given metric and expect it to exceed 10. These metrics usually increase only rarely (for example, once every 5 minutes per spoke), which means the alert never fires unless the number of spokes is very high.
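To make the scale of the problem concrete, here is a rough back-of-the-envelope sketch. It assumes the alert sums the per-second `rate` across all spokes and that each spoke increments the counter once every 5 minutes, as in the example above. The function names are hypothetical and the real alert expressions may differ.

```python
# Assumption: each spoke increments the counter once every 5 minutes,
# and the alert compares the summed per-second rate against 10.
INCREMENT_INTERVAL_SECONDS = 5 * 60
RATE_THRESHOLD = 10


def aggregate_rate(spokes: int) -> float:
    """Per-second rate summed over all spokes (one increment per interval each)."""
    return spokes / INCREMENT_INTERVAL_SECONDS


def spokes_needed(threshold: int = RATE_THRESHOLD) -> int:
    """Smallest number of spokes whose summed rate strictly exceeds the threshold."""
    return threshold * INCREMENT_INTERVAL_SECONDS + 1


if __name__ == "__main__":
    for spokes in (10, 100, 1000):
        print(f"{spokes} spokes -> {aggregate_rate(spokes):.3f}/s")
    print(f"spokes needed to exceed {RATE_THRESHOLD}/s: {spokes_needed()}")
```

Under these assumptions a single spoke contributes about 0.003 increments per second, so the summed rate only passes 10 once the fleet is in the thousands of spokes.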

      These alerts should be reviewed to decide whether to fix them or drop them altogether. Ideally, the alerts would fire on error percentages (for example, if 20% of all requests fail) instead of on a static rate.
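A minimal sketch of the percentage-based idea, to show why it is independent of fleet size. It assumes the alert has access to both a failed-request count and a total-request count over the same window. The function names and the zero-traffic behavior are illustrative assumptions, not the actual alert rules.

```python
# Assumption: failed and total request counts are measured over the same window.
ERROR_RATIO_THRESHOLD = 0.20  # the 20% example from the text


def error_ratio(failed: float, total: float) -> float:
    """Fraction of requests that failed; 0.0 when there was no traffic."""
    if total <= 0:
        return 0.0
    return failed / total


def should_fire(failed: float, total: float,
                threshold: float = ERROR_RATIO_THRESHOLD) -> bool:
    """Fire when the failure fraction reaches the threshold, whatever the volume."""
    return error_ratio(failed, total) >= threshold


if __name__ == "__main__":
    print(should_fire(failed=1, total=4))      # small fleet, 25% failing
    print(should_fire(failed=50, total=1000))  # large fleet, 5% failing
```

Because the condition is a ratio, a small hub with a handful of spokes and a large hub with thousands are judged by the same standard, which a static rate threshold cannot do.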

      Alerts affected:

              rh-ee-dbuchana Daniel Buchanan
              rh-ee-jachanse Jacob Baungard Hansen
