Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-12760

Add runbook to PrometheusRemoteStorageFailures

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Normal Normal
    • None
    • 4.10
    • Monitoring
    • Moderate
    • No
    • MON Sprint 238
    • 1
    • False
    • Hide

      None

      Show
      None
    • This enhancement adds some context when the PrometheusRemoteStorageFailures is triggered, providing a link to the runbook on how to fix it and a link to https://github.com/openshift/runbooks.
    • Enhancement

      Description of problem:

      When the PrometheusRemoteStorageFailures alert fires, it's not clear for cluster admins what is the impact and how they can resolve it.

      Version-Release number of selected component (if applicable):

      4.10

      How reproducible:

       

      Steps to Reproduce:

      1. Configure CMO with an invalid remote-write endpoint (e.g. invalid URL or authentication).
      2.
      3.
      

      Actual results:

      The PrometheusRemoteStorageFailures alert fires but it provides little information about what's the issue and how to remediate.

      Expected results:

      The PrometheusRemoteStorageFailures alert fires and there's a link to a runbook in https://github.com/openshift/runbooks.

      Additional info:

      Reported via the internal OpenShift SME mailing list.

              rhn-support-bburt Brian Burt
              spasquie@redhat.com Simon Pasquier
              Tai Gao Tai Gao
              Brian Burt Brian Burt
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

                Created:
                Updated:
                Resolved: