  1. Red Hat OpenShift Data Science
  2. RHODS-1832

Error in alert "An OpenShift pod has just restarted"


    • IDH Sprint 9, IDH Sprint 10, IDH Sprint 11, IDH Sprint 12, IDH Sprint 13, IDH Sprint 14, IDH Sprint 15, IDH Sprint 16, IDH Sprint 17

      Description of problem:

       

      If you go to Prometheus > Alerts you'll see the alert "An OpenShift pod has just restarted"

       

      After talking with aasthana@redhat.com we found that this alert might have at least 2 errors:

      • The query uses the negative regex matcher !~ where it should use the positive regex matcher =~
      • This alert should be sent to users, but it is being sent to SRE
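
To illustrate why the choice of matcher matters, here is a minimal Python sketch of how a PromQL regex label matcher selects series. It is not the alert's actual query: the pod names, the namespace, and the pattern are hypothetical, and the `select` helper is invented for this illustration. It relies only on two documented Prometheus behaviours: regex matchers are fully anchored, and a missing label is treated as an empty string.

```python
import re


def select(series, label, pattern, negate=False):
    """Mimic a PromQL regex label matcher.

    negate=False behaves like label=~"pattern",
    negate=True behaves like label!~"pattern".
    Prometheus anchors the regex at both ends, hence fullmatch.
    """
    compiled = re.compile(pattern)
    selected = []
    for labels in series:
        # A missing label is treated as the empty string.
        matched = compiled.fullmatch(labels.get(label, "")) is not None
        if matched != negate:
            selected.append(labels)
    return selected


# Hypothetical series; these names are assumptions, not taken from the ticket.
series = [
    {"namespace": "example-apps", "pod": "example-notebook-user1"},
    {"namespace": "example-apps", "pod": "example-dashboard-abc"},
]
pattern = "example-notebook-.*"

positive = select(series, "pod", pattern)               # pod=~"example-notebook-.*"
negative = select(series, "pod", pattern, negate=True)  # pod!~"example-notebook-.*"

print("=~ selects:", [s["pod"] for s in positive])
print("!~ selects:", [s["pod"] for s in negative])
```

With the same pattern, the two matchers select complementary sets of pods, so swapping one for the other makes an alert fire for exactly the pods it was meant to ignore.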

       

      https://github.com/red-hat-data-services/odh-deployer/blob/b11a3deb5c77caf40c6abdfcb937f4856d9a2839/monitoring/prometheus/prometheus.yaml#L502
       

       

      Reproducibility (Always/Intermittent/Only Once):

      Always

      Build Details:

      RHODS 1.1.1-22

        1. openshift-pod-restarted-alert.png
          53 kB
          Jorge Garcia Oncins
        2. openshift-pod-restarted-alert-definition.png
          37 kB
          Jorge Garcia Oncins
        3. rhods-1832-alerts.png
          103 kB
          Jorge Garcia Oncins

              aasthana@redhat.com Anish Asthana
              rhn-support-jgarciao Jorge Garcia Oncins
              Jorge Garcia Oncins Jorge Garcia Oncins
              Votes: 0
              Watchers: 6
