Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-62227

case "Alerts shouldn't exceed the series limit of total series sent via telemetry from each cluster" failed on e2e-aws-ovn-techpreview job

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Normal Normal
    • 4.21.0
    • 4.21.0
    • Monitoring
    • None
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • Moderate
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      case "Alerts shouldn't exceed the series limit of total series sent via telemetry from each cluster" failed on e2e-aws-ovn-techpreview job recently, example

      https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/openshift_cluster-monitoring-operator/2679/pull-ci-openshift-cluster-monitoring-operator-main-e2e-aws-ovn-techpreview/1970021981713600512

      :
       [sig-instrumentation][Late] Alerts shouldn't exceed the series limit of
       total series sent via telemetry from each cluster 
      [Suite:openshift/conformance/parallel]
       {  fail [github.com/openshift/origin/test/extended/prometheus/prometheus.go:564]: Unexpected error:
          <errors.aggregate | len:1, cap:1>: 
          promQL query returned unexpected results:
          avg_over_time(cluster:telemetry_selected_series:count[1h22m50s]) >= 780
          [
            {
              "metric": {
                "prometheus": "openshift-monitoring/k8s"
              },
              "value": [
                1758534739.622,
                "784.5"
              ]
            }
          ]
          [
              <*errors.errorString | 0xc00735a8b0>{
                  s: "promQL query returned unexpected results:\navg_over_time(cluster:telemetry_selected_series:count[1h22m50s]) >= 780\n[\n  {\n    \"metric\": {\n      \"prometheus\": \"openshift-monitoring/k8s\"\n    },\n    \"value\": [\n      1758534739.622,\n      \"784.5\"\n    ]\n  }\n]",
              },
          ] 

      https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/openshift_cluster-monitoring-operator/2656/pull-ci-openshift-cluster-monitoring-operator-main-e2e-aws-ovn-techpreview/1971031731083415552

      :
       [sig-instrumentation][Late] Alerts shouldn't exceed the series limit of
       total series sent via telemetry from each cluster 
      [Suite:openshift/conformance/parallel]
      {  fail [github.com/openshift/origin/test/extended/prometheus/prometheus.go:564]: Unexpected error:
          <errors.aggregate | len:1, cap:1>: 
          promQL query returned unexpected results:
          avg_over_time(cluster:telemetry_selected_series:count[1h37m56s]) >= 780
          [
            {
              "metric": {
                "prometheus": "openshift-monitoring/k8s"
              },
              "value": [
                1758776352.201,
                "788.4744897959183"
              ]
            }
          ]
          [
              <*errors.errorString | 0xc0074b8030>{
                  s: "promQL query returned unexpected results:\navg_over_time(cluster:telemetry_selected_series:count[1h37m56s]) >= 780\n[\n  {\n    \"metric\": {\n      \"prometheus\": \"openshift-monitoring/k8s\"\n    },\n    \"value\": [\n      1758776352.201,\n      \"788.4744897959183\"\n    ]\n  }\n]",
              },
          ] 

      https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/openshift_cluster-monitoring-operator/2681/pull-ci-openshift-cluster-monitoring-operator-main-e2e-aws-ovn-techpreview/1970831306245804032

      :
       [sig-instrumentation][Late] Alerts shouldn't exceed the series limit of
       total series sent via telemetry from each cluster 
      [Suite:openshift/conformance/parallel]
      {  fail [github.com/openshift/origin/test/extended/prometheus/prometheus.go:564]: Unexpected error:
          <errors.aggregate | len:1, cap:1>: 
          promQL query returned unexpected results:
          avg_over_time(cluster:telemetry_selected_series:count[1h39m48s]) >= 780
          [
            {
              "metric": {
                "prometheus": "openshift-monitoring/k8s"
              },
              "value": [
                1758728450.108,
                "788.0301507537688"
              ]
            }
          ]
          [
              <*errors.errorString | 0xc0060bb0d0>{
                  s: "promQL query returned unexpected results:\navg_over_time(cluster:telemetry_selected_series:count[1h39m48s]) >= 780\n[\n  {\n    \"metric\": {\n      \"prometheus\": \"openshift-monitoring/k8s\"\n    },\n    \"value\": [\n      1758728450.108,\n      \"788.0301507537688\"\n    ]\n  }\n]",
              },
          ]
      occurred} 

      maybe change the limit to 790 is fine

      https://github.com/openshift/origin/blob/main/test/extended/prometheus/prometheus.go#L544

      Version-Release number of selected component (if applicable):

      4.21

      How reproducible:

      frequently

      Steps to Reproduce:

      1. see the steps

      Actual results:

      case "Alerts shouldn't exceed the series limit of total series sent via telemetry from each cluster" failed on e2e-aws-ovn-techpreview job

      Expected results:

      pass without error    

      Additional info:

          

              juzhao@redhat.com Junqi Zhao
              juzhao@redhat.com Junqi Zhao
              None
              None
              Junqi Zhao Junqi Zhao
              None
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated: