Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-64927

update telemtery limit for 4.17 e2e case

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Undefined Undefined
    • None
    • 4.17.z
    • Monitoring
    • None
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None
    • MON Sprint 279
    • 1
    • In Progress
    • Release Note Not Required
    • None
    • None
    • None
    • None
    • None

      Description of problem:

      4.17 PR https://github.com/openshift/cluster-monitoring-operator/pull/2732

      e2e-aws-ovn-techpreview job failed frequently, checked the history jobs

      https://prow.ci.openshift.org/pr-history/?org=openshift&repo=cluster-monitoring-operator&pr=2732

      for example

      https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/openshift_cluster-monitoring-operator/2732/pull-ci-openshift-cluster-monitoring-operator-release-4.17-e2e-aws-ovn-techpreview/1987875064694968320

      case "Alerts shouldn't exceed the series limit of total series sent via telemetry from each cluster" failed for exceeded its limti

       [sig-instrumentation][Late] Alerts shouldn't exceed the series limit of
       total series sent via telemetry from each cluster 
      [Suite:openshift/conformance/parallel] expand_less
      {  fail [github.com/openshift/origin/test/extended/prometheus/prometheus.go:408]: Unexpected error:
          <errors.aggregate | len:1, cap:1>: 
          promQL query returned unexpected results:
          avg_over_time(cluster:telemetry_selected_series:count[49m5s]) >= 760
          [
            {
              "metric": {
                "prometheus": "openshift-monitoring/k8s"
              },
              "value": [
                1762788231.919,
                "766.5510204081634"
              ]
            }
          ]
          [
              <*errors.errorString | 0xc0019afbb0>{
                  s: "promQL query returned unexpected results:\navg_over_time(cluster:telemetry_selected_series:count[49m5s]) >= 760\n[\n  {\n    \"metric\": {\n      \"prometheus\": \"openshift-monitoring/k8s\"\n    },\n    \"value\": [\n      1762788231.919,\n      \"766.5510204081634\"\n    ]\n  }\n]",
              },
          ]
      occurred
      Ginkgo exit error 1: exit with code 1} 

      since https://github.com/openshift/origin/blob/release-4.18/test/extended/prometheus/prometheus.go#L426 is bumped the limit to 780, bump to 780 for 4.17 will make the case passed

      Version-Release number of selected component (if applicable):

      4.17.z

      How reproducible:

      frequently    

      Steps to Reproduce:

      1. check failed jobs for 4.17 e2e-aws-ovn-techpreview job

      Actual results:

      case "Alerts shouldn't exceed the series limit of total series sent via telemetry from each cluster" failed

      Expected results:

      pass    

      Additional info:

          

              juzhao@redhat.com Junqi Zhao
              juzhao@redhat.com Junqi Zhao
              None
              None
              Junqi Zhao Junqi Zhao
              None
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: