Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Undefined
Fix Version/s: 4.17.z
Affects Version/s: 4.17.z
Component/s: Monitoring
Labels:
None

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
None
Regression:
None

Target Backport Versions:
None
Target Version:

4.17.z
Release Blocker:
None
Sprint:
MON Sprint 279
sprint_count:
1

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Impact Score:

Release Note Status:
In Progress
Release Note Type:
Release Note Not Required
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

4.17 PR https://github.com/openshift/cluster-monitoring-operator/pull/2732

e2e-aws-ovn-techpreview job failed frequently, checked the history jobs

https://prow.ci.openshift.org/pr-history/?org=openshift&repo=cluster-monitoring-operator&pr=2732

for example

https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/openshift_cluster-monitoring-operator/2732/pull-ci-openshift-cluster-monitoring-operator-release-4.17-e2e-aws-ovn-techpreview/1987875064694968320

case "Alerts shouldn't exceed the series limit of total series sent via telemetry from each cluster" failed for exceeded its limti

 [sig-instrumentation][Late] Alerts shouldn't exceed the series limit of
 total series sent via telemetry from each cluster 
[Suite:openshift/conformance/parallel] expand_less
{  fail [github.com/openshift/origin/test/extended/prometheus/prometheus.go:408]: Unexpected error:
    <errors.aggregate | len:1, cap:1>: 
    promQL query returned unexpected results:
    avg_over_time(cluster:telemetry_selected_series:count[49m5s]) >= 760
    [
      {
        "metric": {
          "prometheus": "openshift-monitoring/k8s"
        },
        "value": [
          1762788231.919,
          "766.5510204081634"
        ]
      }
    ]
    [
        <*errors.errorString | 0xc0019afbb0>{
            s: "promQL query returned unexpected results:\navg_over_time(cluster:telemetry_selected_series:count[49m5s]) >= 760\n[\n  {\n    \"metric\": {\n      \"prometheus\": \"openshift-monitoring/k8s\"\n    },\n    \"value\": [\n      1762788231.919,\n      \"766.5510204081634\"\n    ]\n  }\n]",
        },
    ]
occurred
Ginkgo exit error 1: exit with code 1}

since https://github.com/openshift/origin/blob/release-4.18/test/extended/prometheus/prometheus.go#L426 is bumped the limit to 780, bump to 780 for 4.17 will make the case passed

Version-Release number of selected component (if applicable):

4.17.z

How reproducible:

frequently

Steps to Reproduce:

1. check failed jobs for 4.17 e2e-aws-ovn-techpreview job

Actual results:

case "Alerts shouldn't exceed the series limit of total series sent via telemetry from each cluster" failed

Expected results:

pass

Additional info:

links to

openshift/origin#30472: OCPBUGS-64927: update telemtery limit for 4.17 e2e case

Assignee:: Junqi Zhao

Reporter:: Junqi Zhao

Need Info From:: None

Contributors:: None

QA Contact:: Junqi Zhao

Doc Contact:: None

Votes:: 0 Vote for this issue

Watchers:: 4 Start watching this issue

Created:: 2025/11/11 7:26 AM

Updated:: 2025/11/13 4:18 AM

Resolved:: 2025/11/12 1:52 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates