Loading...

XML

Word

Printable

Type: Bug
Resolution: Done
Priority: Normal
Fix Version/s: 4.21.0
Affects Version/s: 4.21.0
Component/s: Monitoring
Labels:
None

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
Moderate
Regression:
None

Target Backport Versions:
None
Target Version:

4.21.0
Release Blocker:
None
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Impact Score:

Release Note Status:
Done
Release Note Type:
Release Note Not Required
Release Note Text:
N/A

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

case "Alerts shouldn't exceed the series limit of total series sent via telemetry from each cluster" failed on e2e-aws-ovn-techpreview job recently, example

https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/openshift_cluster-monitoring-operator/2679/pull-ci-openshift-cluster-monitoring-operator-main-e2e-aws-ovn-techpreview/1970021981713600512

:
 [sig-instrumentation][Late] Alerts shouldn't exceed the series limit of
 total series sent via telemetry from each cluster 
[Suite:openshift/conformance/parallel]
 {  fail [github.com/openshift/origin/test/extended/prometheus/prometheus.go:564]: Unexpected error:
    <errors.aggregate | len:1, cap:1>: 
    promQL query returned unexpected results:
    avg_over_time(cluster:telemetry_selected_series:count[1h22m50s]) >= 780
    [
      {
        "metric": {
          "prometheus": "openshift-monitoring/k8s"
        },
        "value": [
          1758534739.622,
          "784.5"
        ]
      }
    ]
    [
        <*errors.errorString | 0xc00735a8b0>{
            s: "promQL query returned unexpected results:\navg_over_time(cluster:telemetry_selected_series:count[1h22m50s]) >= 780\n[\n  {\n    \"metric\": {\n      \"prometheus\": \"openshift-monitoring/k8s\"\n    },\n    \"value\": [\n      1758534739.622,\n      \"784.5\"\n    ]\n  }\n]",
        },
    ]

https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/openshift_cluster-monitoring-operator/2656/pull-ci-openshift-cluster-monitoring-operator-main-e2e-aws-ovn-techpreview/1971031731083415552

:
 [sig-instrumentation][Late] Alerts shouldn't exceed the series limit of
 total series sent via telemetry from each cluster 
[Suite:openshift/conformance/parallel]
{  fail [github.com/openshift/origin/test/extended/prometheus/prometheus.go:564]: Unexpected error:
    <errors.aggregate | len:1, cap:1>: 
    promQL query returned unexpected results:
    avg_over_time(cluster:telemetry_selected_series:count[1h37m56s]) >= 780
    [
      {
        "metric": {
          "prometheus": "openshift-monitoring/k8s"
        },
        "value": [
          1758776352.201,
          "788.4744897959183"
        ]
      }
    ]
    [
        <*errors.errorString | 0xc0074b8030>{
            s: "promQL query returned unexpected results:\navg_over_time(cluster:telemetry_selected_series:count[1h37m56s]) >= 780\n[\n  {\n    \"metric\": {\n      \"prometheus\": \"openshift-monitoring/k8s\"\n    },\n    \"value\": [\n      1758776352.201,\n      \"788.4744897959183\"\n    ]\n  }\n]",
        },
    ]

https://prow.ci.openshift.org/view/gs/test-platform-results/pr-logs/pull/openshift_cluster-monitoring-operator/2681/pull-ci-openshift-cluster-monitoring-operator-main-e2e-aws-ovn-techpreview/1970831306245804032

:
 [sig-instrumentation][Late] Alerts shouldn't exceed the series limit of
 total series sent via telemetry from each cluster 
[Suite:openshift/conformance/parallel]
{  fail [github.com/openshift/origin/test/extended/prometheus/prometheus.go:564]: Unexpected error:
    <errors.aggregate | len:1, cap:1>: 
    promQL query returned unexpected results:
    avg_over_time(cluster:telemetry_selected_series:count[1h39m48s]) >= 780
    [
      {
        "metric": {
          "prometheus": "openshift-monitoring/k8s"
        },
        "value": [
          1758728450.108,
          "788.0301507537688"
        ]
      }
    ]
    [
        <*errors.errorString | 0xc0060bb0d0>{
            s: "promQL query returned unexpected results:\navg_over_time(cluster:telemetry_selected_series:count[1h39m48s]) >= 780\n[\n  {\n    \"metric\": {\n      \"prometheus\": \"openshift-monitoring/k8s\"\n    },\n    \"value\": [\n      1758728450.108,\n      \"788.0301507537688\"\n    ]\n  }\n]",
        },
    ]
occurred}

maybe change the limit to 790 is fine

https://github.com/openshift/origin/blob/main/test/extended/prometheus/prometheus.go#L544

Version-Release number of selected component (if applicable):

4.21

How reproducible:

frequently

Steps to Reproduce:

1. see the steps

Actual results:

case "Alerts shouldn't exceed the series limit of total series sent via telemetry from each cluster" failed on e2e-aws-ovn-techpreview job

Expected results:

pass without error

Additional info:

links to

openshift/origin#30302: OCPBUGS-62227: bump telemetry series limit to 1000

Assignee:: Junqi Zhao

Reporter:: Junqi Zhao

QA Contact:: Junqi Zhao

Need Info From:: None

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2025/09/25 9:19 AM

Updated:: 2026/02/10 9:53 AM

Resolved:: 2026/02/10 9:53 AM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates