Loading...

XML

Word

Printable

Type: Bug
Resolution: Won't Do
Priority: Normal
Fix Version/s: None
Affects Version/s: 4.11.z
Component/s: Monitoring
Labels:
None

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
Moderate
Regression:
None

Target Backport Versions:
None
Target Version:
None
Release Blocker:
None
Sprint:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

We consistently see pod `prometheus-k8s-1` is being killed and restarted.
1. Restarting is stuck due to WAL reply
2. Pod consumes >17G memory before being killed, which is well beyond the request memory setting (https://github.com/openshift/release/blob/4ad2f102f3b6ff11b1a77331b9f788558c56b548/clusters/build-clusters/01_cluster/openshift-monitoring/cluster-monitoring-config_configmap.yaml#L26)

Version-Release number of selected component (if applicable):

4.11.4

How reproducible:

It just happens to build01, no  specific step to trigger the issue.

Steps to Reproduce:

N/A

Actual results:

Name Status Ready Restarts Owner Memory CPU Created
prometheus-k8s-1 Running	5/6	41 prometheus-k8s	1,521.2 MiB	0.400 cores	Sep 16, 2022, 5:24 AM

Expected results:

1. prometheus-k8s works fine
2. No alarms PrometheusNotConnectedToAlertmanagers is fired

Additional info:

N/A

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

image-2022-09-19-10-21-30-195.png
103 kB
2022/09/19 8:21 AM
inspect.local.4538195566830101267.tar.gz
2.75 MB
2022/09/26 5:48 PM
inspect.local.8952935911337421758.tar.gz
2.65 MB
2022/09/16 3:55 PM
promethus-k8s-1.log
39 kB
2022/09/26 5:46 PM
screenshot-1.png
54 kB
2022/09/16 3:34 PM
screenshot-2.png
34 kB
2022/09/16 3:37 PM

Assignee:: Haoyu Sun

Reporter:: Bear Chen

QA Contact:: Junqi Zhao

Need Info From:: None

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Created:: 2022/09/16 2:30 PM

Updated:: 2025/07/29 5:49 AM

Resolved:: 2023/03/10 8:08 PM