Type: Bug
Resolution: Done
Priority: Normal
Fix Version/s: None
Affects Version/s: 4.12
Component/s: Installer / Single Node OpenShift
Labels:
- ocpedge
- sno

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:
None
Story Points:
3
Severity:
Moderate
Regression:
No
Latest Status Summary:
After speaking with the monitoring team, I'll proceed with measuring the CPU usage of the control plane components described in the OpenShift docs.

Target Backport Versions:
None
Target Version:

4.16.0
Release Blocker:
None
Sprint:
OCP VE Sprint 244, OCPEDGE Sprint 248, OCPEDGE Sprint 249, OCPEDGE Sprint 250, OCPEDGE Sprint 251, OCPEDGE Sprint 252
sprint_count:
6

RH Private Keywords:

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Review Complete:
PX Priority Data:
PX Impact Score:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

The HighOverallControlPlaneCPU alert for SNO uses the default threshold from MNO, which is 60% of master node capacity. On SNO, this leads either to false positive alerts (on small SNO clusters with, e.g., only 4 cores) or to false negatives (e.g., no alert on a cluster with 40 cores where the control plane consumes 50% of the cores).
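The arithmetic behind both failure modes can be sketched as follows. This is a minimal illustration, not the actual alert expression: the helper names and the 2.6-core figure for the small cluster are hypothetical, and only the 60% threshold and the 4-core and 40-core examples come from this report.

```python
# Illustrative only: models a fixed percentage threshold against node capacity.
# The function names and the 2.6-core usage figure are assumptions for this sketch.

THRESHOLD_PERCENT = 60.0  # default threshold described in this report


def utilization_percent(used_cores: float, total_cores: float) -> float:
    """Return CPU usage as a percentage of the node's total cores."""
    if total_cores <= 0:
        raise ValueError("total_cores must be positive")
    return 100.0 * used_cores / total_cores


def alert_fires(used_cores: float, total_cores: float,
                threshold_percent: float = THRESHOLD_PERCENT) -> bool:
    """Return True when usage exceeds the fixed percentage threshold."""
    return utilization_percent(used_cores, total_cores) > threshold_percent


# Small SNO node: 4 cores, hypothetical 2.6 cores in use (about 65%).
small_node_alert = alert_fires(used_cores=2.6, total_cores=4)

# Large SNO node: 40 cores, control plane using 50% (20 cores).
large_node_alert = alert_fires(used_cores=20, total_cores=40)

print(f"4-core node alert:  {small_node_alert}")
print(f"40-core node alert: {large_node_alert}")
```

Under these assumptions the small node alerts on a modest absolute load, while the large node stays silent even though 20 cores are consumed, which is the mismatch the report describes.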

Version-Release number of selected component (if applicable):

4.12

How reproducible:

always

Steps to Reproduce:

1. Install SNO.
2. Run "$ oc get prometheusrules cpu-utilization -n openshift-kube-apiserver -o yaml | grep HighOverallControlPlaneCPU
..."
3. Observe that the alert threshold is ">60".

Actual results:

The alert threshold is ">60", the same as for MNO.

Expected results:

The alert threshold is adjusted to a sensible value for SNO.

Additional info:

causes

OCPBUGS-27842 Add SNO to HighOverallControlPlaneCPU alert description

Closed

OCPBUGS-28881 Create separate HighOverallControlPlaneCPU SNO alert

Closed

is related to

OCPBUGS-31354 [release-4.14] Misleading alert regarding high control plane CPU utilization in Single Node OpenShift (SNO) cluster

Closed

relates to

RFE-4714 adjust HighOverallControlPlaneCPU alert for SNO

Closed

split to

OCPEDGE-827 Enable Workload Partitioning Metrics for SNO Alerting

Closed

Assignee: Bulat Zamalutdinov

Reporter: Daniel Fröhlich

Need Info From: None

Contributors: Chad Scribner

QA Contact: Pedro Jose Amoedo Martinez

Doc Contact: None

Votes: 0

Watchers: 17

Created: 2023/10/19 1:56 PM

Updated: 2025/09/13 2:22 PM

Resolved: 2024/04/23 9:17 AM
