Type: Epic
Resolution: Done
Priority: Normal
Fix Version/s: openshift-4.17
Affects Version/s: None
Component/s: SNO
Labels:
- 4.17-candidate
- ocpedge-plan

Epic Name:
Enable Workload Partitioning Metrics for SNO Alerting
Epic Status:
Done
Activity Type:
Future Sustainability
Hierarchy Progress Bar:

0% To Do, 0% In Progress, 100% Done
Blocked:
False
Blocked Reason:

None
Ready:
False
Color Status:
Green
Size:
M

Target Version:

openshift-4.17
Release Blocker:
None

Story Points:
11

Goal

The goal of this epic is to adjust HighOverallControlPlaneCPU alert thresholds when Workload Partitioning is enabled.

Why is this important?

On SNO clusters, the unadjusted threshold can lead to false positives. Such a mechanism also makes sense because the control plane alert is currently computed against all available CPUs, whereas with workload partitioning the user can restrict the control plane to fewer cores.
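The difference between measuring against all CPUs and measuring against only the cores set aside for the control plane can be illustrated with a small sketch. This is not the alert's actual expression. The function names, the per-core busy values, the reserved core IDs, and the 60% threshold are all hypothetical and chosen only for illustration.

```python
from statistics import mean


def cpu_utilization_percent(busy_by_core, cores=None):
    """Average busy percentage over the selected cores (all cores if None).

    busy_by_core maps a core id to its busy fraction in [0, 1].
    """
    if cores is None:
        selected = list(busy_by_core.values())
    else:
        selected = [busy_by_core[c] for c in cores]
    if not selected:
        raise ValueError("no cores selected")
    return 100.0 * mean(selected)


def should_alert(busy_by_core, threshold_percent, reserved_cores=None):
    """True if utilization over the chosen cores exceeds the threshold."""
    return cpu_utilization_percent(busy_by_core, reserved_cores) > threshold_percent


# Hypothetical 8-core SNO node: cores 0-1 are reserved for the control
# plane, cores 2-7 run user workloads and are heavily loaded.
busy = {0: 0.30, 1: 0.40, 2: 0.95, 3: 0.90, 4: 0.95, 5: 0.90, 6: 0.95, 7: 0.90}
RESERVED = [0, 1]   # assumption: cores assigned to the control plane
THRESHOLD = 60.0    # assumption: illustrative threshold, in percent

whole_node = cpu_utilization_percent(busy)
reserved_only = cpu_utilization_percent(busy, RESERVED)

print(f"whole node:    {whole_node:.1f}% alert={should_alert(busy, THRESHOLD)}")
print(f"reserved only: {reserved_only:.1f}% alert={should_alert(busy, THRESHOLD, RESERVED)}")
```

In this made-up scenario the whole-node figure crosses the threshold because of user workloads, while the cores reserved for the control plane are lightly loaded. That is the kind of false positive the epic describes.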

Scenarios

As a user, I want to enable workload partitioning and have my alert thresholds adjusted accordingly.

Acceptance Criteria

CI - MUST be running successfully with tests automated
Release Technical Enablement
...

Open questions:

Do we want to raise the alert threshold for SNO clusters, since they run workloads on master nodes rather than on worker nodes?

Done Checklist

CI - CI is running, tests are automated and merged.
Release Technical Enablement <link to Feature Enablement Presentation>
DEV - Upstream code and tests merged: <link to meaningful PR or GitHub Issue>
DEV - Upstream documentation merged: <link to meaningful PR or GitHub Issue>
DEV - Downstream build attached to advisory: <link to errata>
QE - Test plans in Polarion: <link or reference to Polarion>
QE - Automated tests merged: <link or reference to automated tests>
DOC - Downstream documentation merged: <link to meaningful PR>

is depended on by

OCPBUGS-35833 Misleading alert regarding high control plane CPU utilization in Single Node OpenShift (SNO) cluster

Closed

is related to

OCPBUGS-31354 [release-4.14] Misleading alert regarding high control plane CPU utilization in Single Node OpenShift (SNO) cluster

Closed

split from

OCPBUGS-22117 HighOverallControlPlaneCPU alert for SNO has wrong threshold

Closed

links to

openshift/cluster-kube-apiserver-operator#1676: OCPEDGE-902: add SNO control plane high cpu usage alert

Assignee: Bulat Zamalutdinov

Reporter: Chad Scribner

Contributors: None

QA Contact: Ke Wang

Doc Contact: Daniel Macpherson

SME: Egli Hila

Votes: 0

Watchers: 9

Created: 2024/01/26 11:42 AM

Updated: 2025/09/16 11:23 AM

Resolved: 2024/07/09 9:37 AM