Loading...

XML

Word

Printable

Type: Bug
Resolution: Unresolved
Priority: Critical
Fix Version/s: None
Affects Version/s: 4.12
Component/s: Telco Performance
Labels:
- system-test
- telco-priority-3

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
None
Regression:
None

Target Backport Versions:
None
Target Version:
None
Release Blocker:
Rejected
Sprint:
None

Internal Whiteboard:

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Impact Score:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:

When running a vdu test app workload on an SNO with DU profile running non-rt kernel the load average increases to ~130. While the node is under load, some of the times the kube api cannot recover following a rollout.

Version-Release number of selected component (if applicable):

4.12.0-rc.0 with 4.18.0-372.36.1.el8_6.x86_64 kernel

How reproducible:

Consistently

Steps to Reproduce:

1. Deploy and configure SNO with DU profile with 4.18.0-372.36.1.el8_6.x86_64 kernel

2. Deploy test app

3. Force a kube api server rollout:
oc patch kubeapiserver cluster -p='{"spec": {"forceRedeploymentReason": "recovery-'"$( date --rfc-3339=ns )"'"}}' --type=merge 

4. Wait for kube-apiserver to achieve a new revision

Actual results:

Some of the times the kube-apiserver doesn't recover and remains unreachable.

Expected results:

kube-apiserver always recovers

Additional info:

- - Sort By Name
  - Sort By Date
  - Ascending
  - Descending
  - Thumbnails
  - List
  - Download All

dmesg
225 kB
2022/11/25 7:54 PM

Assignee:: Bart Wensley

Reporter:: Marius Cornea

Need Info From:: None

Contributors:: None

QA Contact:: Marius Cornea

Doc Contact:: None

Votes:: 0 Vote for this issue

Watchers:: 5 Start watching this issue

Created:: 2022/11/25 1:27 PM

Updated:: 2025/07/28 5:42 PM