Type: Bug
Resolution: Can't Do
Priority: Critical
Fix Version/s: None
Affects Version/s: 4.14.z
Component/s: kube-apiserver
Labels:

Severity: Critical
Regression: No
Blocked: False
Blocked Reason: None
RH Private Keywords:
Target Version: 4.14.z

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Impact Score:

During the 12-hour stability test with stress-ng at 60% load, the API server restarted and never recovered. As a result, all the stress pods stayed running and continued to load the cluster. The console showed that a cgroup OOM had occurred. Master 0 and master 2 could not be reached via SSH. Master 1 was reachable but could not execute any oc command. After master 2 was restarted via KVM, it was possible to delete the stress deployment. However, master 0 and master 2 had to be restarted again before the cluster became more stable.

It was confirmed that the 4.14.4 set being used was identical to the GA version.

Assignee: Unassigned

Reporter: Vishvranjan Mishra

QA Contact: Xingxing Xia

Votes: 0

Watchers: 5

Created: 2023/11/30 4:36 PM

Updated: 2024/10/21 5:40 PM

Resolved: 2023/12/01 7:59 AM