Loading...

XML

Word

Printable

Type: Bug
Resolution: Cannot Reproduce
Priority: Undefined
Fix Version/s: None
Affects Version/s: 4.12.z
Component/s: kube-apiserver
Labels:

Activity Type:
Quality / Stability / Reliability
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Story Points:
None
Severity:
Important
Regression:
No
Latest Status Summary:

Hide
7/5: pending repro --> more/better logs
6/7: telco review pending triage

Show
7/5: pending repro --> more/better logs 6/7: telco review pending triage

Target Backport Versions:
None
Target Version:
None
Release Blocker:
Rejected
Sprint:
None

Internal Whiteboard:
RH Private Keywords:

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Priority Data:
PX Impact Score:

Release Note Status:
None
Release Note Type:
None
Release Note Text:
None

Escape Reason:
None
Escape Impact:
None
Corrective Measures:
None
SDLC stage when should've been found:
None

Description of problem:


After performing several reboots in a row, SNO cluster API does not respond anymore:

The connection to the server api.cloudransno-site1.slcm1.bos2.lab:6443 was refused - did you specify the right host or port?

Version-Release number of selected component (if applicable):

4.12.16

How reproducible:


We run a test that performs several reboot in a row. We see this issue with a high rate every time we run that test. We say in 4.12.16 100%of times, and now also in 4.12.21 happened the first time we run the test.

Steps to Reproduce:

1. Reboot SNO cluster 5 times
2. Check API

Actual results:


Node does not respond anymore. I left it several hours but it did not come back.

Expected results:


Node recovers properly

Additional info:


System Impact: Very severe. Node cannot be longer used

ACM reports: 

The kube-apiserver is not ok, status code: 0, Get "https://172.31.0.1:443/livez": dial tcp 172.31.0.1:443: connect: connection refused

oc adm must gather cannot be performed. Only SOS report. Logs attached

Assignee:: Rodrigo Lopez Manrique (Inactive)

Reporter:: Rodrigo Lopez Manrique (Inactive)

QA Contact:: Ke Wang

Need Info From:: None

Votes:: 0 Vote for this issue

Watchers:: 9 Start watching this issue

Created:: 2023/06/01 7:59 AM

Updated:: 2025/09/13 2:51 AM

Resolved:: 2023/07/26 12:44 PM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates