Loading...

XML

Word

Printable

Type: Bug
Resolution: Not a Bug
Priority: Normal
Fix Version/s: None
Affects Version/s: 4.12.z
Component/s: kube-controller-manager
Labels:

Severity:
Moderate
Regression:
No
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Internal Whiteboard:
Latest Status Summary:

Hide
8/17: seeking review of PR; KNIECO-7492
8/1: a fix is proposed by Vitaly Grinberg which needs to be reviewed by the component owners.

Show
8/17: seeking review of PR; KNIECO-7492 8/1: a fix is proposed by Vitaly Grinberg which needs to be reviewed by the component owners.
RH Private Keywords:
Target Version:

4.14.0

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Impact Score:
PX Priority Data:

Description of problem:

KCM will restart when leader election failed due to the restart of kube-apiserver. This is because the leaseDurationSeconds for KCM is too short in SNO scenario, which is 15 seconds.

Version-Release number of selected component (if applicable):

OCP 4.12

How reproducible:

100%

Steps to Reproduce:

1.Kill kube-apiserver in SNO
$ oc exec -it -n openshift-kube-apiserver kube-apiserver-XXXXXX  -c kube-apiserver -- /bin/sh -c "kill 1" 
2. Watch the KCM Pods 
3.

Actual results:

The KCM will crash and restart

Expected results:

The KCM should survive

Additional info:

For other components, the leaseDurationSeconds almost meet this criteria:
https://github.com/openshift/library-go/blob/6ac65c5454f9effede61a6e52e7fdb06a27fc26e/pkg/config/leaderelection/leaderelection.go#L148

links to

openshift/cluster-kube-controller-manager-operator#741: Attain leader election times from topology setting

Assignee:: Filip Krepinsky

Reporter:: Chen Chen

QA Contact:: Ying Zhou

Votes:: 0 Vote for this issue

Watchers:: 8 Start watching this issue

Created:: 2023/04/06 1:53 PM

Updated:: 2024/06/05 3:52 AM

Resolved:: 2023/09/26 12:06 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates