-
Bug
-
Resolution: Done
-
Major
-
None
-
4.20
-
Quality / Stability / Reliability
-
False
-
-
None
-
None
-
None
-
None
-
None
-
None
-
OCP Node Sprint 274 (green)
-
1
-
Done
-
Bug Fix
-
-
None
-
None
-
None
-
None
thread for context - https://redhat-internal.slack.com/archives/C084N2C6P9U/p1751457510022409
Description of problem:
Kueue controller manager crashes frequently when it loses its lease.
Version-Release number of selected component (if applicable):
0.2.0
How reproducible:
Everytime
Steps to Reproduce:
Run the controller in cluster with high load which causes etcd to start fragmentation.
Actual results:
Expected results:
The controller should tolerate 1 minutes of api unavailability as recommended in https://github.com/openshift/enhancements/blob/0f916a52af1a6fbdab0c5b80ae0e66c7a27efb6a/CONVENTIONS.md#handling-kube-apiserver-disruption
Additional info:
- is duplicated by
-
RFE-7805 Allow to configure lease parameters for the kueue manager controller
-
- Closed
-
- links to