-
Bug
-
Resolution: Done-Errata
-
Critical
-
None
-
4.16.0
-
Critical
-
No
-
CNF Compute Sprint 250, CNF Compute Sprint 251
-
2
-
Rejected
-
False
-
-
Release Note Not Required
-
In Progress
-
-
-
Description of problem:
On system with workload partitioning enabled setting cpu_manager to static and having reserved cpu causes kubelet fail to restart
Version-Release number of selected component (if applicable):
4.16.0-0.ci-2024-02-13-072746
How reproducible:
Everytime
Steps to Reproduce:
1. Enable workload partitioning 2. Label one of the worker nodes as worker-test 3. Create a mcp for worker-test node. 4. Create a kubelet config as show below: apiVersion: machineconfiguration.openshift.io/v1 kind: KubeletConfig metadata: name: kubelet-test spec: kubeletConfig: cpuManagerPolicy: static reservedSystemCPUs: 0,2,12,14 machineConfigPoolSelector: matchLabels: machineconfiguration.openshift.io/role: worker-test 5. Node goes to NotReady,SchedulingDisabled mode:
Actual results:
Node goes to NotReady,SchedulingDisabled mode [root@cnfdr22 tmp]# oc get nodes NAME STATUS ROLES AGE VERSION ocp-ctlplane-0.libvirt.lab.eng.tlv2.redhat.com Ready control-plane,master 116m v1.29.1+2f773e8 ocp-ctlplane-1.libvirt.lab.eng.tlv2.redhat.com Ready control-plane,master 116m v1.29.1+2f773e8 ocp-ctlplane-2.libvirt.lab.eng.tlv2.redhat.com Ready control-plane,master 116m v1.29.1+2f773e8 ocp-worker-0.libvirt.lab.eng.tlv2.redhat.com Ready worker 100m v1.29.1+2f773e8 ocp-worker-1.libvirt.lab.eng.tlv2.redhat.com Ready worker 100m v1.29.1+2f773e8 ocp-worker-2.libvirt.lab.eng.tlv2.redhat.com NotReady,SchedulingDisabled worker,worker-test 100m v1.29.1+2f773e8
Expected results:
Node should not go in to NotReady, SchedulingDisabled mode.
Additional info:
- blocks
-
OCPBUGS-31348 With workload partitioning enabled, setting cpu_manager to static and having reserved cpu causes kubelet fail to restart
- Closed
- is cloned by
-
OCPBUGS-31348 With workload partitioning enabled, setting cpu_manager to static and having reserved cpu causes kubelet fail to restart
- Closed
- links to
-
RHEA-2024:0041 OpenShift Container Platform 4.16.z bug fix update