- Bug
- Resolution: Done-Errata
- Critical
- 4.14.0
- Critical
- No
- Proposed
- False
- Known Issue
- Done
8/1: critical/blocker, blocking CI lanes
Description of problem:
Critical features that the Node Tuning Operator provides through PAO, such as CPU load balancing and CPU quota, are failing.
For example, starting guaranteed (GU) pods, such as those used to run latency tests (oslat, etc.), fails with the error:
  Normal   Scheduled       25s              default-scheduler  Successfully assigned default/pod1 to ocp-worker-0.libvirt.lab.eng.tlv2.redhat.com
  Normal   AddedInterface  24s              multus             Add eth0 [10.135.0.82/23] from ovn-kubernetes
  Normal   Pulling         24s              kubelet            Pulling image "quay.io/openshift-kni/cnf-tests:4.13"
  Normal   Pulled          3s               kubelet            Successfully pulled image "quay.io/openshift-kni/cnf-tests:4.13" in 21.459560425s (21.459579982s including waiting)
  Normal   Pulled          2s               kubelet            Container image "quay.io/openshift-kni/cnf-tests:4.13" already present on machine
  Normal   Created         1s (x2 over 2s)  kubelet            Created container test-container1
  Warning  Failed          1s (x2 over 2s)  kubelet            Error: failed to run pre-start hook for container "test-container1": set CPU load balancing: disabling CPU load balancing on cgroupv2 not yet supported
Version-Release number of selected component (if applicable):
4.14.0-0.nightly-2023-07-27-172239
How reproducible:
Every time
Steps to Reproduce:
1. Install OCP version 4.14
2. Apply a Performance profile
3. Create a GU pod with the CRI-O annotation to disable CPU load balancing:

    apiVersion: v1
    kind: Pod
    metadata:
      name: pod1
      annotations:
        cpu-load-balancing.crio.io: "disable"
      labels:
        name: "cpuloadbalancing1"
    spec:
      containers:
      - name: test-container1
        image: quay.io/openshift-kni/cnf-tests:4.13
        command:
        - sleep
        - inf
        resources:
          limits:
            memory: "100Mi"
            cpu: "4"
          requests:
            memory: "100Mi"
            cpu: "4"
      runtimeClassName: performance-performance
      nodeSelector:
        kubernetes.io/hostname: ocp-worker-0.libvirt.lab.eng.tlv2.redhat.com

4. Apply the above YAML
Actual results:
Events:
  Type     Reason          Age              From               Message
  ----     ------          ----             ----               -------
  Normal   Scheduled       25s              default-scheduler  Successfully assigned default/pod1 to ocp-worker-0.libvirt.lab.eng.tlv2.redhat.com
  Normal   AddedInterface  24s              multus             Add eth0 [10.135.0.82/23] from ovn-kubernetes
  Normal   Pulling         24s              kubelet            Pulling image "quay.io/openshift-kni/cnf-tests:4.13"
  Normal   Pulled          3s               kubelet            Successfully pulled image "quay.io/openshift-kni/cnf-tests:4.13" in 21.459560425s (21.459579982s including waiting)
  Normal   Pulled          2s               kubelet            Container image "quay.io/openshift-kni/cnf-tests:4.13" already present on machine
  Normal   Created         1s (x2 over 2s)  kubelet            Created container test-container1
  Warning  Failed          1s (x2 over 2s)  kubelet            Error: failed to run pre-start hook for container "test-container1": set CPU load balancing: disabling CPU load balancing on cgroupv2 not yet supported
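Since this failure blocks CI lanes, it may help to detect it automatically in captured pod descriptions. The following is a minimal sketch, not part of any existing test suite: the function name `find_load_balancing_failures` is hypothetical, and the pattern is built only from the Warning event text quoted in this report.

```python
import re
from typing import List

# Message fragment copied verbatim from the Warning event in this report.
CGROUPV2_LB_ERROR = "disabling CPU load balancing on cgroupv2 not yet supported"

# Matches the pre-start hook failure and captures the container name.
_HOOK_FAILURE = re.compile(
    r'failed to run pre-start hook for container "([^"]+)": '
    r"set CPU load balancing: " + re.escape(CGROUPV2_LB_ERROR)
)


def find_load_balancing_failures(describe_output: str) -> List[str]:
    """Return names of containers that hit the cgroup v2 load-balancing error.

    `describe_output` is assumed to be the text of a pod description that
    includes the Events section. Names are returned in first-seen order
    with repeats removed.
    """
    return list(dict.fromkeys(_HOOK_FAILURE.findall(describe_output)))
```

Fed the events above, the function would report `test-container1`; an empty list means this particular error was not seen, not that the pod is healthy.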
Expected results:
Pod should be running and cpu load balancing should be disabled.
Additional info:
- causes: OCPBUGS-17858 masters reboot after applying PerformanceProfile for workers (Closed)
- is caused by: OCPSTRAT-696 Make CgroupV2 default in 4.14 (Closed)
- links to: RHSA-2023:5006 OpenShift Container Platform 4.14.z security update
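Because the error only appears on cgroup v2 and the linked OCPSTRAT-696 makes cgroup v2 the default in 4.14, it can be useful to confirm which cgroup mode an affected node is running. The sketch below relies on the kernel convention that `cgroup.controllers` exists at the root of a cgroup v2 (unified) hierarchy; the function name `is_cgroup_v2` and the default mount point `/sys/fs/cgroup` are assumptions for illustration, and the script would need to run on the node itself.

```python
from pathlib import Path


def is_cgroup_v2(cgroup_root: str = "/sys/fs/cgroup") -> bool:
    """Return True if `cgroup_root` looks like a cgroup v2 (unified) root.

    The check is a heuristic: the `cgroup.controllers` file is present at
    the root of the unified hierarchy and absent from a cgroup v1 mount.
    """
    return (Path(cgroup_root) / "cgroup.controllers").is_file()


if __name__ == "__main__":
    mode = "v2" if is_cgroup_v2() else "v1 (or cgroup filesystem not mounted here)"
    print(f"cgroup mode: {mode}")
```

A `v2` result on the worker is consistent with the pre-start hook error reported above.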