Bug
Resolution: Done
Critical
4.12.z
Quality / Stability / Reliability
OCPNODE Sprint 256 (Blue)
Customer Escalated
Description of problem:
The systemReserved settings configured via KubeletConfig are not enforced on the system daemons. The customer experienced high load on a node on 30 May: "with the heavy load starting around 1900, we completely lost our node-exporter metrics at around 1925 and again at 20:40 for 5-10 minutes each." Despite a KubeletConfig with systemReserved.cpu: 4000m, /sys/fs/cgroup/cpu/system.slice/cpu.shares shows 1024, the cgroup v1 default. Kubelet and CRI-O were unable to function.
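To make the cpu.shares observation easier to check on a node, here is a minimal Python sketch. It assumes cgroup v1 and assumes that an enforced reservation would be translated with the usual kubelet conversion of 1024 shares per CPU (shares = milliCPU * 1024 / 1000, minimum 2). The helper names are hypothetical and are not part of any OpenShift tooling.

```python
from pathlib import Path

# Assumed conversion constants (kubelet-style milliCPU to cgroup v1 shares).
SHARES_PER_CPU = 1024
MILLI_CPU_PER_CPU = 1000
MIN_SHARES = 2

# Path quoted in the report; only valid on a cgroup v1 node.
SYSTEM_SLICE_SHARES = "/sys/fs/cgroup/cpu/system.slice/cpu.shares"


def parse_cpu_millis(quantity: str) -> int:
    """Convert a CPU quantity such as '4000m' or '4' to millicores."""
    q = quantity.strip()
    if q.endswith("m"):
        return int(q[:-1])
    return int(round(float(q) * MILLI_CPU_PER_CPU))


def expected_cpu_shares(quantity: str) -> int:
    """Shares an enforced reservation would yield under the assumed formula."""
    millis = parse_cpu_millis(quantity)
    return max(MIN_SHARES, millis * SHARES_PER_CPU // MILLI_CPU_PER_CPU)


def reservation_applied(quantity: str, observed_shares: int) -> bool:
    """True when the observed cpu.shares matches the expected value."""
    return observed_shares == expected_cpu_shares(quantity)


def read_cpu_shares(path: str = SYSTEM_SLICE_SHARES) -> int:
    """Read the current cpu.shares value from the cgroup filesystem."""
    return int(Path(path).read_text().strip())


if __name__ == "__main__":
    reserved = "4000m"  # systemReserved.cpu from the report
    observed = read_cpu_shares()
    print(f"expected={expected_cpu_shares(reserved)} observed={observed} "
          f"applied={reservation_applied(reserved, observed)}")
```

Under these assumptions, systemReserved.cpu: 4000m would correspond to 4096 shares, so the reported value of 1024 indicates the reservation was not applied to system.slice.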
Version-Release number of selected component (if applicable):
4.12.57
How reproducible:
100%
Steps to Reproduce:
1. Set system-reserved via a KubeletConfig.
2. Put the system under high load; kubelet and CRI-O stop functioning.
Actual results:
The system daemons, including kubelet and CRI-O, don't function under high load, and system-reserved is not enforced.
Expected results:
The system daemons, including kubelet and CRI-O, should keep working when system-reserved is set.
Additional info:
- is triggering: OCPNODE-3849 Corrective Measure for OCPBUGS-35508: system-reserved doesn't get enforced to system daemons (Closed)