Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-35508

system-reserved doesn't get enforced to system daemons

    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • Critical
    • No
    • None
    • None
    • None
    • OCPNODE Sprint 256 (Blue)
    • 1
    • Customer Escalated
    • None
    • None
    • None

      Description of problem:

      The systemReserved settings via kubeletConfig don't get enforced to the system daemons.
      
      Customer experienced high load on a node on 30 May - "with the heavy load starting around 1900, we completely lost our node-exporter metrics at around 1925 and again at 20:40 for 5-10 minutes each."
      
      Despite having KubeletConfig with systemReserved.cpu: 4000m, the cpu shares in /sys/fs/cgroup/cpu/system.slice/cpu.shares shows 1024.
      
      Kubelet and CRI-O were unable to function.

      Version-Release number of selected component (if applicable):

      4.12.57

      How reproducible:

      100%

      Steps to Reproduce:

      1. Set system-reserved via kubeletconfig
      2. When the system is under high load, kubelet and crio don't function
      

      Actual results:

      The system daemons including kubelet and crio don't function under high load and the system-reserved doesn't get enforced

      Expected results:

      The system daemons including kubelet and crio should be working fine when system-reseved is set

      Additional info:

       

              harpatil@redhat.com Harshal Patil
              rhn-support-alosingh Alok Singh
              None
              None
              Cameron Meadors Cameron Meadors
              None
              Votes:
              0 Vote for this issue
              Watchers:
              11 Start watching this issue

                Created:
                Updated:
                Resolved: