  1. OpenShift Bugs
  2. OCPBUGS-57388

Guaranteed QOS VMI pod slowness due to CPU throttling


    • Bug
    • Resolution: Unresolved
    • Critical
    • 4.20.0
    • Node / CRI-O
    • Quality / Stability / Reliability
    • Approved
    • OCP Node Sprint 273 (blue), OCP Node Sprint 274 (blue), OCP Node Sprint 275 (blue)
    • 3
    • In Progress
    • Bug Fix
    • Fixed incorrect cpu.max settings for Guaranteed QoS pods when the CPU manager is in use.

      Description of problem:

      When a VMI is started with Guaranteed QoS, the resulting virt-launcher pod is very slow. Analysis shows that the cgroups are configured as follows:
      ```
      /<path-to-pod>/<our-container>/cpu.max (1000 100000)
      /<path-to-pod>/<our-container>/container/cpu.max (max 100000)
      ```
      This means the CPU is throttled: a quota of 1000 µs per 100000 µs period limits the container's cgroup to 1% of one CPU.
      
      On 4.19, the cgroups for the same pod are:
      ```
      /<path-to-pod>/<our-container>/cpu.max (200000 100000)
      /<path-to-pod>/<our-container>/container/cpu.max (200000 100000)
      ```
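
To make the comparison concrete, here is a minimal Python sketch that interprets cpu.max values like the ones above. It assumes the cgroup v2 format `<quota> <period>` in microseconds, where `max` means no quota, and that the parentheses in the listings are only the reporter's notation. The helper names (`parse_cpu_max`, `cpu_limit`, `effective_limit`) are hypothetical and not part of CRI-O or KubeVirt.

```python
from typing import List, Optional, Tuple


def parse_cpu_max(text: str) -> Tuple[Optional[int], int]:
    """Parse cpu.max contents into (quota_us, period_us).

    quota_us is None when the quota field is "max" (no limit).
    """
    fields = text.split()
    if len(fields) != 2:
        raise ValueError(f"unexpected cpu.max contents: {text!r}")
    quota = None if fields[0] == "max" else int(fields[0])
    return quota, int(fields[1])


def cpu_limit(text: str) -> Optional[float]:
    """Return the limit in CPUs (quota / period), or None if unlimited."""
    quota, period = parse_cpu_max(text)
    return None if quota is None else quota / period


def effective_limit(chain: List[str]) -> Optional[float]:
    """Return the tightest limit along a parent-to-child cgroup chain.

    A child cgroup cannot use more CPU than any of its ancestors allows,
    so the effective limit is the minimum of the finite limits.
    """
    limits = [lim for lim in map(cpu_limit, chain) if lim is not None]
    return min(limits) if limits else None


if __name__ == "__main__":
    # Values copied from the two listings above (parent first, then child).
    print("4.20:", effective_limit(["1000 100000", "max 100000"]), "CPUs")
    print("4.19:", effective_limit(["200000 100000", "200000 100000"]), "CPUs")
```

Under these assumptions, the `max` quota on the inner `container` cgroup in 4.20 does not help, because the parent cgroup's quota still caps everything beneath it.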

      Version-Release number of selected component (if applicable):

      4.20    

      How reproducible:

      Every time

      Steps to Reproduce:

          1. Start a VM with dedicated CPUs
          2. Look at the cgroups on the node where the pod has been scheduled
          3. Try to open a console to the VMI
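
Step 2 can be sketched as a small Python script run on the node. It only assumes that the pod's cgroup directory is known and passed as the first argument; the path is elided in this report, so none is hard-coded here. The function name `find_cpu_max` is hypothetical.

```python
import os
import sys
from typing import Iterator, Tuple


def find_cpu_max(root: str) -> Iterator[Tuple[str, str]]:
    """Yield (path, contents) for every cpu.max file under root."""
    for dirpath, _dirnames, filenames in os.walk(root):
        if "cpu.max" not in filenames:
            continue
        path = os.path.join(dirpath, "cpu.max")
        try:
            with open(path) as f:
                yield path, f.read().strip()
        except OSError:
            # The cgroup may disappear while the tree is being walked.
            continue


# Usage (hypothetical script name): python3 show_cpu_max.py <pod-cgroup-dir>
if __name__ == "__main__" and len(sys.argv) > 1:
    for path, contents in sorted(find_cpu_max(sys.argv[1])):
        print(f"{path}: {contents}")
```

Printing every level of the pod's subtree makes it easy to spot a quota set on a parent cgroup even when the innermost cgroup reports `max`.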
          

      Actual results:

      The VMI boots very slowly

      Expected results:

      The VMI boots at normal speed

      Additional info:

      Upstream CRI-O issue: https://github.com/cri-o/cri-o/issues/9251
      
      
      We also noticed that the CI runs slower overall with provider 1.33: the test suite takes ~1h30m with provider 1.32, versus 4h20m with provider 1.33.
      My personal suspicion is that something is wrong even for pods without Guaranteed QoS.

              rh-ee-atokubi Ayato Tokubi
              ffossemo@redhat.com Federico Fossemo
              Min Li Min Li