Uploaded image for project: 'OpenShift Bugs'
  1. OpenShift Bugs
  2. OCPBUGS-57434

High latency in jobs and pods

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Done
    • Icon: Critical Critical
    • 4.19.0
    • 4.18
    • Node / Kueue
    • Quality / Stability / Reliability
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None
    • None
    • OCP Node Sprint 273 (Green)
    • 1
    • None
    • None
    • None
    • None
    • None
    • None
    • None

      I've observed a significant increase in the latency when creating jobs/pods at scale assigned to a Kueue's LocalQueue compared to an scenario not using Kueue.

      In a compact cluster scenario using Kueue-operator 0.1.0, where we create either 500 replicas of pause pods or 500 replicas of a simple sleeping job.

      In the case of pod ready latency, it goes from 2 seconds withouth Kueue to 40 seconds aprox when using Kueue

      And in the case of jobs (with parallelism 1), StartTime latency increases from 0 seconds to 45 seconds aprox.

        1. screenshot-2.png
          screenshot-2.png
          13 kB
        2. screenshot-1.png
          screenshot-1.png
          13 kB
        3. image-2025-07-11-14-21-23-273.png
          image-2025-07-11-14-21-23-273.png
          18 kB
        4. image-2025-07-11-14-19-57-788.png
          image-2025-07-11-14-19-57-788.png
          18 kB

              skunkerk Sohan Kunkerkar
              rsevilla@redhat.com Raul Sevilla Canavate
              None
              None
              Aditi Sahay Aditi Sahay
              None
              Votes:
              0 Vote for this issue
              Watchers:
              6 Start watching this issue

                Created:
                Updated:
                Resolved: