  1. OpenShift Bugs
  2. OCPBUGS-74175

[Kueue] - LeaderWorkerSet and Workers are restarting in a loop

    • Bug
    • Resolution: Not a Bug
    • 4.20
    • LeaderWorkerSet

      After installing LWS and Kueue and applying all the resources Kueue needs, the LWS and its Workers keep restarting in a loop as soon as they are created.

       

      Steps to reproduce:

      1. Install Kueue (Operator and Operand) and LWS (Operator and Operand)
        • Add LeaderWorkerSet to the Kueue Operand CR
      2. Create a ResourceFlavor, ClusterQueue, Namespace, and LocalQueue
      3. Apply the LeaderWorkerSet template from the Kueue docs (also attached to this bug): https://kueue.sigs.k8s.io/docs/tasks/run/leaderworkerset/#example
      4. Check whether the LWS and Workers were created
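
      For reference, step 2 could be reproduced with something like the manifest below. This is only a sketch modeled on the upstream Kueue quick-start, not the reporter's actual configuration: the resource names (default-flavor, cluster-queue, lws-test, user-queue), the quota values, and the kueue.x-k8s.io/v1beta1 API version are all assumptions and may need adjusting for the installed Kueue version.

```yaml
# Hypothetical names and quotas; adjust to match the cluster under test.
apiVersion: kueue.x-k8s.io/v1beta1
kind: ResourceFlavor
metadata:
  name: default-flavor
---
apiVersion: kueue.x-k8s.io/v1beta1
kind: ClusterQueue
metadata:
  name: cluster-queue
spec:
  namespaceSelector: {}   # empty selector: accept workloads from any namespace
  resourceGroups:
  - coveredResources: ["cpu", "memory"]
    flavors:
    - name: default-flavor
      resources:
      - name: cpu
        nominalQuota: 9
      - name: memory
        nominalQuota: 36Gi
---
apiVersion: v1
kind: Namespace
metadata:
  name: lws-test
---
apiVersion: kueue.x-k8s.io/v1beta1
kind: LocalQueue
metadata:
  namespace: lws-test
  name: user-queue
spec:
  clusterQueue: cluster-queue   # must match the ClusterQueue name above
```

      The LeaderWorkerSet applied in step 3 would then need to live in the same namespace and reference the LocalQueue through its kueue.x-k8s.io/queue-name label.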

      Actual: LWS and Workers keep restarting in a loop.

      Expected: LWS and Workers should be created successfully.

      P.S.: I've attached a video, "lws.mov", showing the behavior.

        1. lws.mov
          43.83 MB
          Alice Nahas
        2. sample-leaderworkerset.yaml
          0.7 kB
          Alice Nahas

              Unassigned
              Alice Nahas (rh-ee-anahas)
              Cameron Meadors