  1. OpenShift Request For Enhancement
  2. RFE-8679

Support generic job-related information for MPI, Spark, Ray and others


    • Feature Request
    • Resolution: Unresolved
    • Product / Portfolio Work

      Adapted from gap analysis doc: https://docs.google.com/document/d/1Oz0rvy9BtTkB8FrQtamMlIMOiDrM2Ov5pUTZDzNtKXQ/edit?tab=t.0

      Common HPC workloads, such as MPI, grid computing (both open and proprietary), Spark, and Ray, all need a standardized way to receive job-related information. Can a core OpenShift component such as Kueue be leveraged or extended to provide this mechanism, so that all of these frameworks benefit from a single solution?

       

      One option is to leverage "wrapper" solutions such as https://github.com/kubernetes-sigs/kjob for translating Slurm jobs to Kueue Jobs.

              gausingh@redhat.com Gaurav Singh
              jkincl@redhat.com Jason Kincl
              Votes: 0
              Watchers: 3
