Uploaded image for project: 'Red Hat Enterprise Linux AI'
  1. Red Hat Enterprise Linux AI
  2. RHELAI-3778

Work to support distributed training on llama-stack for summit

XMLWordPrintable

    • Summit 2025 Training Demo on LLS
    • False
    • Hide

      None

      Show
      None
    • False
    • Not Selected
    • In Progress
    • RHELAI-3905 - Booth Demo Story (Your Data/Your Model/Your Move)
    • 0% To Do, 20% In Progress, 80% Done

      Goal:

      Provide high-level goal statement; providing user context and expected user outcome(s) for this Epic. 2-3 sentences... 

      • Running Distributed Training via LLS currently is not possible in a manner compatible with RHOAI or RHEL AI. A goal for summit is to enabled SDG and Training via LLS. A key part of this work will be enabling distributed training via a provider likely using Kubeflow in some fashion.
         

      Acceptance Criteria:

      The Acceptance Criteria provides a definition of scope and the expected outcomes - from a users point of view - defines the value proposition

      • Distributed Training can be run via LLS and is compatible with SDG output using Kubeflow.

       

      Repo for Summit related work: https://github.com/opendatahub-io/llama-stack-provider-kft

              shan@redhat.com Sebastien Han
              cdoern@redhat.com Charles Doern
              Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

                Created:
                Updated:
                Resolved: