Uploaded image for project: 'AI Platform Core Components'
  1. AI Platform Core Components
  2. AIPCC-4392

Build rhaiis-model-opt image for CUDA

    • False
    • Hide

      None

      Show
      None
    • True
    • 0% To Do, 0% In Progress, 100% Done
    • M
    • Hide

      October 2, 2025 (GREEN): done, released together with the rest of the artifacts in RHAIIS 3.2.2. We are waiting for INFERENG-1836 to be closed before closing the entire Feature.

      Show
      October 2, 2025 ( GREEN ): done, released together with the rest of the artifacts in RHAIIS 3.2.2. We are waiting for INFERENG-1836 to be closed before closing the entire Feature.

      Feature Overview:

      The goal is to provide a new image as part of RHAIIS with model optimization components. We will start with llm-compressor.

      Product(s) associated:

      RHAIIS
      **

      Goals:

      • Build the packages and image for model optimizations tools as part of RHAIIS for CUDA
      • quay.io (stage) image name: rhaiis-model-opt/cuda-ubi9
      • Pyxis (production) image name: rhaiis/model-opt-cuda-rhel9

      Requirements:

      • A new image, rhaiis-model-opt, is created.
      • The new image follows the same versioning as the main rhaiis image containing vllm.
      • The new image includes llm-compressor and its dependencies.
      • The packages to include are documented in https://issues.redhat.com/browse/INFERSTRAT-72

      Done - Acceptance Criteria:
      Acceptance Criteria articulates and defines the value proposition - what is required to meet the goal and intent of this Feature. The Acceptance Criteria provides a detailed definition of scope and the expected outcomes - from a users point of view

      Use Cases - i.e. User Experience & Workflow:
      Include use case diagrams, main success scenarios, alternative flow scenarios.

      Out of Scope:

      Details of this feature were copied from: https://issues.redhat.com/browse/INFERSTRAT-68 . GuideLLM is not in scope for 3.2.2, but will be looked into in future releases.

      Documentation Considerations :

      The RHAIIS team is responsible for product documentation.

              klara@redhat.com Klara Bezdekova
              selbi@redhat.com Selbi Nuryyeva
              Doug Hellmann
              Votes:
              0 Vote for this issue
              Watchers:
              10 Start watching this issue

                Created:
                Updated:
                Resolved: