Uploaded image for project: 'Red Hat Internal Developer Platform'
  1. Red Hat Internal Developer Platform
  2. RHIDP-10239

Helm charts to run privately hosted AI models for developer use

    • Icon: Epic Epic
    • Resolution: Done
    • Icon: Undefined Undefined
    • None
    • None
    • None
    • Helm charts to run privately hosted AI models for developer use
    • False
    • Hide

      None

      Show
      None
    • False
    • Done
    • RHDP-1097 - Helm charts and how-to guide to run OpenShift hosted AI models for developer use
    • 0% To Do, 0% In Progress, 100% Done

      Epic Goal

      • Work covers the Helm Chart that helps developers to run a privately hosted AI model for development
      • We'll be focusing on using Granite models
      • Two Helm chart deliveries:
        • Something similar to what we're delivering for DEVAI-192: llama.cpp, CPU-only, but this chart would not have an app in it nor app deployment
        • Uses VLLM and GPUs so you can use a larger Granite model and get faster responses (definitely better for a shared model)

      Why is this important?

      Scenarios

      1. ...

      Acceptance Criteria (Mandatory)

      • CI - MUST be running successfully with tests automated
      • Release Technical Enablement - Provide necessary release enablement details and documents.
      • ...

      Dependencies (internal and external)

      1. ...

      Previous Work (Optional):

      Open questions::

      •  

      Done Checklist

      • Acceptance criteria are met
      • Non-functional properties of the Feature have been validated (such as performance, resource, UX, security or privacy aspects)
      • User Journey automation is delivered
      • Support and SRE teams are provided with enough skills to support the feature in production environment

              Unassigned Unassigned
              eyuen@redhat.com Elson Yuen
              RHIDP - AI
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

                Created:
                Updated:
                Resolved: