Loading...

XML

Word

Printable

Epic Name:
Helm charts to run privately hosted AI models for developer use
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Epic Status:
Done
Feature Link:
RHDP-1097 - Helm charts and how-to guide to run OpenShift hosted AI models for developer use
Hierarchy Progress Bar:

0% To Do, 0% In Progress, 100% Done
Intelligence Requested:
Market:

Epic Goal

Work covers the Helm Chart that helps developers to run a privately hosted AI model for development
We'll be focusing on using Granite models
Two Helm chart deliveries:
- Something similar to what we're delivering for ~~DEVAI-192~~: llama.cpp, CPU-only, but this chart would not have an app in it nor app deployment
- Uses VLLM and GPUs so you can use a larger Granite model and get faster responses (definitely better for a shared model)

CI - MUST be running successfully with tests automated
Release Technical Enablement - Provide necessary release enablement details and documents.
...

Acceptance criteria are met
Non-functional properties of the Feature have been validated (such as performance, resource, UX, security or privacy aspects)
User Journey automation is delivered
Support and SRE teams are provided with enough skills to support the feature in production environment

depends on

RHIDP-10160 Automate the conversion from Software Templates to Helm Chart on AI samples