-
Epic
-
Resolution: Unresolved
-
Critical
-
None
-
None
-
RHEL AI Third-Party Model Validation Support for Summit '25
-
False
-
-
Not Selected
-
To Do
Goal:
Validate third-party models in InstructLab flows and Inference flows on vLLM for Summit by doing inference performance benchmarking and accuracy evaluations for third-party models to give customers flexibility, confidence and predictability bringing third-party models to Instruct Lab and vLLM within RHEL AI.
- Implement Llama 3.3 70B Teacher Support in InstructLab for RHEL AI 1.5
- inference covered here with i lab serve test
- Enable Granite 3.1 8B base starter v2 (base) Student Support in InstructLab for RHEL AI 1.5
- inference covered here with i lab serve test
- Inference testing w/ ilab download, ilab model list, and ilab serve for:
- the models listed here: https://docs.google.com/spreadsheets/d/1NGPhJV0pk7jYuAFOHk7aWPomX7Svb_-Xa-OVUVtpNbM/edit?gid=1505755754#gid=1505755754 - validated model list
- Models packaged and validated for use on Quay for InstructLab flows
Acceptance Criteria:
- Instruct Lab flows enable users to swap the teacher out (pulled in from Quay) to Llama 3.3 70B, student should remain as Granite 3.1 8B
- InstructLab flows enable users to swap the student out (pulled in from Quay) to Llama 3.1 8B, teacher should remain as Mixtral 7x8B
- Give users confidence deploying inference w/:
- End-to-end functional tests are completed in RHEL AI 1.5
- Note: the inference performance benchmarks and general evaluation (OpenLLMLeaderboard v1/v2 ONLY) will be done by the PSAP team.
- We don't need to show dk-bench results/significant accuracy improvements for Summit Timeline.
- The story is around flexibility and optionality to BYOM (not that you get better performance using these third-party models...yet)
- depends on
-
RHELAI-3559 [ilab] Running a third-party Llama 3.3 70B Instruct model as teacher model (+ inference functional testing) in ilab tuning flow
-
- In Progress
-
-
RHELAI-3560 [ilab] Exposing third-party models for both inference and ilab flows in RHEL AI and RHOAI (Push to Quay)
-
- In Progress
-
-
RHELAI-3481 [Llama Stack] InstructLab with 3rd Party Models (Inference Only)
-
- Closed
-
- is blocked by
-
RHELAI-3560 [ilab] Exposing third-party models for both inference and ilab flows in RHEL AI and RHOAI (Push to Quay)
-
- In Progress
-