Epic Goal

Run and submit results for Llama-2-70b and Mixtral 8x22b MLPerf inference benchmarks for v4.1 submission

Why is this important?

Validate that we can effectively run SoTA inference workloads on our platforms
Publish industry standard benchmarks demonstrating competitive performance of these workloads on Red Hat platforms

CI - MUST be running successfully with tests automated
Release Technical Enablement - Provide necessary release enablement details and documents.
...

CI - CI is running, tests are automated and merged.
Release Enablement <link to Feature Enablement Presentation>
DEV - Upstream code and tests merged: <link to meaningful PR or GitHub Issue>
DEV - Upstream documentation merged: <link to meaningful PR or GitHub Issue>
DEV - Downstream build attached to advisory: <link to errata>
QE - Test plans in Polarion: <link or reference to Polarion>
QE - Automated tests merged: <link or reference to automated tests>
DOC - Downstream documentation merged: <link to meaningful PR>