Uploaded image for project: 'Performance and Scale for AI Platforms'
  1. Performance and Scale for AI Platforms
  2. PSAP-1435

MLPerf Inference v4.1 Submissions

XMLWordPrintable

    • Icon: Epic Epic
    • Resolution: Unresolved
    • Icon: Critical Critical
    • None
    • None
    • None
    • None
    • MLPerf Inference v4.1 Submissions
    • Inference, RHOAI
    • Not Selected
    • False
    • False
    • None
    • 13
    • 0% To Do, 33% In Progress, 67% Done

      Epic Goal

      • Run and submit results for Llama-2-70b and Mixtral 8x22b MLPerf inference benchmarks for v4.1 submission

      Why is this important?

      • Validate that we can effectively run SoTA inference workloads on our platforms
      • Publish industry standard benchmarks demonstrating competitive performance of these workloads on Red Hat platforms

      Scenarios

      1. ...

      Acceptance Criteria

      • CI - MUST be running successfully with tests automated
      • Release Technical Enablement - Provide necessary release enablement details and documents.
      • ...

      Dependencies (internal and external)

      1. ...

      Previous Work (Optional):

      Open questions::

      Done Checklist

      • CI - CI is running, tests are automated and merged.
      • Release Enablement <link to Feature Enablement Presentation>
      • DEV - Upstream code and tests merged: <link to meaningful PR or GitHub Issue>
      • DEV - Upstream documentation merged: <link to meaningful PR or GitHub Issue>
      • DEV - Downstream build attached to advisory: <link to errata>
      • QE - Test plans in Polarion: <link or reference to Polarion>
      • QE - Automated tests merged: <link or reference to automated tests>
      • DOC - Downstream documentation merged: <link to meaningful PR>

              dagray@redhat.com David Gray
              dagray@redhat.com David Gray
              David Gray
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: