Uploaded image for project: 'Performance and Scale for AI Platforms'
  1. Performance and Scale for AI Platforms
  2. PSAP-1415

Performance testing of caikit embeddings service

XMLWordPrintable

    • Icon: Epic Epic
    • Resolution: Unresolved
    • Icon: Major Major
    • None
    • None
    • None
    • None
    • Performance testing of caikit embeddings service
    • False
    • None
    • False
    • Not Selected
    • To Do
    • 0% To Do, 40% In Progress, 60% Done

      Epic Goal

      The Caikit embeddings service will be added to RHOAI in an upcoming release.  See feature refinement doc: https://docs.google.com/document/d/1UQf7aGvXEBKYoq5Fah0GNJjqEpHTqvWvwQ__LhdjS8c/edit

      This will require performance testing from us. This will require some new test / automation. It may be possible to extend llm-load-test to test embedding models, or we may want a separate tool.

      Why is this important?

      Scenarios

      1. ...

      Acceptance Criteria

      • CI - MUST be running successfully with tests automated
      • Release Technical Enablement - Provide necessary release enablement details and documents.
      • ...

      Dependencies (internal and external)

      1. ...

      Previous Work (Optional):

      Open questions::

      Done Checklist

      • CI - CI is running, tests are automated and merged.
      • Release Enablement <link to Feature Enablement Presentation>
      • DEV - Upstream code and tests merged: <link to meaningful PR or GitHub Issue>
      • DEV - Upstream documentation merged: <link to meaningful PR or GitHub Issue>
      • DEV - Downstream build attached to advisory: <link to errata>
      • QE - Test plans in Polarion: <link or reference to Polarion>
      • QE - Automated tests merged: <link or reference to automated tests>
      • DOC - Downstream documentation merged: <link to meaningful PR>

            rh-ee-dripberg Drew Ripberger
            dagray@redhat.com David Gray
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

              Created:
              Updated: