Performance and Scale for AI Platforms / PSAP-1118

Adapt / rewrite llm-load-test to be general purpose for more models


    • Type: Story
    • Resolution: Done
    • Priority: Normal

      User Story:
      As a performance engineer

      I want a general-purpose load-testing tool for performance testing large language models and the underlying platform they are deployed on (the ModelMesh / KServe / watsonx stack). The tool should be able to query models over either a gRPC or a REST API.

      So that I can use it to test many different models with only minor changes to a config file.
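
      To make the user story concrete, here is a minimal sketch of what a config-driven load test could look like. It is not the llm-load-test implementation. Every name in it (LoadTestConfig, load_config, run_load_test, fake_sender, and all config fields) is hypothetical, and JSON is assumed as the config format only because it needs nothing beyond the standard library. Real gRPC or REST clients would be plugged in as sender functions; a stub stands in for them here so the sketch runs without a deployed model.

```python
import json
import time
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class LoadTestConfig:
    """Hypothetical config schema; these field names are assumptions."""
    protocol: str             # "rest" or "grpc"
    endpoint: str             # where the model is served
    model_id: str             # which model to query
    concurrency: int          # number of parallel workers
    requests_per_worker: int  # requests each worker sends
    prompts: List[str]        # prompts cycled through by the workers


# A sender takes the config and a prompt and returns the model's response.
Sender = Callable[[LoadTestConfig, str], str]


def load_config(text: str) -> LoadTestConfig:
    """Parse a JSON document into a LoadTestConfig."""
    return LoadTestConfig(**json.loads(text))


def run_load_test(cfg: LoadTestConfig, senders: Dict[str, Sender]) -> List[float]:
    """Send requests from cfg.concurrency workers; return per-request latencies in seconds."""
    send = senders[cfg.protocol]  # raises KeyError for an unsupported protocol

    def worker(worker_id: int) -> List[float]:
        latencies = []
        for i in range(cfg.requests_per_worker):
            prompt = cfg.prompts[(worker_id + i) % len(cfg.prompts)]
            start = time.perf_counter()
            send(cfg, prompt)
            latencies.append(time.perf_counter() - start)
        return latencies

    with ThreadPoolExecutor(max_workers=cfg.concurrency) as pool:
        per_worker = list(pool.map(worker, range(cfg.concurrency)))
    return [latency for chunk in per_worker for latency in chunk]


def fake_sender(cfg: LoadTestConfig, prompt: str) -> str:
    """Stand-in for a real REST or gRPC client call (assumption for this sketch)."""
    return f"{cfg.model_id}: echo {prompt}"
```

      With this shape, switching to a different model or protocol would mean editing only the config text, for example changing "protocol" from "rest" to "grpc" and updating "endpoint" and "model_id", while the driver code stays the same.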

       

      Notes:

      • We created github.com/openshift-psap/llm-load-test to test the Ansible Lightspeed model, but it is currently just a small set of scripts that are heavily hardcoded for that model. We should see whether we can leverage an existing tool such as iter8, or adapt llm-load-test to build on top of it.

      Acceptance criteria:

            dagray@redhat.com David Gray
