Type: Story
Resolution: Done
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: None
Labels:
- AI/ML
- RHOAI
- RHODS

Epic Link:
Performance and Scale testing for RHOAI releases with KServe stack
Ready:
False
Blocked:
False
Blocked Reason:
None

Intelligence Requested:
Market:

User Story:
As a performance engineer,

I want a general-purpose load testing tool for performance testing large language models and the underlying platform they are deployed on (ModelMesh / KServe / watsonx stack). The tool should be able to query models over either a gRPC or a REST API,

so that I can use it to test many different models with only minor changes to a config file.

Notes:

We created github.com/openshift-psap/llm-load-test for testing the Ansible Lightspeed model, but it is currently just a short set of scripts with many values hardcoded for that one model. We should see whether we can leverage an existing tool such as iter8, or adapt llm-load-test to build on top of such a tool.
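
To make the "minor changes to a config file" goal concrete, here is a minimal sketch of a config-driven load generator. It is not the llm-load-test implementation: the config field names, `load_config`, `run_load_test`, and `fake_sender` are all hypothetical, and the sender is a stand-in so the sketch runs without a model server. A real tool would register one sender per protocol (REST and gRPC) in place of the stub.

```python
import json
import statistics
import time
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class LoadTestConfig:
    # Hypothetical schema for illustration; not llm-load-test's actual format.
    protocol: str      # "rest" or "grpc"
    endpoint: str      # host:port or URL of the model server
    model: str         # model identifier to query
    concurrency: int   # number of parallel workers
    requests: int      # total number of requests to send
    prompt: str        # input text sent with every request


def load_config(text: str) -> LoadTestConfig:
    """Parse a JSON config string into a LoadTestConfig."""
    return LoadTestConfig(**json.loads(text))


Sender = Callable[[LoadTestConfig], str]


def fake_sender(cfg: LoadTestConfig) -> str:
    """Stand-in for a real REST or gRPC client call (assumption)."""
    return f"{cfg.model}: echo {cfg.prompt}"


# One entry per protocol; switching protocol is then a config change only.
SENDERS: Dict[str, Sender] = {"rest": fake_sender, "grpc": fake_sender}


def run_load_test(cfg: LoadTestConfig, senders: Dict[str, Sender] = SENDERS) -> dict:
    """Send cfg.requests requests with cfg.concurrency workers and time each one."""
    send = senders[cfg.protocol]

    def timed_call(_: int) -> float:
        start = time.perf_counter()
        send(cfg)
        return time.perf_counter() - start

    with ThreadPoolExecutor(max_workers=cfg.concurrency) as pool:
        latencies = list(pool.map(timed_call, range(cfg.requests)))

    return {
        "requests": len(latencies),
        "mean_s": statistics.mean(latencies),
        "max_s": max(latencies),
    }


if __name__ == "__main__":
    example = (
        '{"protocol": "grpc", "endpoint": "localhost:8033", "model": "demo",'
        ' "concurrency": 4, "requests": 20, "prompt": "hi"}'
    )
    print(run_load_test(load_config(example)))
```

With this shape, testing a different model or switching between gRPC and REST only requires editing the config values, which is the property the user story asks for.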

Acceptance criteria:

Assignee: David Gray

Reporter: David Gray

Votes: 0

Watchers: 1

Created: 2023/07/07 7:31 PM

Updated: 2023/12/04 8:14 PM

Resolved: 2023/12/04 8:14 PM
