
    • Type: Epic
    • Resolution: Done
    • Priority: Major
    • Epic Name: RHOAI Model Serving CPT Q2 2024
    • Status: To Do
    • Progress: 0% To Do, 0% In Progress, 100% Done

      Epic Goal

      Performance testing for RHOAI model serving is an ongoing effort including running the CPT, analyzing the results, expanding our test coverage, and iterating on the tools involved. This epic is meant to capture the related work we intend to complete during Q2 2024.

      • Continuous performance testing of RHOAI model serving stack for release-to-release regression analysis
      • Improvements to the automated performance testing pipeline for RHOAI model serving stack and the included tools (llm-load-test, topsail)
      • Enhancements to the test coverage: new models, new runtimes, new hardware configurations
      • Performance experiments with LLM model serving on RHOAI, with the goal of gathering data which can be used to guide customers on sizing and hardware/platform recommendations for different models

      Why is this important?

      • LLM model serving is currently a top priority for OpenShift AI and the company.
      • These workloads are performance-sensitive and require expensive hardware to run effectively. Many customers are interested in leveraging LLMs for their business use cases, but performance and cost efficiency are critical in doing so.
      • We need to catch any potential regressions in the RHOAI LLM model serving stack as early as possible.

      Scenarios


      Acceptance Criteria

      • We have completed our planned model serving performance testing for each RHOAI release (starting with 2.10)
      • All enhancement stories have been completed or moved to a follow-up epic

      Dependencies (internal and external)


      Previous Work (Optional):

      1. Performance assessment to support watsonx.ai rebase - PSAP-1261
      2. https://github.com/openshift-psap/llm-load-test
      3. https://github.com/openshift-psap/topsail/tree/main/projects/kserve

      Open questions:

      1. The current single-model tests take >5 hours to run. How can we add more models and runtime combinations without increasing this length to 12+ hours? Different test cases that we run on different frequencies? Relegate some test cases to only one-off experiments?
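      One possible answer to this question can be sketched as a partitioning of the model × runtime test matrix under a per-release time budget: the per-release regression run keeps only as many cases as fit the budget, and the remainder move to a slower cadence (weekly runs or one-off experiments). Everything below is illustrative, not the actual CPT configuration: the model and runtime names, the per-case duration estimate, and the 5-hour budget are assumptions.

      ```python
      from itertools import product

      # Illustrative inputs; real model/runtime names and durations would
      # come from the CPT configuration, not these placeholder values.
      MODELS = ["llama-2-7b", "flan-t5-xl", "mpt-7b"]
      RUNTIMES = ["tgis", "vllm"]
      EST_HOURS = 1.5           # assumed average duration of one model/runtime case
      RELEASE_BUDGET_HOURS = 5  # cap for the per-release regression run

      def partition_cases(models, runtimes, est_hours, budget_hours):
          """Greedily fill the per-release tier up to the time budget;
          everything else runs on a slower cadence (e.g. weekly)."""
          per_release, periodic = [], []
          used = 0.0
          for case in product(models, runtimes):
              if used + est_hours <= budget_hours:
                  per_release.append(case)
                  used += est_hours
              else:
                  periodic.append(case)
          return per_release, periodic

      per_release, periodic = partition_cases(
          MODELS, RUNTIMES, EST_HOURS, RELEASE_BUDGET_HOURS
      )
      print(f"per-release: {len(per_release)} cases (~{len(per_release) * EST_HOURS:.1f}h)")
      print(f"periodic:    {len(periodic)} cases")
      ```

      With these assumed numbers, 3 of the 6 combinations fit the per-release budget and the other 3 fall to the periodic tier, keeping the regression run near its current length while still covering the full matrix over time.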

            dagray@redhat.com David Gray