Type: Epic
Priority: Major
Resolution: Unresolved
Epic Name: RHOAI Model Serving CPT Q4 2024
Components: Inference, RHOAI
Status breakdown: 90% To Do, 10% In Progress, 0% Done
Epic Goal
Performance testing for RHOAI model serving is an ongoing effort that includes running the CPT, analyzing the results, expanding our test coverage, and iterating on the tools involved. This epic is meant to capture the model serving performance work that doesn't require a full epic of its own for tracking. For illustration, a sketch of a single measurement pass is shown below.
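As a rough illustration only, the sketch below shows what one latency-measurement pass against an OpenAI-compatible model serving endpoint could look like. The endpoint URL, model name, and payload are placeholders, not the actual CPT tooling, which is normally driven by dedicated load-generation and analysis tools.

```python
"""Hypothetical sketch of a single CPT-style latency pass.

The endpoint, model name, and payload below are assumed placeholders.
"""
import statistics
import time

import requests

ENDPOINT = "http://example-model-route/v1/completions"  # placeholder route
PAYLOAD = {"model": "example-model", "prompt": "Hello", "max_tokens": 64}


def run_pass(num_requests: int = 20) -> None:
    """Send a batch of requests and report latency percentiles."""
    latencies = []
    for _ in range(num_requests):
        start = time.perf_counter()
        resp = requests.post(ENDPOINT, json=PAYLOAD, timeout=120)
        resp.raise_for_status()
        latencies.append(time.perf_counter() - start)

    # quantiles(n=100) yields 99 cut points; index 94 ~ p95, index 98 ~ p99
    cuts = statistics.quantiles(latencies, n=100)
    print(f"p50={statistics.median(latencies):.3f}s "
          f"p95={cuts[94]:.3f}s p99={cuts[98]:.3f}s")


if __name__ == "__main__":
    run_pass()
```

Comparing percentile results like these across releases is the kind of analysis step the epic refers to; the real runs cover more metrics (throughput, time to first token, token latency) and hardware configurations.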
Why is this important?
- The underlying components are quickly evolving and there is a constant stream of new capabilities and configurations for us to test.
- LLM model serving is currently a top priority for OpenShift AI and the company.
- These workloads are performance-sensitive and require expensive hardware to run effectively. Many customers are interested in leveraging LLMs for their business use cases, but performance and cost efficiency are critical to doing so.
- We need to catch any potential regressions in the LLM model serving stack in RHOAI as early as possible.