Type: Story
Resolution: Obsolete
Priority: Normal
Fix Version/s: Jan 13
Affects Version/s: None
Component/s: None
Labels:
None

Epic Link:
RHOAI Model Serving CPT Q4 2024
Workstream:

Inference, RHOAI
Ready:
False
Blocked:
False
Blocked Reason:

None



User Story:
As a performance engineer, I want to measure the performance benefit of new vLLM features (chunked prefills and splitwise prefill and decode disaggregation) so that we have data to share with engineering teams and customers seeking guidance.

If these features prove beneficial, we can enable them in some of our CPT testing configurations to track their impact.

Acceptance criteria:

Assignee: David Whyte-Gray

Reporter: David Whyte-Gray

Votes: 0

Watchers: 1

Created: 2024/06/26 12:51 AM

Updated: 2026/01/13 3:17 PM

Resolved: 2026/01/05 8:34 PM
