-
Story
-
Resolution: Obsolete
-
Normal
-
None
-
None
-
None
User Story:
As a a performance engineer, I want to measure the performance benefit of new vLLM features (chunked prefills and splitwise prefill & decode disaggregation) in order to have data which can be shared with engineering teams and customers seeking guidance.
If these are beneficial we can use them in some of our CPT testing configurations to track the impact.
Acceptance criteria: