-
Story
-
Resolution: Obsolete
-
Major
-
None
-
None
-
None
-
Product / Portfolio Work
-
Inference, RHOAI
-
False
-
False
-
-
8
-
PSAP - General-12, PSAP - General-13, PSAP - General-14
User Story:
Establish inference performance baselines for various load testing scenarios in llm-load-test. Models under consideration: the ones we currently test in model serving CPT, plus Granite 7B, Granite 3 8B, and any other models RHOAI QE is testing.
Acceptance criteria:
Performance test report
Analysis
Any guidance for docs
Next steps