-
Story
-
Resolution: Unresolved
-
Undefined
-
None
-
13
-
False
-
-
False
-
-
Identify an AI Workload suitable for multi-arch CPU-mode testing.
The first candidate to be pursued, vLLM supports CPU-Mode as documented here
https://docs.vllm.ai/en/stable/getting_started/installation/cpu.html#set-up-using-docker
Document investigation findings along with setup procedures and initial test results, if possible.
- is cloned by
-
RHELPERF-118 Investigate GuideLLM workload harness for CPU-Mode inferencing performance comparison
-
- In Progress
-