Story
Resolution: Unresolved
Use GuideLLM (from Neural Magic) as an AI workload suitable for multi-arch CPU-mode testing.
Use two inferencing engines: llama.cpp and vLLM, which supports a CPU mode as documented at
https://docs.vllm.ai/en/stable/getting_started/installation/cpu.html#set-up-using-docker
Document the investigation findings along with setup procedures and, if possible, initial test results.
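The steps above could be sketched roughly as follows. This is a hypothetical outline, not a verified procedure: the Dockerfile location, image flags, model name (Qwen/Qwen2.5-0.5B-Instruct is just a placeholder), and GuideLLM options are assumptions that may differ by vLLM and GuideLLM version, so check the linked docs before running.

```shell
# Build a CPU-only vLLM image from the vLLM source tree
# (Dockerfile path per the linked CPU installation docs; may vary by release).
docker build -f Dockerfile.cpu -t vllm-cpu-env --shm-size=4g .

# Serve a small placeholder model in CPU mode on the default port 8000.
docker run --rm --network=host vllm-cpu-env \
    --model Qwen/Qwen2.5-0.5B-Instruct

# Point GuideLLM (pip install guidellm) at the OpenAI-compatible endpoint
# and run a short benchmark sweep; flags are assumptions to be confirmed
# against the GuideLLM CLI help.
guidellm benchmark \
    --target "http://localhost:8000" \
    --rate-type sweep \
    --max-seconds 60
```

The same GuideLLM invocation should work unchanged against a llama.cpp OpenAI-compatible server, which would allow a like-for-like comparison of the two engines on each architecture.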
Clones: RHELPERF-104 Investigate CPU-Mode AI Workloads for multi-arch performance testing
Status: In Progress