AI Platform Core Components / AIPCC-4124

Stop building flash-attn on CUDA (only RHAIIS)

    • Type: Story
    • Resolution: Done
    • Priority: Undefined
    • Component: Accelerator Enablement
    • Sprint: AIPCC Accelerators 13, AIPCC Accelerators 14

      vLLM on CUDA does not use this flash-attn build; it uses vLLM's own fork of flash-attn:

      https://github.com/vllm-project/vllm/blob/ae87ddd040b793fd9f4f05cb660a4728c81d7670/cmake/external_projects/vllm_flash_attn.cmake#L13-L26

      Since we no longer need to build it for RHAIIS, we need to update collections/rhaiis/cuda-ubi9/requirements.txt. A sketch of the change is shown below.
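      A minimal sketch of the intended edit, assuming flash-attn currently appears as its own entry in that requirements file (the exact line, and any version pin it may carry, are illustrative and not copied from the repo):

          --- collections/rhaiis/cuda-ubi9/requirements.txt
          -flash-attn    # remove: vLLM on CUDA builds its own vllm-flash-attn fork during its CMake build

      No replacement entry is needed, since vLLM fetches and builds the fork itself via the cmake/external_projects/vllm_flash_attn.cmake file linked above.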

              Assignee: Emilien Macchi (emacchi@redhat.com)
              Reporter: Emilien Macchi (emacchi@redhat.com)
              Team: Frank's Team
