Loading...

XML

Word

Printable

Type: Story
Resolution: Done
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: Accelerator Enablement
Labels:
- wheels

Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Epic Link:
Upstream viable patches to reduce long-term maintenance
Intelligence Requested:
Market:

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

The patch vllm-0.7.2/cuda-ubi9/0003-Remove-version-munging.patch disables code in vLLM setup.py that adds local version numbers. A CUDA 12.4 build gets +cu128, a CUDA 12.1 builds gets no suffix (see MAIN_CUDA_VERSION) , a CPU build gets +cpu, and so on.

For downstream builds, we don't want any local version numbers. Work with upstream and figure out if they are willing to accept a patch that lets us customize and disable local version number with an env var

Idea:

VLLM_LOCAL_VERSION=none disables local version numbers completely

mentioned on

Merge request - AIPCC-443, AIPCC-3173, AIPCC-3557, AIPCC-3563: Replace resolve_source with get_resolver_provider in the vllm plugin

Solved by commit 063b4219183a1dbf7d206befbdb3495c2242d51b.

Assignee:: Percy Mattsson

Reporter:: Christian Heimes

Team:: Frank's Team

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2025/02/24 12:51 PM

Updated:: 2025/10/15 9:45 AM

Resolved:: 2025/08/07 2:44 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates

PagerDuty