Type: Task
Resolution: Done
Priority: Normal
Some vllm operations (and their dependencies) may rely on torch.utils.cpp_extension._get_cuda_arch_flags (https://github.com/pytorch/pytorch/blob/v2.7.1/torch/utils/cpp_extension.py?plain=1#L2303-L2322) to infer which architectures to build JIT components for.
At build time it makes sense to set TORCH_CUDA_ARCH_LIST so that we build for all of our intended target architectures, but at run time it is better to let _get_cuda_arch_flags choose the correct flags automatically based on the detected GPUs, which is its default behaviour when TORCH_CUDA_ARCH_LIST is unset.
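A minimal sketch of the two behaviours (assuming a CUDA-enabled PyTorch build; the arch list below is only an example, not our actual build matrix):

    import os
    from torch.utils.cpp_extension import _get_cuda_arch_flags

    # Build time: pin the full set of intended target architectures.
    # (Example values only.)
    os.environ["TORCH_CUDA_ARCH_LIST"] = "8.0;8.6;9.0+PTX"
    print(_get_cuda_arch_flags())  # flags for every listed arch

    # Run time: unset the variable so the flags are derived from the
    # GPUs actually detected on the host (the default behaviour).
    del os.environ["TORCH_CUDA_ARCH_LIST"]
    print(_get_cuda_arch_flags())  # flags only for the detected GPUs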
- is triggering: AIPCC-4282 "base images: remove TORCH_CUDA_ARCH_LIST and PYTORCH_ROCM_ARCH" (Closed)
- mentioned on