Loading...

XML

Word

Printable

Type: Story
Resolution: Done
Priority: Undefined
Fix Version/s: None
Affects Version/s: None
Component/s: Development Platform
Labels:
None

Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Intelligence Requested:
Market:

Sprint:
DP Unfinished Issues, AP Sprint 12

Target Version:

RHAIIS-3.2.1

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

I added TORCH_CUDA_ARCH_LIST and PYTORCH_ROCM_ARCH args and env variables in the hope that the information would be useful. I wanted to have one central place to configure and record supported GPU architectures.

The idea turned out to cause more problems than benefits:

Wheel builder is changing CUDA arch list more often than base images are released.
The presence of TORCH_CUDA_ARCH_LIST can slow down vLLM startup, see ~~AIPCC-4016~~. Some operations and dependencies might compile kernels just-in-time. Without TORCH_CUDA_ARCH_LIST, only cubins for the current GPU arch are compiled. With TORCH_CUDA_ARCH_LIST present, code is compiled for additional archs.

Let's remove these env ars.

is triggered by

AIPCC-4016 rhaiis: Remove TORCH_CUDA_ARCH_LIST env var

Closed

mentioned on

Merge request - AIPCC-4282: Remove TORCH_CUDA_ARCH_LIST and PYTORCH_ROCM_ARCH

Assignee:: Christian Heimes

Reporter:: Christian Heimes

Team:: Antonio's Team

Votes:: 0 Vote for this issue

Watchers:: 3 Start watching this issue

Created:: 2025/08/12 6:15 AM

Updated:: 2025/08/12 1:25 PM

Resolved:: 2025/08/12 1:25 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates

PagerDuty