Bug
Resolution: Unresolved
Critical
RHAIIS-3.2
On Blackwell with xformers==0.0.30, we get:
INFO 07-14 21:02:11 [gpu_model_runner.py:2238] Encoder cache will be initialized with a budget of 8192 tokens, and profiled with 3 image items of the maximum feature size.
CUDA error (/mnt/work-dir/xformers-0.0.30/xformers-0.0.30/third_party/flash-attention/hopper/flash_fwd_launch_template.h:175): no kernel image is available for execution on the device
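The failing file is under flash-attention/hopper, which suggests (an inference from the path, not confirmed in this ticket) that the build ships kernels for Hopper (compute capability 9.0) but none for the Blackwell device. A minimal diagnostic sketch for attaching the device's compute capability to this report follows. The helper names native_arch, has_native_kernels and describe_device are hypothetical. torch.cuda.get_arch_list() reports the architectures PyTorch itself was built for, not those of the xformers wheel, so it is only a rough proxy.

```python
def native_arch(capability):
    """Map a (major, minor) compute capability to its sm_XY name."""
    major, minor = capability
    return f"sm_{major}{minor}"


def has_native_kernels(capability, arch_list):
    """Return True if arch_list has a native entry for this capability.

    Arch-specific entries such as 'sm_90a' are treated as 'sm_90'.
    PTX entries ('compute_XY') are ignored: this checks native code only.
    """
    target = native_arch(capability)
    return any(arch.rstrip("a") == target for arch in arch_list)


def describe_device(index=0):
    """Print the GPU's capability next to PyTorch's compiled arch list.

    Assumes PyTorch with CUDA support is installed and a GPU is visible.
    """
    import torch  # imported here so the helpers above work without torch

    capability = torch.cuda.get_device_capability(index)
    arch_list = torch.cuda.get_arch_list()
    print("device:", torch.cuda.get_device_name(index))
    print("capability:", native_arch(capability))
    print("torch arch list:", arch_list)
    print("native kernels in torch build:",
          has_native_kernels(capability, arch_list))
```

Calling describe_device() on the affected host prints the values to paste into the ticket. Running python -m xformers.info alongside it should show the build details of the xformers install itself.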