Loading...

XML

Word

Printable

Type: Bug
Resolution: Unresolved
Priority: Major
Fix Version/s: None
Affects Version/s: None
Component/s: PyTorch
Labels:
- pytorch_qa

Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Intelligence Requested:
Market:

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

Summary:
Multiple tests in TestSelectAlgorithmCPU are failing during PyTorch unit test execution on CPU platform with tensor comparison errors.

Test Class: inductor/test_cpu_select_algorithm.py::TestSelectAlgorithmCPU
Number of Failing Tests: 44
Platform: CPU
Test Type: Unit Test

Version Information:

PyTorch Commit: 6bdd8c9
Branch: main
Test Date: 2026-01-14
Sprint: Sprint 24

Failure Pattern:
Tests failing with related error patterns - tensor comparison failures in int4 quantization and weight-only quantization tests

Common Error:

AssertionError: Tensor-likes are not close!

Mismatched elements: 1249 / 16384 (7.6%)
Greatest absolute difference: 0.3125 at index (0, 1, 223) (up to 0.01 allowed)
Greatest relative difference: 30.125 at index (2, 3, 142) (up to 0.01 allowed)

Failing Tests (sample - 10 of 44):
1. test_int4_concat_woq_mm_batch_size_4_in_features_256_out_features0_group_size_32_cpu_bfloat16
2. test_int4_woq_mm_amx_batch_size_1_in_features_1024_out_features_128_group_size_32_cpu_bfloat16
3. test_int4_woq_mm_amx_batch_size_1_in_features_128_out_features_1024_group_size_64_cpu_bfloat16
4. test_int4_concat_woq_mm_batch_size_4_in_features_256_out_features0_group_size_128_cpu_bfloat16
5. test_int4_woq_mm_amx_batch_size_1_in_features_1024_out_features_1024_group_size_32_cpu_bfloat16
6. test_int4_woq_mm_amx_batch_size_1_in_features_128_out_features_1024_group_size_32_cpu_bfloat16
7. test_int4_woq_mm_amx_batch_size_4_in_features_1024_out_features_128_group_size_64_cpu_bfloat16
8. test_int4_woq_mm_amx_batch_size_4_in_features_128_out_features_128_group_size_64_cpu_bfloat16
9. test_int4_woq_mm_amx_batch_size_6_in_features_1024_out_features_1024_group_size_32_cpu_bfloat16
10. test_int4_woq_mm_amx_batch_size_6_in_features_128_out_features_1024_group_size_128_cpu_bfloat16
... and 34 more tests

Steps to Reproduce:
1. Run test command:

   TEST_CONFIG=cpu python3 test/run_test.py -i inductor/test_cpu_select_algorithm
   TEST_CONFIG=cuda python3 test/run_test.py -i inductor/test_cpu_select_algorithm
   TEST_CONFIG=inductor python3 test/run_test.py -i inductor/test_cpu_select_algorithm

2. Observe tensor comparison failures in int4 quantization tests

Expected Result:
All tests should pass

Actual Result:
44 tests fail with tensor comparison errors in int4 weight-only quantization operations

Root Cause Analysis:
Multiple int4 weight-only quantization (woq) tests are failing with numerical accuracy issues. The failures primarily affect:

int4_concat_woq_mm tests
int4_woq_mm_amx tests with various batch sizes and feature dimensions
Tests with different group sizes (32, 64, 128)
Both bfloat16 and float32 dtypes

Additional Context:

Note: sGPU ticket AIPCC-8251 exists for the same test class
This is the CPU-specific failure affecting 44 tests
All failures are related to int4 quantization accuracy
High priority due to the number of failing tests (44)

Logs:
Test execution logs: /home/ktanmay/Downloads/Run 1-20260120T060019Z-1-001/Run 1/20260114_024940_commit_6bdd8c9/cpu_tests.log

Priority: P2

Labels: pytorch, unittest, cpu, inductor, quantization, int4

Assignee:: Unassigned

Reporter:: PyTorch Engineering

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Created:: 2026/01/20 8:14 AM

Updated:: 2026/01/22 5:25 AM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates

PagerDuty