Loading...

XML

Word

Printable

Type: Task
Resolution: Unresolved
Priority: Undefined
Fix Version/s: None
Affects Version/s: None
Component/s: PyTorch
Labels:
- pytorch_ci

Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Epic Link:
AIPCC-8378
Intelligence Requested:
Market:

SFDC Cases Links:
SFDC Cases Open:
SFDC Cases Counter:

Problem

CI builds are failing due to 215 known test failures that are platform-specific and documented. Need to implement test exclusions to achieve green builds while maintaining 99%+ test coverage.

Objective

Configure the RHEL test workflow to exclude known failing tests, achieving green CI builds without losing test coverage.

Current Status

Total tests: ~20,400
Failing tests: ~215 (1.05%)
Pass rate: ~99%
All failures are documented and understood

Tests to Exclude

1. inductor/test_cutlass_backend - 189 tests (CUTLASS library not installed)
2. inductor/test_flex_attention - 24 tests (H200 float16 alignment)
3. test_cpp_extensions_jit - 1 test (cuDNN JIT compilation)
4. test_partial_flat_weights - 1 test (RNN parameter handling)

Expected Results

All 5 test shards pass
~99% test coverage maintained
Core PyTorch functionality fully tested
CI builds show green status

Acceptance Criteria

[ ] Test exclusions added to _linux-test.yml
[ ] All exclusions documented with issue numbers
[ ] CI pipeline runs successfully with green status
[ ] Test coverage metrics confirm 99%+ coverage

References

Current workflow: .github/workflows/_linux-test.yml
Documentation: 5_SHARD_FAILURE_SUMMARY.md

Assignee:: Subin George

Reporter:: Subin George

Team:: PyTorch Infrastructure

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Created:: 2026/01/12 5:40 AM

Updated:: 2026/01/22 9:35 AM

Details

Description

Problem

Objective

Current Status

Tests to Exclude

Expected Results

Acceptance Criteria

References

Attachments

Easy Agile Planning Poker

Activity

People

Dates

PagerDuty