Uploaded image for project: 'AI Platform Core Components'
  1. AI Platform Core Components
  2. AIPCC-8517

[PyTorch][Upstream CI] Investigate cuDNN JIT Extension Compilation Failure

XMLWordPrintable

    • Icon: Bug Bug
    • Resolution: Unresolved
    • Icon: Undefined Undefined
    • None
    • None
    • PyTorch
    • False
    • Hide

      None

      Show
      None
    • False

      Problem

      Test test_cpp_extensions_jit::test_jit_cudnn_extension fails on RHEL 9.6 due to cuDNN C++ extension JIT compilation issues.

      Root Cause (Suspected)

      • cuDNN header/library version mismatch
      • JIT compilation environment issues specific to RHEL 9.6
      • CUDA toolkit configuration problems
      • C++ compiler flags incompatibility with GCC 11

      Impact

      • Tests failing: 1
      • Severity: Very Low - Advanced feature rarely used in production
      • Production impact: None - JIT extension compilation is optional

      Current Workaround

      Test excluded in workflow configuration

      Acceptance Criteria

      • [ ] Root cause identified with detailed error analysis
      • [ ] cuDNN extension compiles successfully via JIT
      • [ ] test_jit_cudnn_extension passes on RHEL

      References

              rh-ee-sugeorge Subin George
              rh-ee-sugeorge Subin George
              PyTorch Infrastructure
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: