AI Platform Core Components / AIPCC-6275

[QA][PyTorch UT][CPU] test_monitor and test_tensorboard modules are failing due to assertion error

    • Type: Bug
    • Resolution: Unresolved
    • Priority: Undefined
    • PyTorch

      Both the test_monitor and test_tensorboard modules fail with an assertion error.

      Failing Tests:

      test_monitor: test/test_monitor.py::TestMonitorTensorboard::test_event_handler

      test_tensorboard: test/test_tensorboard.py::TestTensorBoardWriter::test_writer

      Version numbers (base image, wheels, builder, etc.):

      • PyTorch version: 2.10.0a0+git3cc8af2
      • Branch: main
      • Commit ID: 3cc8af2d67f42bf2a933796290446c5ab8978aac
      • OS: RHEL 9.6
      • Python version: 3.12.11

      Steps to Reproduce:

      1. Log in to the H200 machine.
      2. Log in to quay.io:
        podman login quay.io
      3. Pull the base image:
        podman pull quay.io/rh-ee-sugeorge/base_image_pytorch_rhel
      4. Run the image:
        podman run -it <IMAGE_NAME>
      5. Run the PyTorch CPU unit tests (UT):
        TEST_CONFIG=cpu python3 test/run_test.py --verbose --continue-through-error -i test_monitor
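
To narrow the failure down, the two failing test IDs listed above can also be run one at a time. The following is a minimal sketch, not part of the original report: it assumes pytest is installed in the image and that the shell is at the root of the PyTorch source tree. The `run_failing_tests` helper and the `DRY_RUN` switch are hypothetical names introduced only for this example.

```shell
# Failing test IDs copied verbatim from the "Failing Tests" section of this report.
FAILING_TESTS="
test/test_monitor.py::TestMonitorTensorboard::test_event_handler
test/test_tensorboard.py::TestTensorBoardWriter::test_writer
"

# Hypothetical helper: run each failing test individually with pytest.
# With DRY_RUN=1 (also hypothetical) the commands are only printed, not executed.
run_failing_tests() {
  for t in $FAILING_TESTS; do
    if [ "${DRY_RUN:-0}" = "1" ]; then
      echo "python3 -m pytest -v $t"
    else
      python3 -m pytest -v "$t"
    fi
  done
}
```

Calling `run_failing_tests` inside the container runs both tests in verbose mode so the full assertion message is visible; `DRY_RUN=1 run_failing_tests` only prints the commands. Because this bypasses `test/run_test.py`, results may differ from the full runner used in step 5.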

              rh-ee-rpunia Riya Punia