  1. Red Hat OpenShift Data Science
  2. RHODS-1469

PyTorch & Tensorflow images cannot be spawned


    • MODH Sprint 25, MODH Sprint 26

      Description of problem:

      Both the PyTorch and TensorFlow images fail to spawn, apparently because the pod is never created and nothing responds before the 10-minute timeout expires.

      Prerequisites (if any, like setup, operators/versions):

      OCP 4.7.19 on PSI, RHODS 1.0.16

      Steps to Reproduce

      1. Install RHODS
      2. Wait for CUDA builds
      3. Try to spawn the PyTorch or TensorFlow image

      Actual results:

      Spawning fails because the JupyterLab server does not respond.

      Expected results:

      Spawning succeeds, and JupyterLab loads and can be used.

      Reproducibility (Always/Intermittent/Only Once):

      Always

      Build Details:

      OCP 4.7.19 on PSI, RHODS 1.0.16

      Additional info:
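
      The description says the pod "apparently" fails to be created. One way to narrow that down is to inspect the pod's status while the spawn is pending. The following is a minimal sketch, not part of the original report: `summarize_pod` is a hypothetical helper, and it assumes its input is the dictionary obtained by parsing the JSON printed by `oc get pod <pod-name> -n <namespace> -o json`. It reads only standard Kubernetes pod status fields (`phase`, `conditions`, `containerStatuses`).

```python
def summarize_pod(pod: dict) -> str:
    """Return a one-line summary of why a pod is (or is not) running.

    Hypothetical helper: `pod` is assumed to be the parsed JSON of a
    single Kubernetes Pod object.
    """
    status = pod.get("status", {})
    phase = status.get("phase", "Unknown")

    # A pod whose PodScheduled condition is False was never placed on a node.
    for cond in status.get("conditions", []):
        if cond.get("type") == "PodScheduled" and cond.get("status") == "False":
            reason = cond.get("reason", "no reason given")
            return f"{phase}: not scheduled ({reason})"

    # Otherwise report the first container that is stuck in a waiting state,
    # for example because its image cannot be pulled.
    for cs in status.get("containerStatuses", []):
        waiting = cs.get("state", {}).get("waiting")
        if waiting:
            reason = waiting.get("reason", "no reason given")
            return f"{phase}: container {cs.get('name')} waiting ({reason})"

    # Nothing obviously wrong at the pod level: report the phase alone.
    return phase
```

      If the pod does not exist at all, `oc get pod` itself reports an error, which would support the "pod was never created" reading. If the pod exists, the summary helps distinguish a scheduling problem from a container that never starts. The pod name and namespace are left as placeholders because the report does not state them.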

              tmckay@redhat.com Trevor Mckay (Inactive)
              rhn-support-lgiorgi Luca Giorgi
              Pablo Felix Pablo Felix (Inactive)
