- Bug
- Resolution: Done
- Blocker
- RHODS_1.1_GA
MODH Sprint 25, MODH Sprint 26
Description of problem:
Both the PyTorch and TensorFlow images fail to spawn, apparently because the pod is never created and the server does not respond within the 10-minute timeout.
Prerequisites (if any, like setup, operators/versions):
OCP 4.7.19 on PSI, RHODS 1.0.16
Steps to Reproduce:
- Install RHODS
- Wait for CUDA builds
- Try to spawn the PyTorch or TensorFlow image
Actual results:
Spawning fails because the JupyterLab server does not respond.
Expected results:
Spawning succeeds, and JupyterLab loads and is usable.
Reproducibility (Always/Intermittent/Only Once):
Always
Build Details:
OCP 4.7.19 on PSI, RHODS 1.0.16
Additional info:
- blocks
  - RHODS-1497 As a QE, I want to have a minimal test case for the PyTorch image (Closed)
  - RHODS-1498 As a QE, I want to have a minimal test case for the Tensorflow image (Closed)