Open Data Hub / ODH-431

CUDA build disrupts the whole cluster when all cluster nodes are master nodes


    • Type: Bug
    • Resolution: Unresolved
    • Priority: Normal
    • JupyterHub

      A user (dpeterso@redhat.com) has a 3-node condensed cluster in which all nodes are master nodes.
      When the CUDA build is run in the cluster through Open Data Hub, it disrupts the whole cluster and heavily interferes with the OpenShift API server.
      More information is in the attachments.

      For reference, the NVIDIA GPU Operator is installed on the cluster.
      We need someone with expertise in cluster management and OpenShift to tackle this issue.

      More information can be found at: https://chat.google.com/room/AAAAiODw-Fc/u_gtkHLTqPg

        1. image (2).png (49 kB)
        2. image (3).png (62 kB)
        3. image (4).png (103 kB)
        4. image (5).png (43 kB)

              vpavlin@redhat.com Vaclav Pavlin (Inactive)
              hnalla Harshad Reddy Nalla
              Votes: 0
              Watchers: 3
