Uploaded image for project: 'Red Hat OpenShift Data Science'
  1. Red Hat OpenShift Data Science
  2. RHODS-540

Apply a toleration for GPU worker node taints for Jupyter notebooks that request GPUs

XMLWordPrintable

    • MODH Sprint 20, MODH Sprint 21, MODH Sprint 36, MODH Sprint 37, MODH Sprint 1.7

      For RHODS we plan to put a taint on GPU worker nodes so that we can restrict their use to only workloads that actually need GPUs.

      Before we can apply the taint, we need to modify the JupyterHub profiles code so that, if a user requests GPUs for their notebook pod, they get the necessary toleration to land on a GPU worker node.

      As such, we should add support for a config section in the Jupyterhub Single User Profiles ConfigMap to specify information related to GPUs. The config section would look similar to 

       

        gpu:
        - type: ...
          taint: ....

       

      From vpavlin@redhat.com:

      You will want to add the code here: https://github.com/opendatahub-io/jupyterhub-singleuser-profiles/blob/master/jupyterhub_singleuser_profiles/profiles.py#L206

      Probably refactor this https://github.com/opendatahub-io/jupyterhub-singleuser-profiles/blob/master/jupyterhub_singleuser_profiles/profiles.py#L294-L300 out to a separate method and just call that from the apply_gpu_config when gpu_count == 0

       

       

              llasmith@redhat.com Landon LaSmith
              acorvin@redhat.com Alex Corvin
              Pablo Felix Pablo Felix (Inactive)
              Votes:
              0 Vote for this issue
              Watchers:
              9 Start watching this issue

                Created:
                Updated:
                Resolved: