OpenShift Node / OCPNODE-3739

DRA: e2e test suite that validates Nvidia GPU


    • Type: Epic
    • Resolution: Unresolved
    • Priority: Undefined
    • Status: In Progress
    • Product / Portfolio Work
    • OCPSTRAT-2384: GA-Attribute-Based GPU Allocation in OpenShift with NVIDIA K8s DRA Driver
    • 36% To Do, 14% In Progress, 50% Done

      This is the follow-up work required to get the DRA e2e suite running in OpenShift CI as a periodic job.

      Goal:

      • A periodic CI job that provisions a cluster with a GPU worker node and runs the e2e suite.

       

      Non-goal:

      • Our focus is limited to validating workloads with Nvidia GPUs (using the DRA driver); we will not add support for any other vendor.

       

      The e2e suite is being worked on here: https://github.com/openshift/origin/pull/29842. It currently covers the following use cases:

      • a common test spec (one pod, one container requesting a distinct GPU) that can be validated against both the example DRA driver and the Nvidia DRA driver; the spec is expected to pass on both
      • two containers, each requesting a distinct GPU; neither container should have access to the other's GPU
      • MPS strategy
      • TimeSlicing strategy
      • static pre-partitioned MIG slices
      • IPC using CUDA

       

      Constraints:

       

       

              abukashem Abu Kashem
              abukashem Abu Kashem
              Aditi Sahay Aditi Sahay