Uploaded image for project: 'OpenShift Request For Enhancement'
  1. OpenShift Request For Enhancement
  2. RFE-8648

Enhanced GPU Allocation Visibility in OpenShift Console and Monitoring

XMLWordPrintable

    • None
    • Product / Portfolio Work
    • None
    • False
    • Hide

      None

      Show
      None
    • None
    • None
    • None
    • None
    • None
    • None
    • None
    • None

       [1]Proposed title of this feature request
      -> Enhanced GPU Allocation Visibility in OpenShift Console and Monitoring.

       [2]What is the nature and description of the request?
      -> 

      Customer is requesting a built-in and user-friendly way to view GPU allocation and usage across the OpenShift cluster. Specifically, they want visibility into:

      • GPU allocation per node
      • Total GPUs available vs allocated cluster-wide
      • Identification of which pods and namespaces are consuming GPUs

      Currently, this information can only be gathered through manual CLI commands or custom PromQL queries, which is not practical or scalable for clusters with multiple GPU-enabled nodes. The request is to provide:

      • A dedicated OpenShift Console dashboard (or console plugin)
      • Or out-of-the-box monitoring metrics and queries
      • Or an optional console add-on similar to existing observability plugins

       [3]Why does the customer need this? (List the business requirements here)
      -> 

      • Operational visibility: Administrators need a quick and reliable way to assess GPU capacity and utilization.
      • Efficient resource planning: Helps teams avoid GPU overcommitment or underutilization.
      • Faster troubleshooting: Enables quick identification of GPU-consuming workloads without manual investigation.
      • Scalability: Manual CLI-based checks do not scale for clusters with multiple GPU nodes and workloads.
      • Improved user experience: A visual dashboard significantly reduces operational overhead for platform teams managing AI/ML workloads.

       [4]List any affected packages or components.

      ->

      • OpenShift Console (UI / Console Plugins)
      • OpenShift Monitoring (Prometheus)
      • Cluster Observability 

       

              rh-ee-rfloren Roger Florén
              rhn-support-hthakare Harshal Thakare
              None
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated:
                None
                None