-
Feature
-
Resolution: Unresolved
-
Critical
-
Product / Portfolio Work
Overview
Dynamic Resource Allocation (DRA) for AI Workloads in OpenShift manages specialized resources such as GPUs, which are critical for running AI/ML and large language model (LLM) inference workloads. Workloads request high-performance hardware flexibly and dynamically, which lets users meet demanding AI application needs with cost-effective, real-time resource scaling and efficient workload scheduling.
incorporates
- OCPNODE-2510 DRA Upstream Work (Closed)
- OCPSTRAT-1756 DRA: Attribute-Based GPU Allocation in OpenShift with NVIDIA GPU operator -TP 4.20 (Release Pending)
- OCPSTRAT-408 Deprecated : Structured parameter in DRA : refer to https://issues.redhat.com/browse/OCPSTRAT-1756 (Closed)