Type: Feature Request
Resolution: Won't Do
Priority: Normal
Fix Version/s: None
Affects Version/s: None
Component/s: ai-ml-workloads, Node
Labels:
None

Target Version:
None
Activity Type:
Product / Portfolio Work
Status Summary:
None
Blocked:
False
Blocked Reason:

None
Products:
None
Hierarchy Progress Bar:
None

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

PX Review Complete:
None
PX Impact Score:
PX Impact Range:
None
PX Priority Data:
None
PX Technical Impact:
None
PX Technical Impact Notes:
None
PX Scheduling Request:
None

1. Proposed title of this feature request

KAI scheduler

2. What is the nature and description of the request?

Enable the NVIDIA KAI Scheduler.

Git repo: https://github.com/NVIDIA/KAI-Scheduler

Key Features We Need:

Fractional GPU support without explicit GPU operator configuration
Advanced scheduling capabilities (fair scheduling, bin packing)
Better GPU resource management than the default Kubernetes scheduler

Goal: boost GPU utilization, reduce fragmentation and idle time, and ensure fair, resource-efficient scheduling for LLM training and inference.
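
To illustrate why bin packing can reduce GPU fragmentation, here is a minimal toy sketch in Python. It is not KAI Scheduler's algorithm or API. The function names (`place`, `schedule`), the strategy labels ("binpack", "spread"), and the two-node cluster are all hypothetical and exist only for this illustration.

```python
from typing import List, Optional


def place(nodes: List[int], demand: int, strategy: str) -> Optional[int]:
    """Pick a node index for a job needing `demand` GPUs, or None if nothing fits.

    nodes[i] is the number of free GPUs on node i (hypothetical toy model).
    """
    candidates = [i for i, free in enumerate(nodes) if free >= demand]
    if not candidates:
        return None
    if strategy == "binpack":
        # Best fit: prefer the node with the fewest free GPUs that still fits,
        # so that other nodes stay whole for large jobs.
        return min(candidates, key=lambda i: nodes[i])
    # "spread": prefer the node with the most free GPUs.
    return max(candidates, key=lambda i: nodes[i])


def schedule(free_gpus: List[int], jobs: List[int], strategy: str) -> List[Optional[int]]:
    """Place jobs in order; return the chosen node per job (None = unschedulable)."""
    nodes = list(free_gpus)  # copy so the caller's list is not modified
    placements: List[Optional[int]] = []
    for demand in jobs:
        idx = place(nodes, demand, strategy)
        if idx is not None:
            nodes[idx] -= demand
        placements.append(idx)
    return placements


if __name__ == "__main__":
    cluster = [8, 8]          # two nodes with 8 free GPUs each
    jobs = [2, 2, 2, 2, 8]    # four small jobs, then one full-node job
    print("binpack:", schedule(cluster, jobs, "binpack"))
    print("spread: ", schedule(cluster, jobs, "spread"))
```

In this toy case, the bin-packing strategy fills the first node with the four small jobs and leaves the second node free for the 8-GPU job. The spread strategy leaves 4 free GPUs on each node, so the 8-GPU job cannot be placed even though 8 GPUs are idle in total.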

3. Why does the customer need this? (List the business requirements here)

Current Gap:

We understand that OpenShift AI now ships with Kueue support, but Kueue relies on the default Kubernetes scheduler and lacks the GPU-specific features that KAI provides.

4. List any affected packages or components.

Kueue

relates to

OCPSTRAT-1786 Gang Scheduling for OpenShift

In Progress

Assignee: Duncan Hardie

Reporter: Abhijit Roy

Need Info From: None

Votes: 0

Watchers: 4

Created: 2025/09/29 1:13 PM

Updated: 2025/10/29 3:05 PM

Resolved: 2025/10/29 3:05 PM

Target start: None

Target end: None