Epic
Resolution: Done
Blocker
OSSM 3.1.0
The Gateway API Inference Extension (GIE) provides routing features for AI inference with vLLM model serving via Gateway API. Support for GIE in OSSM is a hard requirement and a blocker for supporting llm-d on OpenShift, because llm-d will use the native Gateway API support introduced in OpenShift 4.19, which is based on OSSM.
Today, preliminary support for GIE exists in upstream Istio, but only as an alpha build that is released separately from the main releases. There is an upstream tracking issue for the work needed to release it, and an Istio RFC to integrate it fully.
The purpose of this issue is to follow the upstream tracking issue through to the delivery of GIE support in mainline Istio and, ultimately, in OSSM.
- blocks
  - NE-2050 Support for Gateway API Inference Extension (Release Pending)
  - OCPSTRAT-1757 Support for Gateway API Inference Extensions (Release Pending)