-
Bug
-
Resolution: Unresolved
-
Critical
-
None
-
OSSM 3.1.0
-
None
-
False
-
-
False
-
-
-
Customer Facing
We are seeing an issue where smart load-balancing doesn't work for OpenShift AI and llm-d when multiple models (1 HTTPRoute each) share a Gateway on OCP 4.20 (and maybe affects OCP 4.21 too).
This is caused by a bug in InferencePool support in Istio.
This is already fixed upstream in Istio 1.29: commit.
Can we expedite a backport of this commit to previous OSSM and OCP versions?