- Bug
- Resolution: Done
- Blocker
- Model Serving Sprint 2.9-2
- Testable
Overview
Within BAM and watsonx.ai, Raw Deployments need to be fronted by a routing component. Currently, the FMaaS/Rust router (and Caikit) client-side load balance and proxy requests across a model deployment's pods/replicas. To do so, they use a headless Service that sits between them and the replicas, query it for the addresses of the physical pods, and round-robin requests across those pods.
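The client-side balancing described above can be sketched as follows: resolve the headless Service's DNS name to the individual pod IPs, then cycle through them one request at a time. This is a minimal illustration, not the actual router code; the function names and addresses are hypothetical.

```python
import itertools
import socket

def resolve_pod_addresses(headless_service_dns: str, port: int):
    """Resolve a headless Service's DNS name to individual pod IPs.

    With clusterIP: None, cluster DNS returns one A record per ready pod
    instead of a single virtual IP, which is what makes client-side
    load balancing possible.
    """
    infos = socket.getaddrinfo(headless_service_dns, port,
                               proto=socket.IPPROTO_TCP)
    return sorted({info[4][0] for info in infos})

def round_robin(addresses):
    """Cycle through the resolved pod addresses, one per request."""
    return itertools.cycle(addresses)

# Example with already-resolved (illustrative) pod IPs:
pods = ["10.0.0.1", "10.0.0.2", "10.0.0.3"]
rr = round_robin(pods)
targets = [next(rr) for _ in range(5)]
# targets == ["10.0.0.1", "10.0.0.2", "10.0.0.3", "10.0.0.1", "10.0.0.2"]
```

If the Service is *not* headless, `resolve_pod_addresses` would return a single cluster IP, and every request would land behind kube-proxy's connection-level balancing instead, which is the behavior the Issue below describes.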
Issue
When a model deployment is scaled to more than one pod without the Service being configured as headless, all requests flow to only the first pod in the scaled deployment.
Acceptance Criteria
As part of the Raw mode deployment process (CR submission), there needs to be a way to configure whether the resultant Service has a cluster IP (supported today) or a cluster IP of None (headless, not supported today).
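The difference between the two Service configurations can be shown as plain Kubernetes manifests. These are illustrative sketches only; the names, labels, and port are hypothetical and the actual Service is generated by the KServe controller, not authored by hand.

```yaml
# Supported today: ClusterIP Service. DNS resolves to one virtual IP,
# so kube-proxy balances connections, not individual requests.
apiVersion: v1
kind: Service
metadata:
  name: my-model-predictor        # hypothetical name
spec:
  selector:
    app: my-model-predictor
  ports:
    - port: 8080
      targetPort: 8080
---
# Requested here: headless Service. clusterIP: None makes DNS return
# one A record per ready pod, enabling client-side round-robin.
apiVersion: v1
kind: Service
metadata:
  name: my-model-predictor
spec:
  clusterIP: None
  selector:
    app: my-model-predictor
  ports:
    - port: 8080
      targetPort: 8080
```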
- split to
  - RHOAIENG-5077 [Follow-up-upstream tracker] Routing and Headless Service Support in KServe Raw Mode Deployment - In Progress
- links to
  - RHBA-2024:129615 RHOAI 2.8.1 - Red Hat OpenShift AI