Type: Story
Resolution: Done
Priority: Normal
Fix Version: 1.20.0-z
Bugs
- [Bug]: DSG Project creation disables label for modelmesh #744
- [Model Serving]: Enable GPU in Model Server configuration #703 (targeted for 1.21)
Requirement 1
P0: Users must be able to configure a server for the model
P1: Specify target platform configuration (e.g., compute resources: CPU, memory, GPU) for served models
Issues
Requirement 2
P0: Model storage. Users must be able to deploy a model stored in an S3 location
P0: Model frameworks: Users must be able to serve models based on a variety of frameworks
P0: Ability to serve models not developed in RHODS
- Using frameworks from A
- Stored in locations in [Model Serving]: Allow configuring the server #641
Issues
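The S3-deployment requirement above can be sketched as a KServe-style InferenceService manifest, since model serving in ODH/RHODS is commonly expressed that way. This is a minimal illustration, not the project's actual implementation; the name, bucket path, framework, and API version are assumptions.

```python
# Hypothetical sketch: a KServe-style InferenceService manifest for deploying
# a model stored in S3. The bucket path, framework name, and resource shape
# are illustrative placeholders, not values from the requirements above.
def make_inference_service(name: str, storage_uri: str, framework: str = "sklearn") -> dict:
    """Build a minimal InferenceService manifest as a Python dict."""
    if not storage_uri.startswith("s3://"):
        raise ValueError("expected an s3:// storage location")
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {"name": name},
        "spec": {
            "predictor": {
                "model": {
                    # "Model frameworks" requirement: the format is configurable
                    "modelFormat": {"name": framework},
                    # "Model storage" requirement: the model lives in S3
                    "storageUri": storage_uri,
                }
            }
        },
    }

svc = make_inference_service("my-model", "s3://models-bucket/path/to/model")
```

Swapping the `framework` argument (e.g., "onnx", "tensorflow") covers the variety-of-frameworks requirement without changing the manifest's shape.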
Requirement 3
P0: Ability to view list of deployed models for a project
- Ability to access endpoint
- Ability to view monitoring and performance metrics
P0: The system must indicate the health (i.e., whether they are up) of endpoints for deployed models
P1: Support multi-model serving; ability to serve multiple models on one server
- [Model Serving]: Support Visualization of the Model Server #649
- [Model Serving]: Support Visualization of the Deployed Model #657
Requirement 4
P0: Users must be able to easily retrieve the endpoint for a served model (to use for inference, either testing or incorporating into an app)
- P0: Users must be able to secure endpoints so they are not publicly available: Authentication & authorization capabilities
Issues
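Requirement 4 pairs endpoint retrieval with authentication. A minimal sketch of what a secured inference call looks like from the client side, assuming a bearer-token scheme; the endpoint URL, token, and payload shape are hypothetical, not part of the requirement.

```python
import json
import urllib.request

# Hypothetical sketch: building an authenticated inference request against a
# served model's endpoint. Requirement 4 only says the endpoint must not be
# publicly reachable; a Bearer token is one common way to satisfy that.
def build_inference_request(endpoint: str, token: str, inputs: list) -> urllib.request.Request:
    payload = json.dumps({"inputs": inputs}).encode("utf-8")
    return urllib.request.Request(
        endpoint,
        data=payload,
        headers={
            "Content-Type": "application/json",
            # auth & authz requirement: credentials accompany every call
            "Authorization": f"Bearer {token}",
        },
        method="POST",
    )

# Placeholder endpoint; a real one would be retrieved from the project UI.
req = build_inference_request(
    "https://example.invalid/v2/models/my-model/infer", "TOKEN", [[1.0, 2.0]]
)
```

The same request object works for quick testing (send it with `urllib.request.urlopen`) or as a template for incorporating the endpoint into an application.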
Requirement 5
P0: Ability to view global list of all deployed models (across all projects)
- Filtering / search capabilities
- Users view all models deployed within projects they have access to; admins view all
Issues
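The visibility rule in Requirement 5 (users see models in their projects, admins see everything) can be sketched as a simple filter. The model-record and membership structures here are illustrative assumptions, not the actual data model.

```python
# Hypothetical sketch of Requirement 5's visibility rule: regular users see
# deployed models only in projects they can access; admins see all of them.
def visible_models(models, user_projects, is_admin=False):
    """Filter deployed-model records by project access."""
    if is_admin:
        return list(models)
    return [m for m in models if m["project"] in user_projects]

# Illustrative records; real ones would come from the deployments store.
models = [
    {"name": "fraud", "project": "risk"},
    {"name": "churn", "project": "marketing"},
]
```

The P1 filtering/search capability would layer on top of this same filtered list (e.g., a substring match on the model name).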
Requirement 6
P1: Ability to delete a model
Issues
Requirement 7
P0: Manually add a new version for served model & deploy (replace)
P0: Edit model server
P1: Deploy a new version of the model that coexists with the previous one, yielding multiple deployed endpoints (TODO: review)
Issues
Requirement 8 (Targeted for 1.21)
P0: Inference performance metrics. Users must be able to access performance metrics for all deployed models
- P0: Inference performance - latency (avg. time to process 1 input)
- P0: Target metrics for v1:
- Avg. response time over a period of time (e.g., the last 24 hours, or the last week/month to gauge trends over time)
- Number of requests over defined period of time (including option for all time)
Issues
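The v1 metrics above (average response time and request count over a window, including all time) reduce to a simple aggregation. A sketch under the assumption that latency samples are (timestamp, seconds) pairs; in a real deployment these would come from the serving runtime's metrics store, not an in-memory list.

```python
from datetime import datetime, timedelta

# Hypothetical sketch of Requirement 8's v1 metrics: request count and
# average latency over a time window; window=None means "all time".
def window_metrics(samples, now, window=None):
    """Return (request_count, avg_latency_seconds) for samples in the window."""
    if window is not None:
        samples = [(t, lat) for t, lat in samples if now - t <= window]
    if not samples:
        return 0, 0.0
    return len(samples), sum(lat for _, lat in samples) / len(samples)

# Illustrative samples: two requests inside the last 24 hours, one outside.
now = datetime(2023, 1, 2)
samples = [
    (now - timedelta(hours=1), 0.20),
    (now - timedelta(hours=2), 0.40),
    (now - timedelta(days=3), 1.00),  # falls outside a 24-hour window
]
```

Calling `window_metrics(samples, now, timedelta(hours=24))` yields the last-24-hours view; passing `timedelta(weeks=1)` or omitting the window covers the week/month and all-time cases.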
Clones: RHODS-4620 UX for Serving Models (Closed)
Is cloned by: RHODS-4622 UI back end for Serving Models in ODH core (Closed)