Loading...

XML

Word

Printable

Type: Epic
Resolution: Unresolved
Priority: Undefined
Fix Version/s: None
Affects Version/s: None
Component/s: Model Serving
Labels:
- MLServing
- mlserving

Epic Name:
Model Monitoring and Metrics - Model Serving v2
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Affects Testing:

Testable
Automated:
No
Epic Status:
To Do
Hierarchy Progress Bar:

22% To Do, 0% In Progress, 78% Done
Test Blocker:
No
Test Coverage:

Pending
Watchlist Impact:
None
Intelligence Requested:
Market:
PX Impact Score:

SFDC Cases Links:
SFDC Cases Counter:
SFDC Cases Open:

Reqs doc (covered as part of broader model serving v2 reqs): https://docs.google.com/document/d/1TXLEyzpYX6inMHOlaUW8VbMxMvQ5Y9TEzHM87G9_1Yk/edit?usp=sharing

From Jeff: It would be good to get a basic metric that provides insight into whether customers are using the feature. For example, the number of deployed models at the cluster level.
We just need to determine what metric would work for us and add it to the rhods rules at https://github.com/red-hat-data-services/odh-manifests/blob/master/monitoring/base/rhods-rules.yaml
Part of the broader R11:
Inference performance metrics. Users must be able to access performance metrics for all deployed models # P0:: Avg. response time over period of time (eg. last 24 hours or last week/month to gauge trends over time) at the individual model level

P0: Number of requests over defined period of time (including option for all time) at the individual model level
P0: Ability to view metrics at both the individual model and model server levels
P0: CPU/GPU/memory utilization

P0: configurable alerts based on defined thresholds:

Avg. response time
CPU/GPU/memory utilization
Number of requests (eg. above or below or certain threshold)

TBD: number of errors / failures in defined time period

is incorporated by

RHODS-6130 Model Serving v2

In Progress

Assignee:: Unassigned

Reporter:: Vedant Mahabaleshwarkar

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Created:: 2023/02/15 9:04 PM

Updated:: 2025/06/11 11:42 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates

Hide