Loading...

XML

Word

Printable

Type: Story
Resolution: Done
Priority: Undefined
Fix Version/s: None
Affects Version/s: None
Component/s: Model Serving
Labels:
- MLServing
- mlserving

Epic Link:
Model Serving v2
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Acceptance Criteria:
None
Affects Testing:

Testable
Automated:
No
CDW blocker:
CDW pm_ack:
CDW release:
Regression:
No
Target Release:

FUTURE_GA
Test Blocker:
No
Test Coverage:

Pending
Watchlist Impact:
None
Intelligence Requested:
Market:

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

From Jeff: It would be good to get a basic metric that provides insight into whether customers are using the feature. For example, the number of deployed models at the cluster level.
We just need to determine what metric would work for us and add it to the rhods rules at https://github.com/red-hat-data-services/odh-manifests/blob/master/monitoring/base/rhods-rules.yaml
Part of the broader R11:
Inference performance metrics. Users must be able to access performance metrics for all deployed models # P0:: Avg. response time over period of time (eg. last 24 hours or last week/month to gauge trends over time) at the individual model level

P0: Number of requests over defined period of time (including option for all time) at the individual model level
P0: Ability to view metrics at both the individual model and model server levels
P0: CPU/GPU/memory utilization

P0: configurable alerts based on defined thresholds:

Avg. response time
CPU/GPU/memory utilization
Number of requests (eg. above or below or certain threshold)

TBD: number of errors / failures in defined time period

Assignee:: Vedant Mahabaleshwarkar

Reporter:: Vedant Mahabaleshwarkar

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Created:: 2023/02/15 7:57 PM

Updated:: 2023/02/16 4:13 PM

Resolved:: 2023/02/16 4:13 PM

Details

Description

Attachments

Easy Agile Planning Poker

Activity

People

Dates