Type: Story
Resolution: Obsolete
Priority: Normal
Fix Version/s: July Release for PSAP
Affects Version/s: None
Component/s: RHODS
Labels:

Epic Link: RHODS GPU Testing
Ready: False
Blocked: False

SFDC Cases Counter:
SFDC Cases Links:
SFDC Cases Open:

Market:

Release Note Text: Undefined

Build on the ongoing Multi-Instance GPU (MIG) performance benchmarking to show how GPU utilization can be improved by slicing the GPU (an A100 or A30 card) and assigning individual instances to multiple model serving applications running in parallel (Triton may be used for model serving).
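
One way the assignment step might look, as a minimal sketch rather than the benchmark's actual setup: each serving process is pinned to a single MIG instance by setting `CUDA_VISIBLE_DEVICES` to that instance's device identifier (as listed by `nvidia-smi -L`). The function name `assign_mig_instances`, the server names, and the `MIG-...` identifiers below are hypothetical placeholders, not values from this story.

```python
from typing import Dict, List


def assign_mig_instances(
    mig_uuids: List[str], servers: List[str]
) -> Dict[str, Dict[str, str]]:
    """Pair each serving application with exactly one MIG instance.

    Returns, per server name, the environment variables that would be
    passed to that server's process so it only sees its own GPU slice.
    """
    if len(servers) > len(mig_uuids):
        raise ValueError("not enough MIG instances for the requested servers")
    return {
        name: {"CUDA_VISIBLE_DEVICES": uuid}
        for name, uuid in zip(servers, mig_uuids)
    }


# Placeholder identifiers (assumption); real ones come from `nvidia-smi -L`.
envs = assign_mig_instances(
    ["MIG-aaaa", "MIG-bbbb"],
    ["model-server-0", "model-server-1"],
)
for server, env in envs.items():
    print(server, env)
```

Each resulting environment would then be supplied when launching the corresponding model server (for example through a container's environment settings), so that the servers run in parallel on separate slices of the same card.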

Assignee: Yuchen Fama

Reporter: Ashish Kamra

Votes: 0

Watchers: 2

Created: 2021/08/03 5:49 PM

Updated: 2026/01/30 7:44 PM

Resolved: 2022/12/08 6:49 PM
