- Initiative
- Resolution: Duplicate
Extend JBenchmark to support benchmarking across a diverse set of cloud instance types, so we can evaluate how generative AI models perform in different hardware environments. This enables teams to make data-driven deployment decisions based on performance, cost, and scalability tradeoffs across clouds and GPU SKUs.
This task includes:
- Benchmark orchestration on GCP, AWS, Azure, and on-prem instances
- Support for different GPU models (e.g., A100, H100, L4, AMD MI300, IBM SPU)
- Capturing full instance metadata (cloud vendor, machine type, cost/hour, region, etc.); see the record sketch after this list
- Aggregating results by instance family to enable performance/cost comparisons
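For illustration only, a minimal Python sketch of what a tagged benchmark record could look like; the class names, field names, and serialization below are placeholders for discussion, not JBenchmark's actual schema or API:

```python
# Hypothetical sketch -- names, fields, and values are assumptions, not the final schema.
from dataclasses import dataclass, asdict
import json


@dataclass
class InstanceMetadata:
    """Instance-level tags attached to every benchmark run."""
    cloud_vendor: str        # e.g. "gcp", "aws", "azure", "on-prem"
    machine_type: str        # e.g. "a2-highgpu-1g", "p4d.24xlarge"
    gpu_model: str           # e.g. "A100", "H100", "L4", "MI300"
    gpu_count: int
    region: str
    cost_per_hour_usd: float


@dataclass
class BenchmarkRun:
    """A single benchmark result, tagged with the instance it ran on."""
    model_name: str
    tokens_per_second: float
    latency_p50_ms: float
    instance: InstanceMetadata


def to_record(run: BenchmarkRun) -> str:
    """Serialize a tagged run so it can later be aggregated by instance family."""
    return json.dumps(asdict(run))


if __name__ == "__main__":
    # Illustrative values only.
    run = BenchmarkRun(
        model_name="example-7b",
        tokens_per_second=1234.5,
        latency_p50_ms=42.0,
        instance=InstanceMetadata(
            cloud_vendor="gcp",
            machine_type="a2-highgpu-1g",
            gpu_model="A100",
            gpu_count=1,
            region="us-central1",
            cost_per_hour_usd=3.67,
        ),
    )
    print(to_record(run))
```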
Goals:
- Provide customers and internal teams with comparative metrics across:
  - Different cloud providers
  - GPU types
  - Instance shapes (single-GPU, multi-GPU, CPU fallback, etc.)
- Validate models and inference configurations in environments aligned with real-world deployments
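To make the performance/cost comparison concrete, here is a rough aggregation sketch over record dicts shaped like the tagged run above; the grouping key and the tokens-per-dollar-hour metric are illustrative choices, not the final reporting design:

```python
# Illustrative aggregation sketch -- assumes record dicts shaped like the tagged
# run above; the real pipeline and reported metrics may differ.
from collections import defaultdict
from statistics import mean


def aggregate_by_instance(records: list[dict]) -> dict[str, dict[str, float]]:
    """Group tagged runs by cloud vendor + machine type and compare cost-normalized throughput."""
    groups: dict[str, list[dict]] = defaultdict(list)
    for rec in records:
        inst = rec["instance"]
        groups[f"{inst['cloud_vendor']}/{inst['machine_type']}"].append(rec)

    summary: dict[str, dict[str, float]] = {}
    for key, runs in groups.items():
        tps = mean(r["tokens_per_second"] for r in runs)
        cost = mean(r["instance"]["cost_per_hour_usd"] for r in runs)
        summary[key] = {
            "mean_tokens_per_second": tps,
            "mean_cost_per_hour_usd": cost,
            "tokens_per_dollar_hour": tps / cost if cost else 0.0,
        }
    return summary
```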
Acceptance Criteria:
- Benchmark can be triggered across at least 3 major cloud providers
- Each run is tagged with cloud/instance metadata