Loading...

XML

Word

Printable

Type: Feature
Resolution: Unresolved
Priority: Normal
Fix Version/s: ols-1.1
Affects Version/s: None
Component/s: Lightspeed
Labels:

Work Type:
Strategic Portfolio Work
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Parent Link:
OCPSTRAT-895Openshift LightSpeed GA
Hierarchy Progress Bar:

0% To Do, 50% In Progress, 50% Done

Risk Score:
0

Discussion Needed:

Program Call

SFDC Cases Links:
SFDC Cases Counter:
SFDC Cases Open:

Intelligence Requested:
Market:

Different backend LLMs lead to different experiences with OLS. We need to provide a quantitative score about how well OLS performs with one LLM over another as a guide for the customers choosing one over the other.

The scoring system should be based on OLS functionalities or capabilities, not as a general-purpose score.

Scoring system to describe the quality of answers when using the model
Scoring system to evaluate and measure relevancy and correctness of answers

links to

openshift/lightspeed-service#2012: eval: add recent eval result

Assignee:: Gaurav Singh

Reporter:: William Caban

Votes:: 0 Vote for this issue

Watchers:: 2 Start watching this issue

Created:: 2024/01/08 7:05 PM

Updated:: 2025/02/11 10:07 PM

Details

Description

Attachments

Issue Links

Easy Agile Planning Poker

Activity

People

Dates