Type: Feature
Resolution: Done
Priority: Critical
Work Type: Strategic Product Work
Parent Feature: OCPSTRAT-895 OpenShift Lightspeed GA
Progress: 23% To Do, 7% In Progress, 70% Done
Program Call
Goal
Evaluate the quality of answers provided by the OpenShift Lightspeed (OLS) AI assistant for product-related questions.
Timeline
August 30, 2024
Purpose
The purpose of this feature is to develop a method for evaluating the quality of responses given by OpenShift Lightspeed. We aim to create a "golden set" of questions and answers reviewed by human experts for each product area. This set will serve as a standard of excellence, helping us compare and understand the quality of OLS outputs. The OLS team will use this internal feature to assess response quality and formulate a plan for improvement.
Overview
The OLS team has created a list of synthetically generated questions and answers for each OpenShift product area, referred to as the golden set. Each OCP team will be assigned an Epic to review and correct the list of questions related to their product area.
This golden set of questions and answers (hosted at the Q&A Document link) will serve as the baseline for evaluating the answers generated by OLS.
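For illustration only, one entry in such a golden set could be represented with a structure along the lines below; the field names and layout are assumptions for this sketch, not the actual format of the Q&A Document.

```python
# Hypothetical shape of a single golden-set entry. Field names are
# illustrative assumptions; the real Q&A Document may be organized differently.
from dataclasses import dataclass

@dataclass
class GoldenSetEntry:
    product_area: str       # e.g. "Networking", "Storage"
    question: str           # the reviewed, human-corrected question
    golden_answer: str      # the expert-approved reference answer
    reviewed_by: str = ""   # SME who validated the entry
    notes: str = ""         # optional reviewer comments

entry = GoldenSetEntry(
    product_area="Networking",
    question="How do I configure an egress IP in OpenShift?",
    golden_answer="Egress IPs are configured with the EgressIP custom resource ...",
    reviewed_by="jdoe",
)
print(entry.question)
```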
Background
To provide accurate responses, OLS takes a user prompt, retrieves relevant information from OCP documentation (retrieval-augmented generation, RAG), and then summarizes it with an LLM.
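As a rough sketch of that flow (illustrative only: the tiny in-memory corpus and the helper functions below are assumptions, not OLS code or APIs):

```python
# Minimal sketch of a retrieval-augmented generation (RAG) flow like the one
# described above. The in-memory DOCS corpus and the llm_summarize stub are
# placeholders; OLS searches real OCP documentation and calls a hosted LLM.

DOCS = [
    "Egress IPs are configured with the EgressIP custom resource.",
    "Routes expose services externally via the OpenShift router.",
]

def search_docs(prompt: str, top_k: int = 1) -> list[str]:
    """Naive keyword retrieval standing in for a real vector search."""
    words = prompt.lower().split()
    scored = sorted(DOCS, key=lambda d: -sum(w in d.lower() for w in words))
    return scored[:top_k]

def llm_summarize(prompt: str, passages: list[str]) -> str:
    """Placeholder for the LLM call that grounds the answer in the passages."""
    return f"Based on the docs: {' '.join(passages)}"

def answer(prompt: str) -> str:
    passages = search_docs(prompt)           # 1. retrieve relevant documentation
    return llm_summarize(prompt, passages)   # 2. summarize with the LLM

print(answer("How do I configure an egress IP?"))
```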
Requirement/Request
We need each product team to:
- Review and correct the questions.
- Review and correct the answers.
- Add additional relevant questions and answers.
Feedback on Received Questions
Since the questions have been synthetically generated by an AI system, we've received feedback that:
- Some questions are not related to the assigned product area. If you encounter such questions, either assign them to the relevant product team by moving the Epic to their product board or reach out to @Gaurav Singh.
- Some questions may not be valid or might appear awkward. As part of our request, please correct these questions and add more relevant ones.
How We Will Use the "Answer Quality Metrics"
Based on our findings, actions may range from low-effort tasks like updating product information in documentation to high-effort tasks like fine-tuning the model. We will evaluate and prioritize these actions accordingly.
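To make the comparison against the golden set concrete, one simple example of an answer-quality metric (an illustrative assumption, not the metric this feature prescribes) is a token-overlap F1 score between an OLS answer and the corresponding golden answer:

```python
# Sketch of one possible answer-quality metric: token-level F1 overlap between
# an OLS-generated answer and the golden answer. Illustrative assumption only;
# the OLS team may choose different metrics (e.g. human or LLM-based grading).
from collections import Counter

def token_f1(generated: str, golden: str) -> float:
    gen = Counter(generated.lower().split())
    gold = Counter(golden.lower().split())
    overlap = sum((gen & gold).values())   # shared tokens, counted with multiplicity
    if overlap == 0:
        return 0.0
    precision = overlap / sum(gen.values())
    recall = overlap / sum(gold.values())
    return 2 * precision * recall / (precision + recall)

score = token_f1(
    "Use the EgressIP custom resource to configure egress IPs.",
    "Egress IPs are configured with the EgressIP custom resource.",
)
print(f"F1 = {score:.2f}")  # higher scores indicate closer agreement with the golden answer
```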