Type: Feature
Resolution: Unresolved
Priority: Normal
Fix Version/s: ols-1.1
Affects Version/s: None
Component/s: None
Labels:

Work Type: Strategic Product Work
Blocked: False
Blocked Reason: None
Ready: False
Parent Link: OCPSTRAT-895 Openshift LightSpeed GA
Hierarchy Progress Bar: 83% To Do, 17% In Progress, 0% Done
Risk Score: 0

Discussion Needed:

Program Call

Background

A high-quality RAG process focuses on three areas of optimization:

1. Contextualized splitter function
2. Embedding techniques and rich metadata
3. Retrieval techniques

This Feature card covers the third area, retrieval techniques. The goal is to move beyond naive RAG to advanced retrieval by identifying the best combination of techniques.

Deliverables

The Feature should evaluate techniques like:

RAG Evaluate retrieval performance and quality using embeddings loaded from the filesystem vs. a vector database (chromadb)
- This may have a dependency or impact on OLS-120
RAG Evaluate retrieval performance using Parent Document Retriever
- Example: https://blog.lancedb.com/modified-rag-parent-document-bigger-chunk-retriever-62b3d1e79bc6
RAG Evaluate retrieval performance by using Re-Ranking
- Example: https://blog.lancedb.com/simplest-method-to-improve-rag-pipeline-re-ranking-cf6eaec6d544
RAG Evaluate retrieval performance by using query rewriting
- Reference paper: https://arxiv.org/pdf/2305.14283.pdf
RAG Evaluate the quality of LLM answers using RAG summarization
- For this technique, the prompt sent to the LLM is augmented with a summary of the retrieved documents/chunks instead of sending the retrieved chunks.
- Evaluate any improvement in token utilization when using this technique.
RAG Evaluate the quality of retrieval when using labeled topics metadata filtering
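
The evaluations listed above all need a common way to score one retriever against another. A minimal sketch of such a harness follows, assuming a small hand-labeled set of queries with known relevant document IDs. The metrics (hit rate at k and mean reciprocal rank) and every name in the code are illustrative choices, not something this card prescribes.

```python
from typing import Callable, Dict, List, Sequence, Set

# Hypothetical retriever interface: (query, k) -> ranked list of document IDs.
Retriever = Callable[[str, int], List[str]]


def hit_rate_at_k(results: Sequence[List[str]],
                  relevant: Sequence[Set[str]], k: int) -> float:
    """Fraction of queries with at least one relevant doc in the top k."""
    hits = sum(1 for ranked, rel in zip(results, relevant)
               if rel & set(ranked[:k]))
    return hits / len(results)


def mean_reciprocal_rank(results: Sequence[List[str]],
                         relevant: Sequence[Set[str]]) -> float:
    """Average of 1/rank of the first relevant doc (0 when none is found)."""
    total = 0.0
    for ranked, rel in zip(results, relevant):
        for rank, doc_id in enumerate(ranked, start=1):
            if doc_id in rel:
                total += 1.0 / rank
                break
    return total / len(results)


def evaluate(retriever: Retriever, queries: Sequence[str],
             relevant: Sequence[Set[str]], k: int = 3) -> Dict[str, float]:
    """Run one retriever over the labeled queries and score it."""
    results = [retriever(q, k) for q in queries]
    return {
        "hit_rate": hit_rate_at_k(results, relevant, k),
        "mrr": mean_reciprocal_rank(results, relevant),
    }
```

Running `evaluate` once per candidate (for example a baseline retriever, a parent-document retriever, and a re-ranked retriever) over the same labeled queries gives directly comparable numbers. Latency and token counts would need to be measured separately.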

Based on the evaluations, select the best candidates to:

Create an advanced RAG chain definition
- Example of an advanced RAG chain:
  - Rewrite prompt 3x (q1, q2, q3) > Retrieve top-3 per prompt > prioritize documents matched by questions > Ensemble > ReRank > Summarize > Augment Prompt > send prompt to LLM
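
The example chain above can be sketched as a function with pluggable stages. This is an illustration only: the stage callables (`rewrite`, `retrieve`, `rerank`, `summarize`) are hypothetical placeholders for whichever implementations the evaluations select, and reciprocal rank fusion is an assumed choice for the ensemble step, picked because it naturally favors documents matched by more than one rewritten question.

```python
from collections import defaultdict
from typing import Callable, Dict, List


def reciprocal_rank_fusion(rankings: List[List[str]], c: int = 60) -> List[str]:
    """Merge several ranked lists; docs appearing in more lists score higher."""
    scores: Dict[str, float] = defaultdict(float)
    for ranked in rankings:
        for rank, doc_id in enumerate(ranked, start=1):
            scores[doc_id] += 1.0 / (c + rank)
    # Sort by descending score, breaking ties by doc ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


def advanced_rag_prompt(
    question: str,
    rewrite: Callable[[str, int], List[str]],      # hypothetical: question -> n rewrites
    retrieve: Callable[[str, int], List[str]],     # hypothetical: query -> top-k doc IDs
    rerank: Callable[[str, List[str]], List[str]], # hypothetical: reorder/trim docs
    summarize: Callable[[List[str]], str],         # hypothetical: docs -> summary text
    n_rewrites: int = 3,
    top_k: int = 3,
) -> str:
    """Build the augmented prompt; sending it to the LLM is left to the caller."""
    queries = rewrite(question, n_rewrites)            # q1, q2, q3
    rankings = [retrieve(q, top_k) for q in queries]   # top-3 per rewritten prompt
    fused = reciprocal_rank_fusion(rankings)           # prioritize + ensemble
    reranked = rerank(question, fused)                 # re-rank against the original question
    summary = summarize(reranked)                      # summarize instead of sending raw chunks
    return f"Context:\n{summary}\n\nQuestion: {question}"
```

Keeping each stage behind a plain callable makes it easy to swap a single technique in or out and re-run the same evaluation against the whole chain.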

Assignee: Gaurav Singh

Reporter: William Caban

Votes: 0

Watchers: 3

Created: 2024/02/18 7:04 AM

Updated: 2024/11/22 12:33 AM
