Uploaded image for project: 'Red Hat Enterprise Linux AI'
  1. Red Hat Enterprise Linux AI
  2. RHELAI-2971

[ilab] Modularize ingestion and pre-processing pipeline, SDG and RAG libraries

XMLWordPrintable

    • Icon: Outcome Outcome
    • Resolution: Unresolved
    • Icon: Undefined Undefined
    • None
    • None
    • None
    • None
    • 80% To Do, 0% In Progress, 20% Done
    • Not Selected
    • False
    • Hide

      None

      Show
      None

      SDG and RAG today consume the input data directly, use docling and further format it to prepare the data prior to kicking off their respective workflows. 

      We want to pull out the data ingestion and pre-processing functions into the core library so both SDG and RAG can consume it (and, eventually other libraries like Eval etc).

      A part of this work will be done in 1.4 where the SDG library (where this is housed today) will split it into three different APIs: pre-processing, SDG and post-processing. It will continue to live in the SDG library, and we'll start transitioning it to Core in phases. 

              Unassigned Unassigned
              jepandit@redhat.com Jehlum Vitasta Pandit
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: