Uploaded image for project: 'Red Hat Enterprise Linux AI'
  1. Red Hat Enterprise Linux AI
  2. RHELAI-2362

Precomputed Dataset Filtering

XMLWordPrintable

    • False
    • Hide

      None

      Show
      None
    • False
    • Not Selected
    • RHELAI-2361End user is successful with Custom Skills Only Worfklow
    • 100% To Do, 0% In Progress, 0% Done

      Feature Overview (mandatory - Complete while in New status)

      • Precomputed skill dataset is massive in size and might lead to new user skills not being learned effectively if they are present in the precomputed dataset

      Goals (mandatory - Complete while in New status)
      Provide high-level goal statement, providing user context and expected user outcome(s) for this Feature

      • Allow filtering the precomputed dataset to remove specific tags for skills from precomputed dataset that user wants to add themselves 
      • Enable user's skills to be learned more effectively

      Requirements (mandatory -_ Complete while in Refinement status):

      • Research : Tag the precomputed dataset
      • Engineering: Have the ability to run filters in the mixing stage

      Done - Acceptance Criteria (mandatory - Complete while in Refinement status):

      • Precomputed dataset can be tagged
      • Filtering mechanism/knobs in post processing SDG stage

      Use Cases - i.e. User Experience & Workflow: (Initial completion while in Refinement status):

      Out of Scope {}{}(Initial completion while in Refinement status):

      Documentation Considerations {}{}(Initial completion while in Refinement status):
      Provide information that needs to be considered and planned so that documentation will meet customer needs. If the feature extends existing functionality, provide a link to its current documentation..
      <your text here>

       

      Questions to Answer {}{}(Initial completion while in Refinement status):

      • [Shiv] Precomputed datasets are not tagged with tasks, so we need to use an llm to tag the samples.  Until that happens we wont be able to enable dataset filtering
      • This is an advanced user knob, do we want to enable this for end user of Instruct Lab/RHEL AI or only document how to do this for field teams ? -  Yes, we want to introduce this in the product as well

      Background and Strategic Fit (Initial completion while in Refinement status):

      • IBM Sales teams are trying to do POCs and running into challenges due to this issue. 

      Customer Considerations {}{}(Initial completion while in Refinement status):

              rh-ee-asaluja Aditi Saluja
              rh-ee-asaluja Aditi Saluja
              Votes:
              0 Vote for this issue
              Watchers:
              1 Start watching this issue

                Created:
                Updated: