XML

Word

Printable

Type: Task
Resolution: Done
Priority: Major
Fix Version/s: 1.9.0
Affects Version/s: None
Component/s: Knowledge, Lightspeed
Labels:
None

Story Points:
5
Epic Link:
[Lightspeed] Personal AI Notebooks Feature
Blocked:
False
Blocked Reason:

Hide

None

Show
None
Ready:
False
Git Pull Request:
https://github.com/JslYoon/RHDH-AI-Notebooks/blob/JslYoon-ai-notebooks/workspaces/lightspeed/AI-NOTEBOOKS.md
Intelligence Requested:
Market:

Sprint:
RHDH AI Sprint 3284, RHDH AI Sprint 3285, RHDH AI Sprint 3286

SFDC Cases Counter:
SFDC Cases Open:
SFDC Cases Links:

Task

As an engineer working in the "AI Notebooks" feature, I need to make a component where given a url, pdf, doc, docx, txt, md, or json it will extract the document string and pass it on to the document rag chunk generator after safety checking.

For extensions doc, docx, txt, md, and json, the component should clean and delete necessary tokens.

For pdf, the scope will only contain native pdf (not scanned pdf) to convert into doc.

For url, it will be only the specific url page content to be security checked and added to the vector database.

Ensure security and stability

Background

Dependencies and Blockers

QE impacted work
Documentation impacted work

Acceptance Criteria

Assignee:: Lucas Yoon

Reporter:: Lucas Yoon

Team:: RHDH AI

Votes:: 0 Vote for this issue

Watchers:: 1 Start watching this issue

Created:: 2025/12/10 4:30 PM

Updated:: 2026/01/20 2:37 PM

Resolved:: 2026/01/20 2:37 PM

Details

Description

Task

Background

Dependencies and Blockers

Acceptance Criteria

Attachments

Easy Agile Planning Poker

Activity

People

Dates