How can I make MDocAgent work interactively on a local PDF database? #6

Thanks for your excellent work!

I have some questions about user interaction with MDocAgent.

As far as I can tell, MDocAgent currently only runs on benchmark datasets.
How can I use it in a real-world PDF RAG scenario?
Suppose I have 500 PDFs and preprocess them to extract the text and an image of each page. That is the offline part.
The online part would then be:
1. A user inputs a query.
2. The retrieval tools retrieve the top-k images and top-k texts.
3. MDocAgent generates the answer based on the retrieved texts, the retrieved images, and the user's query.
4. Finally, the answer is printed to the terminal or saved as JSON, etc.
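
To make the question concrete, here is a minimal sketch of the online loop I have in mind. It does not use MDocAgent's actual API: `generate_answer` is a hypothetical stand-in for whatever the real entry point is, `retrieve_top_k` is a toy word-overlap ranker standing in for the real text and image retrievers, and the `Page` record is just my assumption about what the offline step produces.

```python
import json
import re
from dataclasses import dataclass


@dataclass
class Page:
    """One preprocessed PDF page (assumed output of the offline step)."""
    doc_id: str
    page_no: int
    text: str
    image_path: str


def tokenize(s):
    """Lowercase a string and split it into a set of alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", s.lower()))


def retrieve_top_k(query, pages, k=2):
    """Toy retriever: rank pages by word overlap with the query.

    Placeholder only; a real setup would plug in the actual text and
    image retrievers here.
    """
    q = tokenize(query)
    ranked = sorted(pages, key=lambda p: len(q & tokenize(p.text)), reverse=True)
    return ranked[:k]


def generate_answer(query, texts, image_paths):
    """Hypothetical stand-in for the MDocAgent call (real entry point unknown to me)."""
    return f"[stub answer to {query!r} using {len(texts)} texts and {len(image_paths)} images]"


def answer_query(query, pages, k=2):
    """Steps 1-3: take a query, retrieve top-k texts and images, generate an answer."""
    # Both calls use the same toy ranker; in practice they would be
    # two different retrievers (one over page text, one over page images).
    text_hits = retrieve_top_k(query, pages, k)
    image_hits = retrieve_top_k(query, pages, k)
    answer = generate_answer(
        query,
        [p.text for p in text_hits],
        [p.image_path for p in image_hits],
    )
    return {
        "query": query,
        "answer": answer,
        "sources": [[p.doc_id, p.page_no] for p in text_hits],
    }


def interactive_loop(pages, k=2):
    """Step 4: read queries from the terminal and print each answer as JSON."""
    while True:
        try:
            query = input("query> ").strip()
        except EOFError:
            break
        if not query:
            break
        print(json.dumps(answer_query(query, pages, k), indent=2))


# Tiny non-interactive demo with made-up pages.
demo_pages = [
    Page("report.pdf", 1, "The revenue in 2023 was 5 million", "report_p1.png"),
    Page("manual.pdf", 3, "Press the reset button to restart", "manual_p3.png"),
]
print(json.dumps(answer_query("What was the revenue in 2023?", demo_pages, k=1), indent=2))
```

Calling `interactive_loop(demo_pages)` would start the terminal loop; an empty line ends it. What I am missing is what should replace `retrieve_top_k` and `generate_answer` in the actual codebase.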

These are the issues I've run into while trying to apply MDocAgent to my own PDF database for RAG.
It's possible I've missed something, but could you please offer some guidance on how to set up this kind of interaction?
