FEATURE: Require multi-PDF support in Question_and_answer.py (tool) of pdf_agent.py (agent) #131

NimeshKotian · 2025-03-05T21:09:10Z

The current implementation of question_and_answer.py only processes one PDF at a time. This feature request proposes adding support for processing multiple PDFs simultaneously, which would streamline workflows for users handling multiple documents.

Many use cases require analyzing multiple documents at once. Handling each PDF individually is inefficient, especially when comparing or aggregating data across several PDFs.

I would like the tool to accept multiple PDF as input— as a list of dictionaries —and process them either sequentially or concurrently. The output should either be consolidated into a single result or clearly separated by document, making it easier for users to work with multiple documents.

We might require one more tool which can decide which pdf's content to use based on user's query.

Multi-PDF support would greatly benefit users such as researchers and analysts who regularly work with multiple documents. This enhancement would save time and simplify the process of comparing or extracting information from several PDFs simultaneously.

gurdeep330 · 2025-03-10T14:55:54Z

Hi @NimeshKotian

Thanks for creating the issue! 🚀

To enhance this iteration, let's integrate NVIDIA's RAPIDS library. Specifically, we should explore incorporating FAISS with cuVS for GPU-accelerated vector search.

For reference, NVIDIA has a detailed technical blog on cuVS and FAISS optimizations. Additionally, a useful FAISS tutorial from LangChain might help with implementation.

Let me know what you think!

dmccloskey · 2025-03-12T19:38:29Z

@NimeshKotian A team from our recent hackathon developed several scripts using FAISS #137. Perhaps these could be a useful starting point for simple index building and saving using FAISS.

NimeshKotian added the enhancement New feature or request label Mar 5, 2025

NimeshKotian mentioned this issue Mar 5, 2025

Feat/talk2scholars: PDF Agent & Question and Answer Tool #115

Merged

17 tasks

gurdeep330 assigned NimeshKotian Mar 5, 2025

gurdeep330 added the T2S label Mar 5, 2025

gurdeep330 added this to the Talk2Scholars milestone Mar 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FEATURE: Require multi-PDF support in Question_and_answer.py (tool) of pdf_agent.py (agent) #131

FEATURE: Require multi-PDF support in Question_and_answer.py (tool) of pdf_agent.py (agent) #131

NimeshKotian commented Mar 5, 2025

gurdeep330 commented Mar 10, 2025

dmccloskey commented Mar 12, 2025

FEATURE: Require multi-PDF support in Question_and_answer.py (tool) of pdf_agent.py (agent) #131

FEATURE: Require multi-PDF support in Question_and_answer.py (tool) of pdf_agent.py (agent) #131

Comments

NimeshKotian commented Mar 5, 2025

gurdeep330 commented Mar 10, 2025

dmccloskey commented Mar 12, 2025