Intelligent query-routing agent for multi-document RAG retrieval systems

abhishekgit03/Multi-Document-RAG-Agent

Multi-Document RAG Agent Architecture

The Multi-Document RAG Agent is an intelligent query routing system built to retrieve and process information from large sets of documents efficiently. It uses a Retrieval-Augmented Generation (RAG) framework to select the most relevant documents and passages, significantly improving query response accuracy and efficiency. By generating concise summaries for each document, the system creates a robust mapping mechanism that aligns user queries with the most relevant documents. These summaries encapsulate the core content of documents, streamlining the retrieval process and enabling precise targeting of relevant data.

Diagram 1

This diagram illustrates the passage and summary embedding creation process.

Features

  • Smart Document Selection: Utilizes vector similarity search combined with document summaries to effectively map queries to the most relevant documents.
  • Granular Retrieval: Breaks documents into passages for fine-tuned retrieval, further narrowing down the search space.
  • Adaptive Query Processing: Differentiates between general and document-specific queries for optimal handling.
  • Efficient Token Usage: Leverages document summaries to minimize unnecessary resource consumption, focusing only on high-relevance data.

Technologies Used

  • LLM: Google Gemini
  • Framework: LangChain
  • Vector Database: Pinecone
  • Language: Python

How It Works

Diagram 2

This diagram illustrates the document selection and retrieval process.

Data Preparation

  1. Document Summarization:

    • Summarize documents using LangChain.
    • Generate embeddings for summaries and store them in a Pinecone namespace summary_embeddings with a unique document ID.
  2. Passage Embedding:

    • Split documents into 1000-word chunks.
    • Convert these passages into embeddings and store them in the passages_embeddings namespace, linked with document IDs.
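The data-preparation steps above can be sketched in runnable form. Note that this is a simplified illustration, not the repository's actual code: a toy bag-of-words embedding and plain dictionaries stand in for the Gemini embeddings and the Pinecone summary_embeddings / passages_embeddings namespaces, and the function names (chunk_document, embed, prepare) are hypothetical.

```python
# Sketch of the data-preparation stage (assumptions: toy embedding,
# in-memory dicts standing in for Pinecone namespaces).
from collections import Counter

CHUNK_WORDS = 1000  # passage size used by the project


def chunk_document(text, size=CHUNK_WORDS):
    """Split a document into fixed-size word chunks (passages)."""
    words = text.split()
    return [" ".join(words[i:i + size]) for i in range(0, len(words), size)]


def embed(text):
    """Stand-in embedding: lower-cased bag-of-words term counts."""
    return Counter(text.lower().split())


def prepare(documents, summarize):
    """Build both 'namespaces': summary and passage embeddings keyed by doc ID.

    `documents` maps document ID -> full text; `summarize` is any callable
    (in the real system, a LangChain summarization chain over Gemini).
    """
    summary_embeddings = {}
    passage_embeddings = {}
    for doc_id, text in documents.items():
        summary_embeddings[doc_id] = embed(summarize(text))
        passage_embeddings[doc_id] = [embed(p) for p in chunk_document(text)]
    return summary_embeddings, passage_embeddings
```

Keeping summaries and passages in separate namespaces, both keyed by document ID, is what lets query time first match on the small summary index and only then search passages from the selected documents.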

Query Processing

  1. Query Classification:

    • Use a one-shot classifier to determine if a query is general or requires document retrieval.
  2. Summary Matching:

    • For document queries, generate embeddings and perform similarity searches on summary_embeddings.
    • Retrieve the top 3 document summaries.
  3. Document Selection:

    • Use the retrieved summaries to identify relevant document IDs via a document_selection_prompt.
  4. Passage Matching:

    • Extract the top 10 relevant passages from the selected documents using similarity search on the user query and passage embeddings.
  5. Final Response Generation:

    • Use the retrieved passages and the query to generate the final response.
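The query-time flow (summary matching, document selection, passage matching) can be sketched as one routing function. Again this is an illustrative stand-in, not the repository's code: cosine similarity over toy bag-of-words vectors replaces Pinecone similarity search, and a simple top-k cutoff replaces the LLM-driven document_selection_prompt; all names here are hypothetical.

```python
# Sketch of query-time routing: summary match -> document selection ->
# passage match. Assumes the toy bag-of-words embedding from data prep.
import math
from collections import Counter


def embed(text):
    """Stand-in embedding: lower-cased bag-of-words term counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def route_query(query, summary_embeddings, passage_store,
                k_docs=3, k_passages=10):
    """Return the top passages for a document-specific query.

    `summary_embeddings`: doc ID -> summary embedding.
    `passage_store`: doc ID -> list of (passage_text, embedding) pairs.
    """
    q = embed(query)
    # Summary matching: rank documents by summary similarity, keep top k_docs.
    ranked = sorted(summary_embeddings,
                    key=lambda d: cosine(q, summary_embeddings[d]),
                    reverse=True)
    selected = ranked[:k_docs]  # LLM selection prompt in the real system
    # Passage matching: score passages only within the selected documents.
    scored = [(cosine(q, emb), doc_id, text)
              for doc_id in selected
              for text, emb in passage_store[doc_id]]
    scored.sort(reverse=True)
    return [(doc_id, text) for _, doc_id, text in scored[:k_passages]]
```

The returned passages would then be stuffed into the final generation prompt alongside the original query. Restricting the passage search to the selected documents is what keeps token usage low relative to searching every passage in the corpus.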

Key Advantages

  1. Efficiency: Reduces unnecessary token consumption by narrowing down search results.
  2. Flexibility: Handles multi-document queries and retrieves answers spanning multiple documents.
  3. Scalability: Optimized for large document sets with vector search.

Limitations

  • Edge Cases: Struggles with large-scale queries requiring data from many documents (e.g., insights from 40+ documents simultaneously).
  • Potential Information Loss: When multiple documents are retrieved, combining passages may omit some details.

Demo Video: Watch

Note: This project was developed for an assignment provided by RaccoonAI, a Bangalore-based startup, for an SDE (AI) role.
