A powerful Retrieval-Augmented Generation (RAG) system for PDF documents using LlamaParse, LangChain, and Groq. This API allows you to upload PDF files, process them into a searchable knowledge base, and query them using natural language.
- **Multi-PDF Processing**: Upload and process multiple PDF files simultaneously
- **Intelligent Document Parsing**: Uses LlamaParse for high-quality PDF text extraction
- **Smart Chunking**: Automatically splits documents into optimal chunks for retrieval
- **Semantic Search**: Vector-based similarity search using HuggingFace embeddings
- **Natural Language Queries**: Ask questions in plain English and get relevant answers
- **Source Attribution**: Get references to the source documents for each answer
- **Persistent Storage**: Vector database persists between sessions using ChromaDB
- **Fast Inference**: Powered by Groq for quick response times
```
┌───────────────┐     ┌───────────────┐     ┌───────────────┐
│  PDF Upload   │ ──▶ │  LlamaParse   │ ──▶ │ Text Chunking │
└───────────────┘     └───────────────┘     └───────────────┘
                                                    │
                                                    ▼
┌───────────────┐     ┌───────────────┐     ┌─────────────────┐
│ User Queries  │ ◀── │   Groq LLM    │ ◀── │ Vector Database │
└───────────────┘     └───────────────┘     │   (ChromaDB)    │
                                            └─────────────────┘
                                                    ▲
                                                    │
                                            ┌───────────────┐
                                            │  HuggingFace  │
                                            │  Embeddings   │
                                            └───────────────┘
```
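The retrieval path in the diagram — embed the chunks, rank them against an embedded query, and hand the best matches to the LLM — can be sketched with a deliberately tiny stand-in. The bag-of-words `embed` and the `retrieve` helper below are illustrative only; the actual system uses HuggingFace sentence-transformer embeddings stored in ChromaDB:

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding' standing in for a real
    sentence-transformers vector."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Return the k chunks most similar to the query."""
    qv = embed(query)
    return sorted(chunks, key=lambda c: cosine(qv, embed(c)), reverse=True)[:k]
```

In the real pipeline, ChromaDB performs this nearest-neighbor step over persisted vectors, and the top-k chunks become the context passed to the Groq LLM.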
- Python 3.8+
- LlamaCloud API Key (from LlamaIndex)
- Groq API Key (from Groq)
1. **Clone the repository**
   ```bash
   git clone https://github.com/anujgawde/rag-engine.git
   cd rag-engine
   ```
2. **Install dependencies**
   ```bash
   pip install -r requirements.txt
   ```
3. **Set up environment variables**: create a `.env` file in the root directory:
   ```
   LLAMA_CLOUD_API_KEY=your_llama_cloud_api_key
   GROQ_API_KEY=your_groq_api_key
   EMBEDDING_MODEL=sentence-transformers/all-MiniLM-L6-v2
   PERSIST_DIR=./chroma_db
   ```
4. **Run the application**
   ```bash
   python main.py
   ```
The API will be available at http://localhost:8000
Upload and process PDF files into the knowledge base.
Endpoint: POST /upload-pdfs
Request:

```bash
curl -X POST "http://localhost:8000/upload-pdfs" \
  -H "accept: application/json" \
  -H "Content-Type: multipart/form-data" \
  -F "[email protected]" \
  -F "[email protected]"
```

Response:

```json
{
  "status": "success",
  "message": "Successfully ingested 2 PDF files",
  "files_processed": 2,
  "file_names": ["document1.pdf", "document2.pdf"]
}
```

Ask questions about your uploaded documents.
Endpoint: POST /query
Request:
curl -X POST "http://localhost:8000/query" \
-H "accept: application/json" \
-H "Content-Type: application/x-www-form-urlencoded" \
-d "question=What are the main topics discussed in the documents?"Response:
{
"status": "success",
"message": "Query processed successfully",
"query": "What are the main topics discussed in the documents?",
"answer": "Based on the documents, the main topics include...",
"sources": [
{
"content": "Excerpt from the relevant document...",
"metadata": {
"source": "document1.pdf",
"page": 1,
"file_name": "document1.pdf"
}
}
]
}Once the server is running, you can access the interactive API documentation at:
- Swagger UI: http://localhost:8000/docs
- ReDoc: http://localhost:8000/redoc
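The same `/query` call can be issued from Python with nothing but the standard library. This is a minimal sketch — the `build_query_request` and `ask` helpers are illustrative, not part of the API:

```python
import json
import urllib.parse
import urllib.request

def build_query_request(question: str, base_url: str = "http://localhost:8000") -> urllib.request.Request:
    """Build the form-encoded POST request for the /query endpoint."""
    data = urllib.parse.urlencode({"question": question}).encode()
    return urllib.request.Request(
        f"{base_url}/query",
        data=data,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
    )

def ask(question: str, base_url: str = "http://localhost:8000") -> dict:
    """Send the query to a running server and decode the JSON response."""
    with urllib.request.urlopen(build_query_request(question, base_url)) as resp:
        return json.loads(resp.read())
```

With the server running locally, `ask("What are the main topics discussed in the documents?")` returns the same JSON shown in the curl example above.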
| Variable | Description | Default |
|---|---|---|
| `LLAMA_CLOUD_API_KEY` | API key for LlamaParse service | Required |
| `GROQ_API_KEY` | API key for Groq LLM service | Required |
| `EMBEDDING_MODEL` | HuggingFace embedding model | `sentence-transformers/all-MiniLM-L6-v2` |
| `PERSIST_DIR` | Directory to store the vector database | `./chroma_db` |
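One way the table above could be enforced at startup is a small loader that fails fast on the required keys and applies the documented defaults. `load_settings` is a hypothetical helper, not code from this repository:

```python
import os

REQUIRED_KEYS = ("LLAMA_CLOUD_API_KEY", "GROQ_API_KEY")

def load_settings(env=os.environ) -> dict:
    """Read configuration from the environment, raising early if a
    required key is absent and filling in the documented defaults."""
    missing = [k for k in REQUIRED_KEYS if not env.get(k)]
    if missing:
        raise RuntimeError(f"Missing required environment variables: {', '.join(missing)}")
    return {
        "llama_cloud_api_key": env["LLAMA_CLOUD_API_KEY"],
        "groq_api_key": env["GROQ_API_KEY"],
        "embedding_model": env.get("EMBEDDING_MODEL", "sentence-transformers/all-MiniLM-L6-v2"),
        "persist_dir": env.get("PERSIST_DIR", "./chroma_db"),
    }
```

Failing at startup, rather than on the first request, makes a missing API key obvious immediately.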
You can customize various parameters in the `PDFRAG` class:
- **Chunk Size**: Modify `chunk_size` in `RecursiveCharacterTextSplitter`
- **Chunk Overlap**: Adjust `chunk_overlap` for better context preservation
- **Retrieval Count**: Change the `k` parameter in the retriever setup
- **LLM Model**: Switch between different Groq models
- **Temperature**: Adjust creativity vs. consistency in responses
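The interplay of `chunk_size` and `chunk_overlap` is easiest to see on a plain character-based sliding window. This toy `chunk_text` mimics the shape of what `RecursiveCharacterTextSplitter` produces, without its recursive separator logic:

```python
def chunk_text(text: str, chunk_size: int = 500, chunk_overlap: int = 50) -> list[str]:
    """Split text into fixed-size windows, each sharing chunk_overlap
    characters with its predecessor so context survives the cut."""
    if chunk_overlap >= chunk_size:
        raise ValueError("chunk_overlap must be smaller than chunk_size")
    step = chunk_size - chunk_overlap
    return [text[i:i + chunk_size] for i in range(0, max(len(text) - chunk_overlap, 1), step)]
```

Larger overlap preserves more context across chunk boundaries but produces more chunks (more embeddings to compute and store); larger chunks reduce the chunk count but make each retrieved passage less focused.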
```python
# Upload research papers
files = ["paper1.pdf", "paper2.pdf", "paper3.pdf"]
# Query: "What are the main findings across these research papers?"
```

```python
# Upload company manuals, policies, and procedures
files = ["employee_handbook.pdf", "safety_procedures.pdf", "it_policies.pdf"]
# Query: "What is the company's remote work policy?"
```

```python
# Upload contracts and legal documents
files = ["contract1.pdf", "terms_of_service.pdf", "privacy_policy.pdf"]
# Query: "What are the termination clauses in these contracts?"
```
- **"No documents have been ingested yet"**
  - Make sure you've uploaded PDF files using the `/upload-pdfs` endpoint
  - Check that the vector database directory exists and contains data
- **"Error parsing PDF"**
  - Ensure your PDF files are not corrupted
  - Check that your LlamaCloud API key is valid and has sufficient credits
- **Slow response times**
  - Consider using a smaller embedding model for faster processing
  - Reduce the chunk size or retrieval count for quicker responses
- **Memory issues**
  - For large documents, consider increasing the chunk size to reduce the total number of chunks
  - Monitor system memory usage during processing
- **API key errors**
  - Ensure your API keys are correctly set in the `.env` file
  - Check API key validity and quota limits
  - Verify network connectivity to external services
1. Fork the repository
2. Create a feature branch (`git checkout -b feature/amazing-feature`)
3. Commit your changes (`git commit -m 'Add some amazing feature'`)
4. Push to the branch (`git push origin feature/amazing-feature`)
5. Open a Pull Request
- LlamaIndex for excellent PDF parsing capabilities
- LangChain for the RAG framework
- Groq for fast LLM inference
- ChromaDB for vector storage
- HuggingFace for embedding models
If you encounter any issues or have questions, please:
- Check the troubleshooting section
- Search existing GitHub issues
- Create a new issue with detailed information about your problem
⭐ If you find this project helpful, please consider giving it a star!