Skip to content

csana23/cv_rag_local

Repository files navigation

cv_rag_local

An LLM-based RAG project that enables users to ask questions about the content of multiple candidates' CVs/resumes: their work experience, skills, other elements of their professional career and comparison to other candidates

  • llama3 as LLM and nomic-embed-text as embedding model, both running in separate ollama instances
  • chromadb vector database for storing user input-, and PDF file embeddings, running in server mode
  • RAG component with cosine similarity search followed by ColBERT reranking and metadata inclusion in retrieved document chunks
  • FastAPI backend to process user prompts and initiate LangChain process
  • Streamlit frontend
  • deployed as a Docker container, able to utilize Nvidia GPUs
  • also available with OpenAI's gpt-3.5 model with added LangChain agent functions

demo link: https://www.youtube.com/watch?v=M--grymJ7ic

How to run

Create .env file in projects root folder:

AGENT_MODEL=llama3
EMBEDDING_MODEL=nomic-embed-text
CHATBOT_URL=http://host.docker.internal:8005/cv-rag-agent

Add your pdfs to cv_rag/chromadb_etl/data folder

Build with docker:

docker-compose up --build

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published