Data can be found here

Execute run.py to obtain the RAG results.

Chunking

We currently support the following chunking methods in split_text.py:

RecursiveCharacterTextSplitter
CharacterTextSplitter
SemanticChunker
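
To illustrate the idea behind recursive character splitting, here is a minimal pure-Python sketch. It is not the code in split_text.py and not LangChain's implementation; the function name recursive_split, the separator order, and the chunk_size default are assumptions made for this example, and chunk overlap is left out for brevity.

```python
def recursive_split(text, chunk_size=200, separators=("\n\n", "\n", " ", "")):
    """Split text into chunks of at most chunk_size characters.

    Coarse separators (paragraphs) are tried first; finer ones (lines,
    words, single characters) are used only for pieces that are too long.
    """
    if len(text) <= chunk_size:
        return [text] if text else []
    if not separators or separators[0] == "":
        # Last resort: cut at fixed character offsets.
        return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]

    sep, finer = separators[0], separators[1:]
    chunks, current = [], ""
    for part in text.split(sep):
        candidate = current + sep + part if current else part
        if len(candidate) <= chunk_size:
            # The part still fits: keep growing the current chunk.
            current = candidate
            continue
        if current:
            chunks.append(current)
        if len(part) <= chunk_size:
            current = part
        else:
            # The part alone is too long: split it with finer separators.
            chunks.extend(recursive_split(part, chunk_size, finer))
            current = ""
    if current:
        chunks.append(current)
    return chunks


sample = "alpha beta gamma\n\ndelta epsilon zeta eta theta iota kappa"
for chunk in recursive_split(sample, chunk_size=20):
    print(repr(chunk))
```

A fixed-separator splitter corresponds roughly to passing a single separator, while a semantic chunker decides boundaries from embedding similarity between sentences rather than from characters.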

Retrieval Stage

In the retrieval stage, we support (see retrieval_utils.py):

Retrieving similar documents to a given query
Using the HyDE (Hypothetical Document Embeddings) method, which first generates a synthetic answer document for the query and then compares that document, instead of the raw query, with the index
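
The difference between the two retrieval modes can be sketched as follows. This is a toy example, not the code in retrieval_utils.py: the bag-of-words embed function stands in for a real embedding model, and the generate argument is a hypothetical callable standing in for the LLM call that writes the synthetic document.

```python
import math
from collections import Counter


def embed(text):
    """Toy bag-of-words vector (assumption: stands in for a real embedding model)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, docs, k=1):
    """Plain retrieval: rank documents by similarity to the query itself."""
    query_vec = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(query_vec, embed(d)), reverse=True)
    return ranked[:k]


def hyde_retrieve(query, docs, generate, k=1):
    """HyDE-style retrieval: rank documents by similarity to a generated answer."""
    hypothetical_doc = generate(query)  # hypothetical stand-in for an LLM call
    return retrieve(hypothetical_doc, docs, k)


docs = [
    "paris is the capital city of france",
    "the eiffel tower is tall",
    "bananas are yellow fruit",
]


def fake_llm(query):
    # A canned answer, used here only so the example runs without an API key.
    return "paris is the capital of france"


print(hyde_retrieve("which town hosts the french government", docs, fake_llm))
```

The query shares almost no vocabulary with the relevant document, while the generated answer does, which is the situation HyDE is meant to help with.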

Evaluation

For evaluation, we use the following metrics (see metrics.py):

deepeval GEval
FaithfulnessMetric
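
As a rough intuition for a faithfulness-style metric, the score can be thought of as the share of claims in the generated answer that the retrieved context supports. The sketch below is an assumption-laden illustration, not deepeval's implementation and not the code in metrics.py: faithfulness_score and substring_judge are hypothetical names, and the substring check replaces the LLM judgment a real metric would use.

```python
def faithfulness_score(claims, context, is_supported):
    """Fraction of claims that the judge considers supported by the context."""
    if not claims:
        # Convention chosen for this sketch only: no claims means nothing to contradict.
        return 1.0
    supported = sum(1 for claim in claims if is_supported(claim, context))
    return supported / len(claims)


def substring_judge(claim, context):
    """Toy judge: a claim counts as supported if it appears verbatim in the context."""
    return claim.lower() in context.lower()


context = "The Eiffel Tower is in Paris. It opened in 1889."
claims = [
    "the eiffel tower is in paris",
    "it opened in 1889",
    "it is made of wood",
]
print(faithfulness_score(claims, context, substring_judge))
```

Two of the three claims appear in the context, so the score for this toy input is two thirds.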

Set Environment Variables:

pip install python-dotenv

In the root directory of your project, create a file named .env and add the following variables:

LANGCHAIN_TRACING_V2=true
LANGCHAIN_API_KEY="YOUR_LANGCHAIN_KEY"
LANGCHAIN_ENDPOINT="https://api.smith.langchain.com"
LANGCHAIN_PROJECT="YOUR_PROJECT_NAME"
OPENAI_API_KEY="YOUR_OPENAI_KEY"

Replace each placeholder (e.g., YOUR_LANGCHAIN_KEY, YOUR_PROJECT_NAME, and YOUR_OPENAI_KEY) with your actual values.
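
One possible way to load the .env file at startup and fail early when a variable is missing is sketched below. It assumes python-dotenv is installed; the helper names missing_vars and load_env are hypothetical and are not taken from run.py.

```python
import os

REQUIRED_VARS = (
    "LANGCHAIN_TRACING_V2",
    "LANGCHAIN_API_KEY",
    "LANGCHAIN_ENDPOINT",
    "LANGCHAIN_PROJECT",
    "OPENAI_API_KEY",
)


def missing_vars(environ=None):
    """Return the required variable names that are unset or empty."""
    environ = os.environ if environ is None else environ
    return [name for name in REQUIRED_VARS if not environ.get(name)]


def load_env():
    """Load .env into os.environ and raise if a required variable is missing."""
    from dotenv import load_dotenv  # provided by the python-dotenv package

    load_dotenv()  # looks for a .env file and adds its entries to os.environ
    missing = missing_vars()
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
```

Calling load_env() once, before any LangChain or OpenAI client is created, keeps the keys out of the source code.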

With these environment variables configured, you can monitor latency, usage, and other performance indicators in the LangSmith dashboard.

Latest commit: 7d915e7 by vasilisvyth, "fix readme", Nov 14, 2024 (12 commits)

Name	Last commit message	Last commit date
.dvc	dvc init	Nov 14, 2024
configs	rag pipeline	Oct 28, 2024
data	change dvc for new questions	Nov 14, 2024
.dvcignore	dvc init	Nov 14, 2024
.gitignore	rag pipeline	Oct 28, 2024
README.md	fix readme	Nov 14, 2024
chain_utils.py	rag pipeline	Oct 28, 2024
eval_utils.py	add docstring for eval_utils	Nov 14, 2024
index_utils.py	rag pipeline	Oct 28, 2024
metrics.py	rag pipeline	Oct 28, 2024
model_utils.py	rag pipeline	Oct 28, 2024
pricing_utils.py	rag pipeline	Oct 28, 2024
retrieval_utils.py	rag pipeline	Oct 28, 2024
run.py	fix run.py	Nov 14, 2024
split_text.py	rag pipeline	Oct 28, 2024

vasilisvyth/rag

Folders and files

Latest commit

History

Repository files navigation

Chunking

Retrieval Stage

Evaluation

Set Environment Variables:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages