📚 LLM Program Evaluation Critique + Local-RAG

Evaluation-ai with UI: a local-rag UI with feature improvements, specialized in assessing program evaluation reports.

Authors: Henry Luan, Jesse Mendoza, Amin Kolahan

Demo GIF

Setup for the Streamlit App

  1. Follow the setup process of Local-RAG.
  2. Set up a conda environment named conda-env-rag-3-10-14 using conda-env-rag-3-10-14.yaml and modify run_app.py to launch the application without using the command line.
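
A minimal sketch of what run_app.py can look like, assuming the Streamlit entry point is main.py (adjust the script name to match your checkout); the conda environment must be created first:

    # run_app.py -- illustrative launcher; the repo's actual script may differ.
    # Prerequisite (once): conda env create -f conda-env-rag-3-10-14.yaml
    import subprocess
    import sys

    def main():
        # Invoke Streamlit through the current interpreter so the app starts
        # without typing "streamlit run main.py" by hand.
        subprocess.run(
            [sys.executable, "-m", "streamlit", "run", "main.py"],
            check=True,
        )

    if __name__ == "__main__":
        main()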

Feature Changes Implemented

  • Preload Evaluation Reports: Updated local_files.py to read files directly from the cleaned-evaluation-reports folder (see the first sketch after this list).
  • Prompt Template: Created a critique template for program evaluation reports in utils.ollama, based on the Program Evaluation Checklist (an illustrative shape follows this list).
  • Run App Script: Added run_app.py to launch the Streamlit app without requiring multiple command-line inputs.
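
As an illustration of the preload change, a minimal sketch (the function and variable names here are hypothetical; the real local_files.py differs):

    # Hypothetical sketch: gather every report bundled in the repo at startup
    # instead of waiting for manual uploads.
    from pathlib import Path

    REPORTS_DIR = Path("cleaned-evaluation-reports")

    def preload_reports():
        """Return the paths of all files under cleaned-evaluation-reports."""
        return sorted(p for p in REPORTS_DIR.rglob("*") if p.is_file())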
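
For the prompt template, an illustrative shape (the real template in utils.ollama follows the Program Evaluation Checklist; the criteria below are placeholders):

    # Placeholder critique template; swap in the actual checklist criteria.
    CRITIQUE_TEMPLATE = """You are reviewing a program evaluation report.

    Report excerpt:
    {context}

    Critique the report against the checklist, one point at a time:
    1. Are the evaluation questions clearly stated?
    2. Is the methodology appropriate and well documented?
    3. Are the findings supported by the evidence presented?

    User question: {question}
    """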


To-Do List

  • Shorten File Names: Shorten file names because Windows fails to load paths longer than its default 260-character limit.
  • Separate Scraping Service: Make the scraping a separate microservice and store the data in the cloud; merge Jesse's and Amin's scraping pipelines.
  • Cloud Vectorstore: Consider creating a cloud vectorstore to reduce load time.
  • Improve Retrieval Process: Integrate hybrid search and an RRF retriever within the llama_index framework (see the RRF sketch after this list).
    • Consider Solr and Elasticsearch, and apply advanced tuning techniques to optimize search (e.g., dictionaries).
    • Include metadata; the LLM cannot precisely identify uploaded documents (possibly a limitation of the context window or of the retrieval process).
    • Fine-tune the embeddings and the LLM with instructions (source).
  • Enhance LLM Summarization:
    • Apply best practices in prompt engineering.
    • Fix questions the LLM currently struggles with, e.g., "what are the files I uploaded?"
    • Select among prompt templates based on each user question's similarity to the templates (see the routing sketch after this list).
    • Explore best practices for designing and leveraging agents.
  • Quality Measurement: Measure the quality of the retrieval and LLM summarization processes.
  • Human Feedback: Include a human feedback process to improve the LLM.
  • Deployment Optimization: Create a deployment process (cloud or server) that optimizes GPU and RAM usage and reduces latency.
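
The RRF sketch referenced above, a self-contained take on Reciprocal Rank Fusion (the ranked lists and k = 60 are illustrative; llama_index also ships its own fusion retriever):

    # Reciprocal Rank Fusion: each document scores sum(1 / (k + rank)) across
    # the ranked lists that return it; higher totals rank first.
    from collections import defaultdict

    def rrf(result_lists, k=60):
        scores = defaultdict(float)
        for results in result_lists:
            for rank, doc_id in enumerate(results, start=1):
                scores[doc_id] += 1.0 / (k + rank)
        return sorted(scores, key=scores.get, reverse=True)

    # Fuse a keyword (BM25) ranking with a vector-search ranking.
    print(rrf([["doc2", "doc1", "doc3"], ["doc1", "doc4", "doc2"]]))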
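
And the routing sketch referenced above: pick the prompt template closest to the user's question. In practice embedding similarity would replace the token-overlap stand-in used here, and the template texts are placeholders:

    # Route a question to the most similar prompt template. Jaccard overlap on
    # word tokens stands in for embedding cosine similarity.
    import re

    TEMPLATES = {
        "critique": "Critique this program evaluation report against the checklist.",
        "summary": "Summarize the key findings of this evaluation report.",
    }

    def tokens(text):
        return set(re.findall(r"[a-z]+", text.lower()))

    def pick_template(question):
        q = tokens(question)
        def overlap(name):
            t = tokens(TEMPLATES[name])
            return len(q & t) / len(q | t)
        return max(TEMPLATES, key=overlap)

    print(pick_template("Can you summarize the key findings?"))  # -> summary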

What I Tried/Learned

  • S3 Storage Bucket/Cloud Storage: Setting up an S3 bucket from which each Streamlit instance downloads files and makes a local copy is resource-intensive. A proper database is needed to lower cost and reduce latency.

Known Bug

  • Indexing Issue: Local-RAG has a bug where new documents are not indexed after the initial upload; the pipeline reuses the existing index and skips loading, as the log below shows. Refer to issue #61 for more details.

    2024-07-06 21:17:41,319 - rag_pipeline - INFO - Documents are already available; skipping document loading
    2024-07-06 21:17:41,320 - llama_index - INFO - Query Engine created successfully
    
