GitHub - shivammittal274/LLM-CA

LLM For Finance (CA) POC

There are multiple steps involved running LLM on our data having GPTCache involved.

Step-1: Installation:

First install all set of required libraries.

    pip install -r requirements.txt

Step-2: Setting up:

Add your all files (currently supported files are: pdfs/ppts/mp4) under the folder name: Files, add all your files there on which you want to have your LLM based chatbot.

Step3: Ingestion pipeline:

This would scan all your files present in "Files" directory, extract our content from pdfs/ppts/mp4 files, and save the content in documents.npy files and create vectorstore.pkl file (which is actually used by LLM for getting similar embedding and returning answer on top of that.)

    python3 ingest.py

Running the demp:

Once your documents.npy and vectorstore.pkl file is saved, you can run app.py file and have a gradio app launched where you can ask your own custom LLM chatbot trained specifically on your data. You would be able to see answer specific to your question along with metric (whether cache was hit or not while answering the question.)

    python3 app.py

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
Readme.md		Readme.md
app.py		app.py
chatWithCache.py		chatWithCache.py
cosine_sim_eval.py		cosine_sim_eval.py
documents.npy		documents.npy
faiss.index		faiss.index
ingest.py		ingest.py
query_data.py		query_data.py
settings.py		settings.py
vectorstore.pkl		vectorstore.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Languages

shivammittal274/LLM-CA

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages