ai-llm-service

A lightweight service to serve AI/LLM needs. All requests are queued as tasks and executed with a retry strategy by Celery worker(s).
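For illustration, the queue-and-retry pattern looks roughly like this. This is a minimal sketch assuming a Redis broker; the task name and retry parameters below are made up, not the service's actual ones.

```python
from celery import Celery

# Hypothetical broker/backend URL; the real service reads this from .env.
celery = Celery(
    "ai_llm_service",
    broker="redis://localhost:6379/0",
    backend="redis://localhost:6379/0",
)
# The worker command in this README listens on the "llm" queue (-Q llm).
celery.conf.task_default_queue = "llm"

def call_llm_provider(prompt: str) -> str:
    # Stand-in for a real call to an LLM provider.
    return f"response to: {prompt}"

@celery.task(bind=True, max_retries=3, default_retry_delay=10)
def run_llm_query(self, prompt: str) -> str:
    try:
        return call_llm_provider(prompt)
    except Exception as exc:
        # Re-queue the task; Celery waits default_retry_delay seconds
        # between attempts and gives up after max_retries.
        raise self.retry(exc=exc)
```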

UV package manager

The project uses uv as its package manager. You will need to install it on your machine.

UV can be installed system-wide using cURL on macOS and Linux:

curl -LsSf https://astral.sh/uv/install.sh | sudo sh

And with PowerShell on Windows (make sure you run PowerShell with administrator privileges):

powershell -ExecutionPolicy ByPass -c "irm https://astral.sh/uv/install.ps1 | iex"

UV is available via Homebrew as well:

brew install uv

Docker setup

  1. Clone the repository:
git clone https://github.com/DalgoT4D/ai-llm-service.git
  2. Navigate to the project directory:
cd ai-llm-service
  3. Start the services:
docker compose up --build

Bare setup

To run the ai-llm-service project, follow these steps:

  1. Clone the repository:
git clone https://github.com/DalgoT4D/ai-llm-service.git
  2. Navigate to the project directory:
cd ai-llm-service
  3. Install the required dependencies:
uv sync
  4. Set up your .env file (make sure you have a Redis server running):
cp .env.example .env

Update the relevant fields in .env
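For reference, the service would typically read these settings from .env at startup along the lines below. The variable names here (REDIS_URL, OPENAI_API_KEY) are assumptions for illustration; check .env.example for the keys the service actually uses.

```python
import os

from dotenv import load_dotenv  # package name: python-dotenv

load_dotenv()  # reads key=value pairs from .env into the environment

# Assumed keys for illustration only; see .env.example for the real ones.
REDIS_URL = os.getenv("REDIS_URL", "redis://localhost:6379/0")
OPENAI_API_KEY = os.environ["OPENAI_API_KEY"]  # fail fast if missing
```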

  5. Start the Celery worker(s):
uv run celery -A main.celery worker -n llm -Q llm --loglevel=INFO
  6. Monitor your Celery tasks and queues using Flower:
uv run celery -A main.celery flower --port=5555

The dashboard will be available at http://localhost:5555

  7. Start the FastAPI server:
uv run main.py

UV package management

  1. To add a new package using uv:
uv add <package_name>
  2. To remove a package using uv:
uv remove <package_name>

Features supported

File search

Currently the service supports OpenAI's file search, but it can easily be extended to other providers. The request/response flow is as follows (a sketch of a possible client appears after the list):

  1. Client uploads a file (to query against) to the service.

  2. Client uses the file_path from step 1 to query. Note that the client must provide a system_prompt or an assistant_prompt. Client can run multiple queries here.

  3. Client polls for the response until the job/task reaches a terminal state.

  4. Client gets the result along with a session_id, and can either continue querying the same file or close the session.
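As referenced above, here is a sketch of what a client driving this flow might look like. The endpoint paths, payload fields, and state names are assumptions for illustration; the actual contract is in the API docs linked below.

```python
# Hypothetical client for the file-search flow; endpoint paths and
# payload fields are assumed, not taken from the actual API.
import time

import requests

BASE = "http://127.0.0.1:7001"

# 1. Upload the file to query against.
with open("report.pdf", "rb") as f:
    upload = requests.post(f"{BASE}/file/upload", files={"file": f}).json()
file_path = upload["file_path"]

# 2. Submit a query with a system prompt; this enqueues a Celery task.
task = requests.post(f"{BASE}/file/query", json={
    "file_path": file_path,
    "queries": ["Summarize the key findings."],
    "system_prompt": "You are a helpful analyst.",
}).json()

# 3. Poll until the task reaches a terminal state.
while True:
    status = requests.get(f"{BASE}/task/{task['task_id']}").json()
    if status["state"] in ("SUCCESS", "FAILURE"):
        break
    time.sleep(2)

# 4. The result carries a session_id for follow-up queries on the same file.
print(status.get("result"), status.get("session_id"))
```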

API

API documentation can be found at https://llm.projecttech4dev.org/docs

Local docs can be found at http://127.0.0.1:7001/docs
