The Daily 🕸 Bugle

I'm building a comic nerd app that answers my nerdy questions about superheroes. This keeps track of all the superhero stuff going on in the world. You can ask trivia things including coolest easter eggs. I am also reading about LLMOps and AI Engineering.

My project map:

~~Setup and installations~~
Collect data
Fine-tune an open source llm
Vector db
Query the system with prompts
Create a simple ui

Index

Data Collection
Feature Engineering
Training/Finetuning a LLM
Inference Service
Monitoring
UI/UX

Setup

Environment

I am using conda for creating environments.

conda create -n comic python=3.11

conda activate comic

I am using poetry for package management. I will use uv in production because its very fast.

cd comic

poetry init

poetry add numpy pandas

Dependencies

Instead of using Makefile for simple automation, i'm using poe the poet plugin. I can just add the scripts in the pyproject.toml file and execute them. It works well with poetry.

I'm using ZenML as an orchestrator to manage my pipelines. It glues multiple @steps with a @pipeline. There are other orchestrators like Airflow, Prefect, Argo, Kubeflow that are popular.

Track experiments using CometML and prompts using Opik.

MongoDB for storing scraped data and Qdrant for storing vector representations.

Finally I'm using AWS Sagemaker for training as i'm more familiar with it. You can use AWS Bedrock as well and you don't have to manage infra.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The Daily 🕸 Bugle

Index

Setup

Environment

Dependencies

Data Collection Pipeline

About

Releases

Packages

License

aniket-mish/the-daily-bugle

Folders and files

Latest commit

History

Repository files navigation

The Daily 🕸 Bugle

Index

Setup

Environment

Dependencies

Data Collection Pipeline

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages