Efficiently deploy a machine learning model using containerized environments with data versioning, training, and live inference capabilities. Includes reproducible pipelines, monitoring, logging, and infrastructure automation to ensure scalable and robust performance for ML applications.
This repository contains the code and scripts to train, deploy, and serve a simple neural network model for the MNIST dataset. The solution is designed to be reproducible, scalable, and efficient, with infrastructure automation, monitoring, and logging.
The environment is set up using Docker to ensure reproducibility across different platforms.
- Clone this repository:

  ```bash
  git clone git@github.com:Cpicon/e2e_ml_application.git
  cd e2e_ml_application
  ```
- Python 3.10
- Docker
- Poetry (Python dependency management tool)
- **Create a Virtual Environment:**
  - First, create a virtual environment using Python 3.10:

    ```bash
    python3.10 -m venv venv
    source venv/bin/activate
    ```
- **Install Poetry:**
  - Install Poetry within the virtual environment:

    ```bash
    pip install poetry
    ```
- **Install Project Dependencies:**
  - Use Poetry to install all the required packages:

    ```bash
    poetry install
    ```
- **Set Environment Variables:**
  - Export the necessary environment variables for AWS and MLflow:

    ```bash
    export AWS_ACCESS_KEY_ID="awsaccesskey"
    export AWS_SECRET_ACCESS_KEY="awssecretkey"
    export LOCAL_MLFLOW_S3_ENDPOINT_URL="http://localhost:9000"
    ```
- **Build Docker Images:**
  - Use `make` to build the Docker images:

    ```bash
    make build
    ```
- **Run Docker Containers:**
  - After building the images, start the services using:

    ```bash
    cp deployment/dev/local.env deployment/dev/.env
    make run
    ```
- **Stop Docker Containers:**
  - To stop the services, use:

    ```bash
    make stop
    ```
- **Remove Docker Containers:**
  - To remove the containers, use:

    ```bash
    make clean
    ```
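The `make` targets above are typically thin wrappers around Docker Compose. As a rough sketch only (the compose file path, env-file location, and flags here are assumptions, not the repository's actual Makefile):

```makefile
# Hypothetical sketch -- paths and flags are assumed, not from the repo.
COMPOSE = docker compose -f deployment/dev/docker-compose.yml --env-file deployment/dev/.env

build:        # build all service images
	$(COMPOSE) build

run:          # start the full stack in the background
	$(COMPOSE) up -d

stop:         # stop containers without removing them
	$(COMPOSE) stop

clean:        # remove containers (and their anonymous volumes)
	$(COMPOSE) down
```

Wrapping Compose in `make` keeps the long `-f`/`--env-file` invocation out of day-to-day commands.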
- **Dagster (Pipeline Orchestrator):**
  - Runs on http://localhost:3000/.
  - Navigate to the "Overview/Jobs" section to view and manage your pipelines.
- **MLflow (Experiment Tracking and Model Registry):**
  - Available at http://localhost:5005.
  - Use MLflow to track your experiments, register models, and manage model versions.
- **Minio (Object and Model Storage):**
  - Runs on http://localhost:9001.
  - Minio serves as the object storage solution for the models and data.
- **MLServer (Model Deployment Service, HTTP Backend):**
  - Accessible at http://localhost:9595.
  - Use MLServer for deploying machine learning models with a RESTful API backend.
  - To test the deployed model, navigate to the root project folder and run:

    ```bash
    python model_query_example.py
    ```
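`model_query_example.py` is the repository's own client script. As an illustrative sketch of what such a query looks like against MLServer's V2 (Open Inference protocol) endpoint: the model name `mnist`, the input tensor name `input-0`, and the flattened `[1, 784]` input shape below are assumptions, not taken from the repository.

```python
import json
import urllib.request

def build_infer_payload(pixels):
    """Build a V2 (Open Inference protocol) request body for one
    flattened 28x28 grayscale image sent as FP32 data."""
    return {
        "inputs": [
            {
                "name": "input-0",   # input tensor name (assumed)
                "shape": [1, 784],   # one flattened 28x28 image (assumed)
                "datatype": "FP32",
                "data": pixels,
            }
        ]
    }

def query_model(pixels, model_name="mnist", host="http://localhost:9595"):
    """POST the payload to MLServer's V2 inference endpoint."""
    url = f"{host}/v2/models/{model_name}/infer"
    body = json.dumps(build_infer_payload(pixels)).encode()
    req = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())

if __name__ == "__main__":
    # Build (but do not send) a payload for an all-zero dummy image.
    payload = build_infer_payload([0.0] * 784)
    print(len(payload["inputs"][0]["data"]))
```

The response follows the same protocol: an `outputs` list whose `data` field holds the model's predictions.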
The architecture of this project is centered around several key components orchestrated using Docker containers. Below is a high-level overview of the services involved:
- **Purpose:** Dagster is used as the pipeline orchestrator for managing and executing data pipelines.
- **Components:**
  - **Dagster PostgreSQL:** Runs a PostgreSQL database for storing Dagster's run storage, schedule storage, and event logs.
  - **Dagster User Code:** Runs the gRPC server that loads your user code, enabling Dagster to execute pipelines. It is configured to use the same image when launching runs in new containers.
  - **Dagster Webserver:** Provides a web interface for interacting with Dagster, where you can view and manage pipelines.
  - **Dagster Daemon:** Takes runs off the queue and launches them, and handles schedules and sensors.
- **Purpose:** MLflow is used for tracking experiments, registering models, and managing model versions.
- **Components:**
  - **MLflow PostgreSQL:** A PostgreSQL database for storing MLflow tracking metadata.
  - **MLflow Tracking Server:** Hosts the MLflow server for tracking experiments and managing models.
- **Purpose:** Minio is used as an object storage solution for storing datasets, models, and other artifacts.
- **Components:**
  - **Minio Server:** A high-performance object storage server.
  - **Minio Client (mc):** A command-line tool for interacting with Minio.
- **Purpose:** MLServer is used for deploying machine learning models via a RESTful API.
- **Components:**
  - **MLServer:** The main service responsible for serving the machine learning models.
- **Reference:** MLServer Documentation
- **Networks:** All services are connected through a `project_network` to facilitate communication between containers.
- **Volumes:** Persistent storage is managed using Docker volumes, ensuring that data persists across container restarts.
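As a hypothetical sketch of how such a shared network and named volumes look in Docker Compose (the service names, images, and volume names here are assumptions, not the repository's actual Compose file):

```yaml
# Illustrative fragment only -- services, images, and volume names assumed.
services:
  mlflow:
    image: ghcr.io/mlflow/mlflow
    networks:
      - project_network
  mlflow-postgres:
    image: postgres:15
    networks:
      - project_network
    volumes:
      # Named volume: the database survives container restarts and removal.
      - mlflow_db_data:/var/lib/postgresql/data

networks:
  project_network:

volumes:
  mlflow_db_data:
```

Every service joins `project_network`, so containers reach each other by service name (e.g. `mlflow-postgres:5432`) rather than by IP.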
- **`deployment/`**
  - Contains Docker-related configurations and deployment scripts.
  - `dev/`: Development-specific Dockerfiles, Docker Compose files, and configuration files for setting up the environment.
  - `prod/`: The production-specific Docker Compose file.
  - `stage/`: The staging-specific Docker Compose file.
- **`e2eML/`**
  - The main package directory for the project. It contains submodules for the different stages of the machine learning lifecycle.
  - `clients/`: Code for interacting with external services such as MLServer.
  - `evaluate/`: Evaluation of trained machine learning models.
  - `inference/`: Scripts for making predictions with trained models.
  - `ingest/`: Data ingestion processes, such as loading datasets.
  - `models/`: Machine learning model definitions.
  - `orchestrator/`: Dagster pipeline orchestration code.
  - `pipeline_configs/`: Configuration files for the various pipelines.
  - `train/`: Scripts and modules for training machine learning models.