NBA Match Predictor V5

A production-ready machine learning system that predicts NBA game outcomes. It features a full pipeline: Data Collection -> Training (HistGradientBoosting) -> Live Inference -> React Frontend -> Daily Automation.

Features

Advanced Modeling: Uses HistGradientBoostingClassifier with "Matchup Merge" architecture (comparing Team Form vs Opponent Form).
Live Predictions: Fetches today's games via nba_api, processes stats in real-time, and predicts winners with confidence scores.
Automated Pipeline: A GitHub Action (.github/workflows/daily_prediction.yml) runs every morning at 6 AM ET to generate new predictions.
React Frontend: A clean UI to view today's games and the model's picks.

Performance

Model: HistGradientBoostingClassifier
Key Features: 50 selected predictors including Rolling Advanced Stats (orb%, drtg, etc.) and "Matchup" differentials.

Tech Stack

ML/Backend: Python, scikit-learn, pandas, numpy, nba_api
Frontend: React.js, CSS Modules
CI/CD: GitHub Actions (Daily Cron Job)

Project Structure

nba-match-predictor/
├── predictors/
│   └── predictor_v5.ipynb      # Main training notebook (Analysis & Retraining)
├── models/
│   └── hist_gbm_v5/            # Serialized Model, Scaler, and Predictor list
├── scripts/
│   └── predict_v5.py           # PRODUCTION SCRIPT: Generates today's predictions
├── frontend/                   # React Application
│   ├── public/data/            # Contains schedule and generated predictions.json
│   └── src/                    # Frontend source
├── data/                       # Raw training data (gitignored)
└── .github/workflows/          # Automation configuration

How to Run

1. Generate Live Predictions

To run the prediction system locally:

pip install -r requirements.txt
python scripts/predict_v5.py

This will fetch today's games and save the results to frontend/public/data/predictions.json.

2. Run the Frontend

cd frontend
npm install
npm start

Open http://localhost:3000 to see the dashboard.

3. Retrain the Model

Open predictors/predictor_v5.ipynb in Jupyter. This notebook contains the full pipeline to:

Load data/nba_games_raw.csv
Clean and Compute Rolling Averages
Train the HistGradientBoosting model
Save artifacts to models/

Automation

The project is configured to run automatically via GitHub Actions.

Schedule: Every day at 11:00 UTC (6:00 AM ET).
Action: Runs predict_v5.py, commits the new predictions.json, and pushes to the repo.
Deploy: Vercel (linked to the repo) automatically deploys the updated frontend.

License

MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
.github/workflows		.github/workflows
data		data
frontend		frontend
models		models
predictors		predictors
scripts		scripts
training		training
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
automation_log.txt		automation_log.txt
convert_data.py		convert_data.py
requirements.txt		requirements.txt
run_daily_predictions.bat		run_daily_predictions.bat
task_scheduler_config.xml		task_scheduler_config.xml
test_prediction.py		test_prediction.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NBA Match Predictor V5

Features

Performance

Tech Stack

Project Structure

How to Run

1. Generate Live Predictions

2. Run the Frontend

3. Retrain the Model

Automation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NBA Match Predictor V5

Features

Performance

Tech Stack

Project Structure

How to Run

1. Generate Live Predictions

2. Run the Frontend

3. Retrain the Model

Automation

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages