AI Malware Detection

Real-time malware classification powered by machine learning — trained on 1M+ Windows PE files

Overview

AI Malware Detection is a full-stack application that scans Windows executables (.exe / .dll) and classifies them as benign or malicious using a model trained on the EMBER dataset (1 million+ real-world PE files). The system extracts 2,381 PE characteristics via the LIEF library, runs them through a LightGBM binary classifier, and returns a confidence-scored verdict in 2-3 seconds.

Benign Result	Malware Result
GREEN	RED
Confidence score + SHA-256 hash	Confidence score + SHA-256 hash

Features

ML-Powered Classification — EMBER-trained LightGBM model extracting 2,381 PE features via LIEF
Modern Glassmorphic UI — animated interface with color-coded results (green = safe, red = malware)
FastAPI Backend — lightweight Python API with /scan and /health endpoints
Confidence Scoring — 0-100 % confidence on every scan with SHA-256 hash and file size
CI/CD Pipeline — GitHub Actions testing across Python 3.9, 3.10, 3.11 with flake8 linting

Tech Stack

Layer	Technology
Backend	Python 3.9+, FastAPI, Uvicorn
ML Model	LightGBM, EMBER dataset, LIEF, scikit-learn
Frontend	HTML5, CSS3 (glassmorphism + animations), JavaScript ES6+
CI/CD	GitHub Actions
Deep Learning (backup)	TensorFlow, MalConv neural network

Project Structure

AI-Malware-Detection/
├── app.py                      # FastAPI backend — scan endpoint + EMBER model loading
├── ui/
│   ├── index.html              # Web interface
│   ├── app.js                  # Upload logic & result rendering
│   └── styles.css              # Glassmorphic animations & theming
├── ember/                      # EMBER feature extraction module
├── malconv/                    # MalConv neural network model (backup)
├── scripts/                    # Utility scripts
├── start.sh                    # Launch backend + frontend servers
├── stop.sh                     # Graceful shutdown
├── requirements.txt            # Python dependencies
├── .github/workflows/          # CI pipeline (lint + test)
└── licenses/                   # MIT + AGPL-3.0

Getting Started

Prerequisites

Python 3.9, 3.10, or 3.11
pip

Installation

git clone https://github.com/vuvic/AI-Malware-Detection.git
cd AI-Malware-Detection
pip install -r requirements.txt

Run

bash start.sh

Service	URL
Backend API	`http://localhost:8000`
Web UI	`http://localhost:8080`

# Stop everything
bash stop.sh

API Reference

`POST /scan`

Upload a PE file for classification.

// Response
{
  "filename": "test.exe",
  "verdict": "BENIGN",        // or "MALWARE"
  "confidence": 95.5,
  "sha256": "abc123...",
  "file_size_kb": 48.83,
  "message": "This file appears safe"
}

`GET /health`

{ "status": "healthy", "service": "AI Malware Scanner", "version": "1.0.0" }

Performance

Metric	Value
Scan Speed	2-3 s per file
Startup Memory	~200 MB
CPU During Scan	~5-10 %
Supported Formats	`.exe`, `.dll` (Windows PE)
Max Upload Size	100 MB

Future Enhancements

Scan history database
User authentication
Analytics dashboard
Docker containerization
Batch scanning API

References

Author

Aman Imran — GitHub · LinkedIn · Portfolio

B.S. Software Engineering — San Jose State University

_{Dual licensed under MIT and AGPL-3.0 — see licenses/}

Name		Name	Last commit message	Last commit date
Latest commit History 104 Commits
.github/workflows		.github/workflows
ember		ember
licenses		licenses
malconv		malconv
resources		resources
scripts		scripts
ui		ui
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE.txt		LICENSE.txt
README.md		README.md
README_SUBMISSION.md		README_SUBMISSION.md
app.py		app.py
requirements.txt		requirements.txt
requirements_conda.txt		requirements_conda.txt
requirements_notebook.txt		requirements_notebook.txt
setup.py		setup.py
start.sh		start.sh
stop.sh		stop.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Malware Detection

Overview

Features

Tech Stack

Project Structure

Getting Started

Prerequisites

Installation

Run

API Reference

`POST /scan`

`GET /health`

Performance

Future Enhancements

References

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Malware Detection

Overview

Features

Tech Stack

Project Structure

Getting Started

Prerequisites

Installation

Run

API Reference

POST /scan

GET /health

Performance

Future Enhancements

References

Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`POST /scan`

`GET /health`

Packages