Bombsquig is an LLM inference engine built in Rust, designed for Apple Silicon Macs.
💣 Bombsquig is short for Because I Only have a MacBook, Super Qrabby Uber-cool Inference enGine!
Bombsquig is not intended to be a production-grade LLM inference engine. The project is my personal deep dive into the LLM inference stack, built by implementing each layer from scratch.
From handwritten kernels to tensor operations to the transformer itself, each level of abstraction is deliberately implemented in-house, so that every component of LLM inference, along with its performance characteristics and runtime challenges, is explicitly understood.
Existing runtimes solve inference at scale. Bombsquig exists (for me) to understand how they work.
Apple Silicon Macs (M1, M2, etc.), or any AArch64 machine that supports Apple Metal.
Bombsquig is actively being developed, with the following features:
- Built in Rust for safety and performance
- Uses NEON vectorization for tensor operations on CPU
- Uses Apple Metal for tensor operations on GPU
- Naive KV Cache for autoregressive decoding
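To give a flavor of the handwritten CPU kernels mentioned above, here is a minimal sketch of a NEON-vectorized dot product with a scalar fallback. The function name and structure are illustrative, not Bombsquig's actual API; the intrinsics are the standard `std::arch::aarch64` ones.

```rust
// Illustrative sketch (not Bombsquig's real code): a dot product using
// NEON intrinsics on AArch64, with a portable scalar fallback elsewhere.

#[cfg(target_arch = "aarch64")]
fn dot(a: &[f32], b: &[f32]) -> f32 {
    use std::arch::aarch64::*;
    assert_eq!(a.len(), b.len());
    let chunks = a.len() / 4;
    // SAFETY: NEON is mandatory on AArch64, and all loads stay in bounds.
    let mut acc = unsafe { vdupq_n_f32(0.0) };
    for i in 0..chunks {
        unsafe {
            let va = vld1q_f32(a.as_ptr().add(i * 4));
            let vb = vld1q_f32(b.as_ptr().add(i * 4));
            acc = vfmaq_f32(acc, va, vb); // fused multiply-add over 4 lanes
        }
    }
    let mut sum = unsafe { vaddvq_f32(acc) }; // horizontal sum of the 4 lanes
    for i in chunks * 4..a.len() {
        sum += a[i] * b[i]; // scalar tail for leftover elements
    }
    sum
}

#[cfg(not(target_arch = "aarch64"))]
fn dot(a: &[f32], b: &[f32]) -> f32 {
    a.iter().zip(b).map(|(x, y)| x * y).sum()
}

fn main() {
    let a = vec![1.0f32; 8];
    let b: Vec<f32> = (0..8).map(|i| i as f32).collect();
    println!("{}", dot(&a, &b)); // 0 + 1 + ... + 7 = 28
}
```

Processing four `f32` lanes per instruction with a fused multiply-accumulate is the basic pattern behind NEON-accelerated matmul and attention kernels.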
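The naive KV cache in the list above can be sketched as follows: key and value projections are appended once per decoded token, and attention re-reads the full history each step. The type and method names here are hypothetical, not Bombsquig's actual ones.

```rust
// Illustrative sketch (not Bombsquig's real code): a naive KV cache that
// grows by one key/value pair per autoregressive decode step.

struct NaiveKvCache {
    keys: Vec<Vec<f32>>,   // one key vector per cached token position
    values: Vec<Vec<f32>>, // one value vector per cached token position
}

impl NaiveKvCache {
    fn new() -> Self {
        Self { keys: Vec::new(), values: Vec::new() }
    }

    // Append the key/value projections for the newest token.
    fn append(&mut self, k: Vec<f32>, v: Vec<f32>) {
        self.keys.push(k);
        self.values.push(v);
    }

    // Number of cached positions attention will read at the next step.
    fn len(&self) -> usize {
        self.keys.len()
    }
}

fn main() {
    let mut cache = NaiveKvCache::new();
    // Each decode step adds exactly one K/V pair, so attention at step t
    // sees t + 1 cached positions instead of re-projecting the whole prompt.
    for t in 0..3 {
        cache.append(vec![t as f32; 4], vec![t as f32; 4]);
    }
    println!("{}", cache.len()); // 3 tokens cached
}
```

This "keep everything, append forever" design is what makes the cache naive: memory grows linearly with sequence length, which is exactly what the paged-attention item in the roadmap below would address.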
As of now, Bombsquig only supports the Phi-3 mini model (3.8B parameters), but aims to be extensible to other models in the future.
Future features may include:
- Support for more models (LLaMA-style, DeepSeek, etc.)
- Quantization support (8-bit, 4-bit, etc.)
- Flash Attention
- Advanced KV Cache management (e.g., paged attention)
- In-house tokenizer implementation
- Batching optimization