Whisper Equinox

A port of 🤗's Whisper implementation to Equinox.

Outline

For this project, I wanted to take up uv, so it's built around it. So far, all of this has only been tested on CPU, so for TPU you may need to uv add the TPU versions of JAX.

As for the project structure (this could do with some cleanup):

run_e2e.py runs the equinox and HF models end-to-end (i.e., consumes audio and produces text) and verifies that the outputs match up.
verify.py is primarily there to test whether the equinox port is correct vs. the HF model. It compares the last_hidden_state, as that's more convenient.

This should be your first port of call when debugging any differences in the implementation. As a bonus, you also get some statistics + a histogram of the deviations.

modelling.py is the actual equinox port, the analogue of modeling_whisper.py used in the HF implementation and WhisperJAX.

Tests are pretty barebones, and exist for the encoder and decoder separately.

A lot of optimizations (like KV caching) haven't been written yet, so this is more of a first-pass port which adds JAX-friendly static-ness and relies on XLA for performance.

There's a lot of room for further speedups IMO.
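To make the verify.py idea above concrete, here is a minimal sketch of how deviation statistics and a histogram between two last_hidden_state arrays could be computed. It is not the code in verify.py: the names deviation_report and print_histogram are hypothetical, NumPy is used for brevity, and the arrays in the demo are random stand-ins rather than real model outputs.

```python
import numpy as np


def deviation_report(reference, candidate, bins=10):
    """Summarize element-wise absolute deviations between two arrays.

    Hypothetical helper for illustration; not taken from verify.py.
    """
    reference = np.asarray(reference, dtype=np.float64)
    candidate = np.asarray(candidate, dtype=np.float64)
    if reference.shape != candidate.shape:
        raise ValueError(f"shape mismatch: {reference.shape} vs {candidate.shape}")

    abs_diff = np.abs(reference - candidate).ravel()
    counts, edges = np.histogram(abs_diff, bins=bins)
    return {
        "max": float(abs_diff.max()),
        "mean": float(abs_diff.mean()),
        "median": float(np.median(abs_diff)),
        "counts": counts,
        "edges": edges,
    }


def print_histogram(counts, edges, width=40):
    """Print a text histogram, one row per bin."""
    peak = max(int(counts.max()), 1)
    for count, lo, hi in zip(counts, edges[:-1], edges[1:]):
        bar = "#" * int(width * int(count) / peak)
        print(f"[{lo:.2e}, {hi:.2e}] {int(count):>8d} {bar}")


if __name__ == "__main__":
    # Stand-in data shaped like (batch, frames, hidden); assumed shapes only.
    rng = np.random.default_rng(0)
    hf_hidden = rng.standard_normal((1, 8, 16))
    eqx_hidden = hf_hidden + 1e-6 * rng.standard_normal(hf_hidden.shape)

    report = deviation_report(hf_hidden, eqx_hidden)
    print(f"max={report['max']:.3e} mean={report['mean']:.3e} "
          f"median={report['median']:.3e}")
    print_histogram(report["counts"], report["edges"])
```

A tight histogram clustered near zero suggests the port matches up to floating-point noise, while a long tail usually points at a specific layer worth inspecting.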

Useful commands

Prefix any (Python) command with uv like so:

uv run --env-file=.env <command>

Run tests: Ensure you're at the project's root and run either:

uv run --env-file=.env pytest -s ./tests/test_encoder.py
uv run --env-file=.env pytest -s ./tests/test_decoder.py

To verify that the model outputs match up (e2e verification):

uv run --env-file=.env python3 ./src/verify.py

If --env-file feels cumbersome, you could just export its contents instead. I kept it in case we add more envvars later (tokens and whatnot).

