diff --git a/CHANGELOG.md b/CHANGELOG.md index 475f72f..92ecb6e 100644 --- a/CHANGELOG.md +++ b/CHANGELOG.md @@ -7,7 +7,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0 ## [Unreleased] -## [0.1.0] - 2026-02-27 +## [0.1.0] - 2026-03-01 ### Added @@ -28,5 +28,12 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0 - Scratch space management with automatic cleanup of intermediate files - GPU performance-based ordering via `gpu_tflops.toml` lookup table +### Fixed + +- WhisperX model loading: pinned install to commit `741ab9a` (v3.7.6) — the `v3.1.1` tag uses an older API (`transcribe.py`) that lacks the `device_index` parameter required by the ctranslate2 backend +- `make install-whisperx` now also installs WhisperX runtime deps (`av`, `ctranslate2`, `faster-whisper`, `flatbuffers`, `nltk`, `onnxruntime`) which were previously missing after a `--no-deps` install +- `transformers` capped at `<4.40.0` — versions 4.40+ use `torch.utils._pytree.register_pytree_node`, an API introduced in PyTorch 2.2, which breaks with the pinned PyTorch 2.1.2 +- `make dev-setup` now reinstalls CUDA torch wheels (`torch==2.1.2+cu121`, `torchaudio==2.1.2+cu121`) as its final step — `uv sync` resolves torch from PyPI and installs the CPU-only build, silently breaking GPU inference + [Unreleased]: https://github.com/LunarCommand/audio-refinery/compare/v0.1.0...HEAD [0.1.0]: https://github.com/LunarCommand/audio-refinery/releases/tag/v0.1.0