What's Changed
diarize 0.1.2 focuses on diarization quality, reproducible benchmarks, and clearer accuracy documentation.
Improvements
- Reduced short speaker label switching with temporal smoothing during diarization assembly.
- Improved automatic speaker-count selection with silhouette refinement plus a small larger-k prior.
- Added
scripts/benchmark_rttm.pyfor reproducible audio+RTTM benchmark runs across VoxConverse, AMI, and similar datasets.
Benchmarks and Docs
- Updated VoxConverse dev benchmark numbers:
- Weighted DER: ~4.8%
- Speaker count: 125/216 exact, 178/216 within ±1
- Added preliminary AMI Mix-Headset test validation:
- Weighted DER: 14.96%
- Speaker count: 4/16 exact, 8/16 within ±1
- Documented known limitations around speaker-count errors and speaker label fragmentation.
- Added a Changelog page to the documentation.
Package
- Synced package metadata and runtime
diarize.__version__to0.1.2.