Skip to content

v0.1.2

Latest

Choose a tag to compare

@loookashow loookashow released this 06 May 10:03
4f25d27

What's Changed

diarize 0.1.2 focuses on diarization quality, reproducible benchmarks, and clearer accuracy documentation.

Improvements

  • Reduced short speaker label switching with temporal smoothing during diarization assembly.
  • Improved automatic speaker-count selection with silhouette refinement plus a small larger-k prior.
  • Added scripts/benchmark_rttm.py for reproducible audio+RTTM benchmark runs across VoxConverse, AMI, and similar datasets.

Benchmarks and Docs

  • Updated VoxConverse dev benchmark numbers:
    • Weighted DER: ~4.8%
    • Speaker count: 125/216 exact, 178/216 within ±1
  • Added preliminary AMI Mix-Headset test validation:
    • Weighted DER: 14.96%
    • Speaker count: 4/16 exact, 8/16 within ±1
  • Documented known limitations around speaker-count errors and speaker label fragmentation.
  • Added a Changelog page to the documentation.

Package

  • Synced package metadata and runtime diarize.__version__ to 0.1.2.