pyannote.metrics: A Toolkit for Reproducible Evaluation, Diagnostic, and Error Analysis of Speaker Diarization Systems

Hervé Bredin


pyannote.metrics is an open-source Python library aimed at researchers working in the broad area of speaker diarization. It provides a command line interface (CLI) to improve the reproducibility and comparability of speaker diarization research results. Through its application programming interface (API), a large set of evaluation metrics is available for diagnosing every module of a typical speaker diarization pipeline (speech activity detection, speaker change detection, clustering, and identification). Finally, thanks to its visualization capabilities, we show that it can also be used for detailed error analysis. pyannote.metrics can be downloaded from http://pyannote.github.io.
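To make concrete the kind of metric the abstract refers to, the following is a minimal, self-contained sketch of a frame-based diarization error rate. It does not use pyannote.metrics and is not the library's implementation: the function names (`frame_labels`, `diarization_error_rate`), the `(start, end, speaker)` tuple layout, and the 10 ms frame step are all assumptions made for illustration. It also assumes no overlapping speech and finds the speaker mapping by brute force, which is only practical for a handful of speakers.

```python
from itertools import permutations


def frame_labels(segments, duration, step=0.01):
    """Convert (start, end, speaker) segments into one label per frame.

    Frames not covered by any segment are labelled None (non-speech).
    Assumption: segments of a given annotation do not overlap.
    """
    n_frames = round(duration / step)
    labels = [None] * n_frames
    for start, end, speaker in segments:
        first = round(start / step)
        last = min(n_frames, round(end / step))
        for i in range(first, last):
            labels[i] = speaker
    return labels


def diarization_error_rate(reference, hypothesis, duration, step=0.01):
    """Illustrative DER: (missed + false alarm + confusion) / reference speech."""
    ref = frame_labels(reference, duration, step)
    hyp = frame_labels(hypothesis, duration, step)

    total = sum(r is not None for r in ref)
    if total == 0:
        raise ValueError("reference contains no speech")

    pairs = list(zip(ref, hyp))
    missed = sum(r is not None and h is None for r, h in pairs)
    false_alarm = sum(r is None and h is not None for r, h in pairs)
    both = [(r, h) for r, h in pairs if r is not None and h is not None]

    ref_speakers = sorted({r for r in ref if r is not None})
    hyp_speakers = sorted({h for h in hyp if h is not None})
    # Pad with None so every reference speaker can be mapped to "nobody".
    padding = max(0, len(ref_speakers) - len(hyp_speakers))
    candidates = hyp_speakers + [None] * padding

    # Brute-force search for the one-to-one mapping that maximizes
    # the number of correctly attributed frames.
    best_correct = 0
    for perm in permutations(candidates, len(ref_speakers)):
        mapping = dict(zip(ref_speakers, perm))
        correct = sum(mapping[r] == h for r, h in both)
        best_correct = max(best_correct, correct)

    confusion = len(both) - best_correct
    return (missed + false_alarm + confusion) / total


# Hypothetical toy example: two reference speakers over 20 seconds.
reference = [(0, 10, "A"), (10, 20, "B")]
hypothesis = [(0, 12, "s1"), (12, 18, "s2")]
der = diarization_error_rate(reference, hypothesis, duration=20)
print(f"{der:.2f}")  # prints 0.20
```

In this toy example, 2 seconds of speaker B are attributed to the wrong cluster and 2 seconds of speech are missed, out of 20 seconds of reference speech. Reporting the three error components separately, rather than only their sum, is what makes such a metric useful for diagnosis.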


DOI: 10.21437/Interspeech.2017-411

Cite as: Bredin, H. (2017) pyannote.metrics: A Toolkit for Reproducible Evaluation, Diagnostic, and Error Analysis of Speaker Diarization Systems. Proc. Interspeech 2017, 3587-3591, DOI: 10.21437/Interspeech.2017-411.


@inproceedings{Bredin2017,
  author={Hervé Bredin},
  title={pyannote.metrics: A Toolkit for Reproducible Evaluation, Diagnostic, and Error Analysis of Speaker Diarization Systems},
  year=2017,
  booktitle={Proc. Interspeech 2017},
  pages={3587--3591},
  doi={10.21437/Interspeech.2017-411},
  url={http://dx.doi.org/10.21437/Interspeech.2017-411}
}