13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Study on Integration of Speaker Diarization with Speaker Adaptive Speech Recognition for Broadcast Transcription

Jan Silovsky, Petr Cerva, Jindrich Zdansky, Jan Nouza

Institute of Information Technology and Electronics, Faculty of Mechatronics, Technical University of Liberec, Liberec, Czech Republic

In this paper we study the close integration of speaker diarization with speaker-adaptive speech recognition in our broadcast transcription system. We motivate the use of speech transcripts in the diarization process and analyze its effect on diarization performance and computational cost. Further, we evaluate speaker adaptation performed under various scenarios of speaker segmentation and diarization of an audio stream. For better insight, we also evaluate the limit performance by substituting most of the system's components with oracle ones.

Index Terms: Speaker diarization, i-vectors, speaker adaptation, CMLLR, broadcast transcription

Bibliographic reference.  Silovsky, Jan / Cerva, Petr / Zdansky, Jindrich / Nouza, Jan (2012): "Study on integration of speaker diarization with speaker adaptive speech recognition for broadcast transcription", In INTERSPEECH-2012, 478-481.