13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Patrol Team Language Identification System for DARPA RATS P1 Evaluation

Pavel Matějka (1), Oldřich Plchot (1), Mehdi Soufifar (1), Ondřej Glembek (1), Luis Fernando D'Haro (1), Karel Veselý (1), František Grézl (1), Jeff Ma (2), Spyros Matsoukas (2), Najim Dehak (3)

(1) Brno University of Technology, Speech@FIT and IT4I Center of Excellence, Brno, Czech Republic
(2) Raytheon BBN Technologies, Cambridge, MA, USA
(3) MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, USA

This paper describes the language identification (LID) system developed by the Patrol team for the first phase of the DARPA RATS (Robust Automatic Transcription of Speech) program, which seeks to advance state-of-the-art detection capabilities on audio from highly degraded communication channels. We show that techniques originally developed for LID on telephone speech (e.g., for the NIST language recognition evaluations) remain effective on the noisy RATS data, provided that the training and development sets are designed with care. In addition, we show significant improvements from the use of Wiener filtering, neural-network-based i-vectors, language-dependent i-vector modeling, and fusion.

Index Terms: language identification, noisy speech


Bibliographic reference. Matějka, Pavel / Plchot, Oldřich / Soufifar, Mehdi / Glembek, Ondřej / D'Haro, Luis Fernando / Veselý, Karel / Grézl, František / Ma, Jeff / Matsoukas, Spyros / Dehak, Najim (2012): "Patrol team language identification system for DARPA RATS P1 evaluation", In INTERSPEECH-2012, 50-53.