13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

The 2011 NIST Language Recognition Evaluation

Craig S. Greenberg, Alvin F. Martin, Mark A. Przybocki

National Institute of Standards and Technology, Gaithersburg, MD, USA

In 2011, NIST held the most recent in its ongoing series of Language Recognition Evaluations, which began in 1996. The 2011 NIST Language Recognition Evaluation (LRE11) featured 24 languages, nine of them new to the LRE series, drawn from two different source types, and attracted participation from 23 research organizations.
   LRE11 used a new evaluation metric that focused on difficult-to-distinguish language pairs. The most difficult pairs generally fell within clusters of linguistically similar languages; for example, the Hindi/Urdu and Lao/Thai pairs both proved very challenging to distinguish. Pashto and Bengali were found to be confusable with a wide range of languages, and some progress was observed in distinguishing American English from Indian English.
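
The abstract does not define the metric, so the following is only an illustrative sketch of what a pair-focused score could look like, not the formula used in LRE11. It assumes that each language pair has two miss rates (one per language), that a pair's cost is their weighted sum with an assumed prior of 0.5, and that the overall score averages the cost over the hardest pairs. The function names, the prior, the choice of `n_worst`, and all numbers are hypothetical.

```python
def pair_cost(p_miss_a, p_miss_b, prior_a=0.5):
    """Hypothetical pairwise cost: prior-weighted sum of the two miss rates."""
    return prior_a * p_miss_a + (1.0 - prior_a) * p_miss_b


def worst_pairs_average(pair_miss_rates, n_worst):
    """Average the pairwise cost over the n_worst highest-cost pairs.

    pair_miss_rates maps (lang_a, lang_b) -> (p_miss_a, p_miss_b).
    Returns the average cost and the ranked list of the pairs used.
    """
    if n_worst < 1 or not pair_miss_rates:
        raise ValueError("need at least one pair and n_worst >= 1")
    costs = {pair: pair_cost(a, b) for pair, (a, b) in pair_miss_rates.items()}
    # Sort pairs from hardest (highest cost) to easiest.
    ranked = sorted(costs.items(), key=lambda kv: kv[1], reverse=True)
    top = ranked[:n_worst]
    return sum(cost for _, cost in top) / len(top), top


# Invented miss rates for placeholder languages; not evaluation results.
example = {
    ("lang_a", "lang_b"): (0.40, 0.30),
    ("lang_c", "lang_d"): (0.20, 0.30),
    ("lang_e", "lang_f"): (0.10, 0.10),
}
score, hardest = worst_pairs_average(example, n_worst=2)
```

Averaging only the hardest pairs means a system cannot reach a good score by doing well on easy pairs alone, which matches the abstract's emphasis on confusable languages.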

Index Terms: language recognition, language detection, NIST LRE, NIST evaluation

Bibliographic reference. Greenberg, Craig S. / Martin, Alvin F. / Przybocki, Mark A. (2012): "The 2011 NIST language recognition evaluation", in INTERSPEECH-2012, 34-37.