In 2011, NIST held the most recent in an ongoing series of Language Recognition
Evaluations originating in 1996. The 2011 NIST Language Recognition Evaluation
(LRE11) featured 24 languages, including nine languages new to the LRE series,
from two different source types, and had participation from 23 research organizations.
LRE11 utilized a new evaluation metric, which focused on difficult to distinguish language pairs. The most difficult pairs were generally contained within clusters of linguistically similar languages. For example, the Hindi/Urdu pair and the Lao/Thai pair both proved to be very challenging to distinguish. Pashto and Bengali were found to be confusable with a wide range of languages, and some progress was observed in distinguishing American English from Indian English.
Index Terms: language recognition, language detection, NIST LRE, NIST evaluation
Bibliographic reference. Greenberg, Craig S. / Martin, Alvin F. / Przybocki, Mark A. (2012): "The 2011 NIST language recognition evaluation", In INTERSPEECH-2012, 34-37.