4th International Conference on Spoken Language Processing
Philadelphia, PA, USA
The number of languages in the world is much larger than the number of target languages that current language identification systems can handle. Therefore, we propose here the use of a multilayer perceptron neural network as a means to prevent those unknown language inputs from being misidentified as one of the target languages. We consider not only the target language identification rate but also the unknown language rejection rate. Results reveal that with the use of phonemic unigram as the input features to the neural network, a target language identification rate of 93.5% can be achieved for 3 languages. By varying the thresholds of the outputs, good unknown language rejection rate can also be obtained at the expense of lower identification rate.
Bibliographic reference. Kwan, Hingkeung / Hirose, Keikichi (1996): "Unknown language rejection in language identification system", In ICSLP-1996, 1776-1779.