4th International Conference on Spoken Language Processing
Philadelphia, PA, USA
We describe a system designed to identify the language of an utterance spoken by any native speaker over the telephone. The current approach extends our previous work on language identification based on sequences of speech units. To improve performance, we extend this work to allow for inaccurate matches of such sequences. Results are reported for distinguishing between English and German. The strength of this algorithm lies in its ability to generalize from the training set to the test set. We have obtained a statistically derived means of discriminating between languages. Matching sequences inaccurately in a controlled manner allows us to account for variability within a language without sacrificing cross-language discrimination.
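As an illustration of what matching sequences of speech units "inaccurately in a controlled manner" could look like, the following is a minimal sketch and not the method of the paper. It assumes that an utterance is already available as a list of unit labels, that the mismatch is measured by plain Levenshtein edit distance, and that the tolerance is a fixed threshold. The function names `edit_distance` and `count_approx_matches` and the parameter `max_dist` are hypothetical.

```python
from typing import Sequence


def edit_distance(a: Sequence[str], b: Sequence[str]) -> int:
    """Levenshtein distance between two sequences of unit labels.

    Assumption: insertions, deletions and substitutions all cost 1.
    """
    prev = list(range(len(b) + 1))
    for i, unit_a in enumerate(a, start=1):
        curr = [i]
        for j, unit_b in enumerate(b, start=1):
            cost = 0 if unit_a == unit_b else 1
            curr.append(min(prev[j] + 1,          # delete unit_a
                            curr[j - 1] + 1,      # insert unit_b
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def count_approx_matches(utterance: Sequence[str],
                         pattern: Sequence[str],
                         max_dist: int = 1) -> int:
    """Count windows of len(pattern) within max_dist edits of pattern.

    max_dist is a hypothetical tolerance: 0 means exact matching only.
    """
    n = len(pattern)
    return sum(
        1
        for start in range(len(utterance) - n + 1)
        if edit_distance(utterance[start:start + n], pattern) <= max_dist
    )
```

Under these assumptions, raising `max_dist` lets a sequence learned from training data still count when a test utterance realizes one unit differently, while a small threshold keeps unrelated sequences from matching. Such counts could then feed whatever statistical discriminator is used to separate the languages.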
Bibliographic reference. Berkling, Kay M. / Barnard, Etienne (1996): "Language identification with inaccurate string matching", In ICSLP-1996, 1796-1799.