EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

Automatic Rhythm Modeling for Language Identification

Jérôme Farinas (1), François Pellegrino (2)

(1) IRIT, France; (2) DDL, France

This paper deals with an approach to Automatic Language Identification based on rhythmic modeling. Beside phonetics and phonotactics, rhythm is actually one of the most promising features to be considered for language identification, but significant problems are unresolved for its modeling. In this paper, an algorithm of rhythm extraction is described. Experiments are performed on read speech for 5 European languages. They show that salient features may be automatically extracted and efficiently modeled from the raw signal: a Gaussian mixture modeling of the extracted features results in a 81% percent of correct language identification for the 5 languages, using 20 s duration utterances.

Full Paper

Bibliographic reference.  Farinas, Jérôme / Pellegrino, François (2001): "Automatic rhythm modeling for language identification", In EUROSPEECH-2001, 2539-2542.