Speech Prosody 2002

Aix-en-Provence, France
April 11-13, 2002

Can Automatically Extracted Rhythmic Units Discriminate among Languages?

François Pellegrino (1), Jean-Hugues Chauchat (2), Ricco Rakotomalala (2), Jérôme Farinas (3)

(1) DDL UMR 5596 CNRS - Univ. Lumière Lyon 2, Lyon, France
(2) ERIC Univ. Lumière Lyon 2, Lyon, France
(3) IRIT UMR 5505 CNRS - Univ. Toulouse 3, Toulouse, France

This paper deals with rhythmic modeling and its application to language identification. Beside phonetics and phonotactics, rhythm is actually one of the most promising features to be considered for language identification, but significant problems are unresolved for its modeling. In this paper, an algorithm dedicated to rhythmic segmentation is described. Experiments are performed on read speech for 5 European languages. Several algorithms are compared. They show that salient features may be automatically extracted and efficiently modeled from the raw signal: a linear discriminant analysis of the extracted features results in a 80 % percent of correct language identification for the 5 languages, using 20 s duration utterances. Additional experiments reveal that the automatic rhythmic units convey also speaker specific features.

Full Paper

Bibliographic reference.  Pellegrino, François / Chauchat, Jean-Hugues / Rakotomalala, Ricco / Farinas, Jérôme (2002): "Can automatically extracted rhythmic units discriminate among languages?", In SP-2002, 563-566.