Speech Prosody 2010

Chicago, IL, USA
May 10-14, 2010

Prosody Labeling and Modeling for Mandarin Spontaneous Speech

Yu-Lun Chou (1), Chen-Yu Chiang (1), Yih-Ru Wang (1), Hsiu-Min Yu (2), Sin-Horng Chen (1)

(1) Dept. of Communication Engineering, NCTU, Taiwan; (2) Language Center, Chung Hua University, Taiwan

An unsupervised joint prosody labeling and modeling (PLM) method for exploring the prosody of spontaneous Mandarin speech is proposed. It is designed to automatically label a speech corpus and construct prosodic models simultaneously. Experimental results on a large dialog corpus confirmed its effectiveness. Many meaningful characteristics of spontaneous-speech prosody were investigated from the parameters of the well-trained prosodic models. The prosodic feature patterns of high-level constituents of the postulated prosody hierarchy were derived. An analysis of disfluencies related to the labeling results was also discussed. Those findings would provide rich prosodic information for various speech processing applications.

Full Paper

Bibliographic reference.  Chou, Yu-Lun / Chiang, Chen-Yu / Wang, Yih-Ru / Yu, Hsiu-Min / Chen, Sin-Horng (2010): "Prosody labeling and modeling for Mandarin spontaneous speech", In SP-2010, paper 087.