Speech Prosody 2010

Chicago, IL, USA
May 10-14, 2010

Modelling Filled Pauses Prosody to Synthesise Disfluent Speech

Jordi Adell (1), Antonio Bonafonte (1), David Escudero-Mancebo (2)

(1) TALP Research Center, Universitat Politècnica de Catalunya, Spain
(2) ECA-SIMM Laboratory, Universidad de Valladolid, Spain

This paper presents a new approach to the synthesis of filled pauses, motivated by the fact that they are as frequent as the most frequent words in conversational speech. The problem is tackled from the point of view of disfluent speech synthesis. Based on the synthetic disfluent speech model, we analyse the features that describe filled pauses and propose a model to predict them. The model was implemented and perceptually evaluated, with successful results.

Index Terms: speech synthesis, disfluent speech, filled pauses

