EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

Application of the Trended Hidden Markov Model to Speech Synthesis

John Dines, Sridha Sridharan, Miles Moody

RCSAVT, QUT, Australia

This paper presents our work on a speech synthesis system that utilises the trended Hidden Markov Model to represent the basic synthesis unit. We draw upon both speech recognition and speech synthesis research to develop a system that is able to synthesise intelligible and natural sounding speech. Acoustic units are clustered using the decision tree technique and speech data corresponding to these clusters is used for the training of trended Hidden Markov Model synthesis units. The overall system has been implemented in a PSOLA synthesiser and the resultant speech has been compared with that produced by a conventional diphone synthesiser to yield very encouraging results.

Full Paper

Bibliographic reference.  Dines, John / Sridharan, Sridha / Moody, Miles (2001): "Application of the trended hidden Markov model to speech synthesis", In EUROSPEECH-2001, 2239-2242.