4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Improving Text-to-Speech Synthesis

Mark Tatham (1), Eric Lewis (2)

(1) Department of Language and Linguistics, Essex University, Colchester, UK
(2) Department of Computer Science, University of Bristol, Bristol, UK

Naturalness in human speech depends on a number of factors, and the extent to which a text-to-speech synthesis system accounts for these factors in its model will be a measure of its success in the marketplace. Beyond the obvious factors of rhythm and intonation, there is the more difficult question of modelling the variability in human speech. This paper discusses how SPRUCE [1], a high-level text-to-speech synthesis system, incorporates several different types of variability.

Bibliographic reference. Tatham, Mark / Lewis, Eric (1996): "Improving text-to-speech synthesis", in ICSLP-1996, 1856-1859.