Third ESCA/COCOSDA Workshop on Speech Synthesis
November 26-29, 1998
Danish has a distinctive vowel length opposition which is
realized with little differences in vowel qualities. This paper
investigates the possibilities of using this fact in reducing the
size of the speech unit database in a high quality concatenative
based text-to-speech system for Danish. The purpose is to
evaluate the concept of using long vowels for synthesizing the
corresponding short vowels. If this proves successful the size
of the speech unit database may be reduced by approximately
An acoustic analysis of the long and short vowels in the present speech unit database was performed. The results are presented in a F1-F2 plot, and demonstrate a significant overlap between long and short vowels. Consequently, two different strategies for synthesizing the short vowels from their long counterpart were tested. The first strategy used resegmented long vowel and the second relied entirely on the time-scaling technique built into the signal generation module.
The two strategies for synthesizing the short vowels were compared to using pre-recorded short vowels in a comprehensive listening test. The results of the listening test were based on 32 subjects judging intelligibility and naturalness.
The results show no significant differences between the prerecorded short vowels and the resegmented long vowels synthesized as short vowels. The resegmented long vowels will be implemented in the present text-to-speech system for further testing.
Full Paper (with 2 sound examples linked from within the paper)
Bibliographic reference. Andersen, Ove / Dyhr, N.-J. / Engberg, I. S. / Nielsen, C. (1998): "Synthesising Short Vowels from their Long Counterparts in a Concatenative Based Text-to-Speech System", In SSW3-1998, 165-170.