EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

Reducing Spectral Mismatches in Concatenative Speech Synthesis via Systematic Database Enrichment

Maria Founda, George Tambouratzis, Aimilios Chalamandaris, George Carayannis

Institute for Language and Speech Processing, Greece

This paper presents work performed for the Time-Domain TTS system, which is being developed at the ILSP for the Greek language. It focuses on the enhancement of the synthetic speech quality, by reducing the spectral mismatches between concatenated segments. To that end, a study has been performed to determine the distance that can best predict when a spectral mismatch is audible. Experimentation with different spectral distances has taken place and the distance with the best performance has been used in order to systematically enrich the segment database, which initially contained only one instance per segment. Results of this procedure indicate a substantial improvement in the synthetic speech quality.

Full Paper

Bibliographic reference.  Founda, Maria / Tambouratzis, George / Chalamandaris, Aimilios / Carayannis, George (2001): "Reducing spectral mismatches in concatenative speech synthesis via systematic database enrichment", In EUROSPEECH-2001, 837-840.