Fourth ISCA ITRW on Speech Synthesis

August 29 - September 1, 2001
Perthshire, Scotland

Optimal data selection for unit selection synthesis

Alan W. Black and Kevin A. Lenzo

Carnegie-Mellon University and Cepstral, LLC, Pittsburgh, PA, USA

In this work, we address the issue of creating a set of utterances with optimal coverage for reliable, high quality concatenative synthesis, whether for general synthesis or domain synthesis. We present an automatic method that takes into account the acoustic distinctions made by a particular speaker and selects prompts from large databases of typical utterances. A general unit selection text-to-speech system created by this process can synthesize any input text, but the output is best for content intended to be similar to that in the database in terms of style, delivery, and coverage.

Full Paper

Bibliographic reference.  Black, Alan W. / Lenzo, Kevin A. (2001): "Optimal data selection for unit selection synthesis", In SSW4-2001, paper 129.