Fourth ISCA ITRW on Speech Synthesis

August 29 - September 1, 2001
Perthshire, Scotland

A new distance measure for costing spectral discontinuities in concatenative speech synthesizers

Robert E. Donovan

IBM T. J. Watson Research Center, Yorktown Heights, NY, USA

In many modern concatenative speech synthesisers the unit sequence used to synthesise each sentence is determined at runtime by a search algorithm seeking to optimise a multidimensional cost function. One of these costs is usually some form of spectral continuity cost, computed between the end of one segment and the start of the following segment, intended to ensure that the synthetic speech does not contain any unpleasant spectral discontinuities. This paper presents the results of listening tests conducted to evaluate the performance of several possible continuity measures. It also describes a new continuity measure developed at IBM which substantially out-performs all other measures tested.

Full Paper

Bibliographic reference.  Donovan, Robert E. (2001): "A new distance measure for costing spectral discontinuities in concatenative speech synthesizers", In SSW4-2001, paper 123.