13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Wideband Parametric Speech Synthesis Using Warped Linear Prediction

Tuomo Raitio (1), Antti Suni (2), Martti Vainio (2), Paavo Alku (1)

(1) Department of Signal Processing and Acoustics, Aalto University, Espoo, Finland
(2) Department of Behavioural Sciences, University of Helsinki, Helsinki, Finland

This paper studies the use of warped linear prediction (WLP) for wideband parametric speech synthesis. As the sampling frequency is increased from the usual 16 kHz, linear frequency resolution of conventional linear prediction (LP) cannot efficiently model the speech spectrum. By using frequency warping that weights perceptually the most important formant information, spectral models with better accuracy and lower model orders can be utilized. In this work, WLP is embedded in a parametric speech synthesizer to efficiently create wideband synthetic speech. Experiments show that WLP-based wideband synthetic speech is rated better compared to narrowband speech and wideband LP-based speech.

Index Terms: statistical parametric speech synthesis, wideband, warped linear prediction, WLP

Full Paper

Bibliographic reference.  Raitio, Tuomo / Suni, Antti / Vainio, Martti / Alku, Paavo (2012): "Wideband parametric speech synthesis using warped linear prediction", In INTERSPEECH-2012, 1420-1423.