Fourth European Conference on Speech Communication and Technology

Madrid, Spain
September 18-21, 1995

Interpolating MBE V/UV Mixture Function for High Quality Synthesis of Speech

King-fai Lam, Cheung-fat Chan

Department of Electronic Engineering, City University of Hong Kong, Hong Kong

A high-quality speech synthesis method based on interpolating the voiced/unvoiced (V/UV) mixture functions [1] of the multiband excitation (MBE) model [2,3] is proposed. In the MBE model, each band around a harmonic of the fundamental frequency in the excitation spectrum is rigidly declared either voiced or unvoiced, whereas it should be a mixture of the two. In the proposed method, each harmonic band in a short-time spectrum is synthesized by mixing voiced and unvoiced energies. The ratio of voiced to unvoiced energy across the spectrum is determined by the V/UV mixture function, which is subsequently parameterized by an all-zero model. A smooth transition of the excitation between phonetic units can be achieved by linearly interpolating the V/UV mixture functions of adjacent frames.
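The interpolation and mixing steps described above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the mixture function is sampled as one voiced-energy fraction in [0, 1] per harmonic band, that adjacent frames have the same number of bands, and it leaves out the all-zero parameterization. The function names `interpolate_mixture` and `split_band_energy` are hypothetical.

```python
def interpolate_mixture(prev_frame, next_frame, t):
    """Linearly interpolate two V/UV mixture functions.

    Assumption: each list holds one voiced fraction in [0, 1] per
    harmonic band, and both frames have the same number of bands.
    t = 0 returns prev_frame, t = 1 returns next_frame.
    """
    if len(prev_frame) != len(next_frame):
        raise ValueError("frames must have the same number of bands")
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must lie in [0, 1]")
    return [(1.0 - t) * a + t * b for a, b in zip(prev_frame, next_frame)]


def split_band_energy(band_energy, mixture):
    """Split each band's energy into voiced and unvoiced parts.

    Assumption: mixture[k] is the fraction of band k's energy that is
    voiced; the remainder is treated as unvoiced.
    """
    if len(band_energy) != len(mixture):
        raise ValueError("one mixture value is needed per band")
    voiced = [e * m for e, m in zip(band_energy, mixture)]
    unvoiced = [e * (1.0 - m) for e, m in zip(band_energy, mixture)]
    return voiced, unvoiced


# Hypothetical three-band example: halfway between two frames.
prev_frame = [1.0, 0.5, 0.0]
next_frame = [0.0, 0.5, 1.0]
midpoint = interpolate_mixture(prev_frame, next_frame, 0.5)
voiced, unvoiced = split_band_energy([4.0, 2.0, 1.0], midpoint)
```

Under these assumptions, a rigid MBE decision corresponds to mixture values restricted to 0 or 1, while intermediate values let a band carry both kinds of energy at once.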

Bibliographic reference. Lam, King-fai / Chan, Cheung-fat (1995): "Interpolating MBE V/UV mixture function for high quality synthesis of speech", In EUROSPEECH-1995, 443-447.