First International Conference on Spoken Language Processing (ICSLP 90)

Kobe, Japan
November 18-22, 1990

The Improved Source Model for High-Quality Synthetic Speech Sound

Tomoki Hamagami, Shinichiro Hashimoto

SECOM Intelligent Systems Laboratory, SECOM Co.,Ltd., Tokyo, Japan

We describe a new speech production model for improving the quality of synthetic vowel speech. The strength of this model is that the source model has a continuous harmonic structure in the time domain and the frequency domain. This report provides a comparative study of the ordinary source model, such as impulse and Rosenberg one, and the new model. The listening experiments have confirmed that our model produces high quality synthetic speech which is better as compared with synthetic ones using ordinary models. Thus, we understand that the really continuous harmonic structure in speech spectrum significantly contributes to synthetic speech quality.

Full Paper

Bibliographic reference.  Hamagami, Tomoki / Hashimoto, Shinichiro (1990): "The improved source model for high-quality synthetic speech sound", In ICSLP-1990, 805-808.