Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

Phrase Break Prediction Using Logistic Generalized Linear Model

Lifu Yi, Jian Li, Xiaoyan Lou, Jie Hao

Toshiba China R&D Center, China

In this paper we propose a novel phrase break prediction model for Mandarin speech synthesis. It is generalized linear models (GLM) with stepwise regression solution. We assume phrase break obeys Bernoulli distribution and then model phrase break probability by Logistic GLM. The attribute set is automatically selected by stepwise regression, which is a totally data-driven method. We also introduce speaking rate as a new attribute for prediction. The proposed method is applied to 2,150 utterances of the Mandarin speech corpus, and it achieves 5.4% higher performances than CART method in open test. The method can be extended to include more linguistic and prosodic attributes and it is very compact for application.

Full Paper

Bibliographic reference.  Yi, Lifu / Li, Jian / Lou, Xiaoyan / Hao, Jie (2006): "Phrase break prediction using logistic generalized linear model", In INTERSPEECH-2006, paper 1468-Tue3BuP.4.