13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Verifying Session Level Pronunciation Accuracy in a Speech Therapy Application

Shou-Chun Yin (1), Richard C. Rose (1), Yun Tang (2)

(1) Department of Electrical and Computer Engineering, McGill University, Montreal, Canada
(2) Nuance Communications Inc., Montreal, Canada

This paper investigates a new pronunciation verification (PV) approach based on a subspace Gaussian mixture model (SGMM) pronunciation model. A single SGMM is trained on utterances from both disabled speakers and reference speakers. The PV scores are computed directly from distances between the disabled and reference speaker projection vectors. Both session-level and utterance-level PV scenarios are presented and evaluated. PV performance is compared with that of an approach based on lattice posterior probabilities.
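
As a rough illustration of distance-based scoring between speaker projection vectors, the following minimal sketch computes a score from two vectors and compares it with a threshold. The abstract does not state which distance is used, so the cosine distance here is an assumption, and the names `cosine_distance`, `pv_score`, `is_accepted`, and the threshold value are hypothetical. The vectors are assumed to have been estimated already, from a whole session for session-level PV or from a single utterance for utterance-level PV.

```python
import math
from typing import Sequence


def cosine_distance(u: Sequence[float], v: Sequence[float]) -> float:
    """Return 1 - cos(u, v): 0 for identical directions, 1 for orthogonal ones."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimension")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("a zero vector has no direction")
    return 1.0 - dot / (norm_u * norm_v)


def pv_score(test_vector: Sequence[float], reference_vector: Sequence[float]) -> float:
    """Hypothetical PV score: distance between a test speaker's projection
    vector and a reference speaker's projection vector (smaller is closer).
    The choice of cosine distance is an assumption, not the paper's metric."""
    return cosine_distance(test_vector, reference_vector)


def is_accepted(score: float, threshold: float) -> bool:
    """Accept the pronunciation when the distance does not exceed the threshold.
    The threshold is a hypothetical tuning parameter."""
    return score <= threshold


if __name__ == "__main__":
    reference = [1.0, 0.0, 0.0]      # illustrative reference speaker vector
    session = [0.9, 0.1, 0.0]        # illustrative session-level test vector
    score = pv_score(session, reference)
    print(score, is_accepted(score, threshold=0.1))
```

The same scoring function serves both scenarios in this sketch; only the amount of speech used to estimate the test vector differs.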

Index Terms: confidence measure, speech therapy


Bibliographic reference. Yin, Shou-Chun / Rose, Richard C. / Tang, Yun (2012): "Verifying session level pronunciation accuracy in a speech therapy application", in INTERSPEECH-2012, 1331-1334.