Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

A Simulated-Data Adaptation Technique for Robust Speech Recognition

Nattanun Thatphithakkul (1), Boontee Kruatrachue (1), Chai Wutiwiwatchai (2), Sanparith Marukatat (2), Vataya Boonpiam (2)

(1) King Mongkut’s Institute of Technology Ladkrabang, Thailand; (2) NECTEC, Thailand

This paper proposes an efficient acoustic model adaptation method that uses simulated data in maximum likelihood linear regression (MLLR) adaptation for robust speech recognition. Online MLLR adaptation is an unsupervised process that requires input speech with automatically transcribed phone labels. Instead of using only the input signal for adaptation, the proposed simulated-data method enlarges the adaptation data by adding noise portions extracted from the input speech to a set of pre-recorded clean speech whose correct transcriptions are known. Various configurations of the proposed method are explored. Evaluations are performed on both speech corrupted by additive noise and real noisy speech. The experimental results show that the proposed system achieves a higher recognition rate than both a system that uses only the input speech for adaptation and a system that uses a multi-conditioned acoustic model.
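
The data-simulation step described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say how noise portions are located, so the energy-threshold rule, the looping of the noise segment, and every name used here (`extract_noise`, `add_noise`, `simulate_adaptation_data`, `frame_len`, `threshold`) are assumptions made for illustration. The MLLR adaptation itself is not shown; the resulting (noisy samples, transcript) pairs would be passed to whatever MLLR routine the recognizer provides.

```python
from typing import List, Sequence, Tuple


def extract_noise(signal: Sequence[float], frame_len: int, threshold: float) -> List[float]:
    """Collect frames whose mean energy is below `threshold`.

    Assumption: low-energy frames of the input contain noise only.
    """
    noise: List[float] = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        energy = sum(x * x for x in frame) / frame_len
        if energy < threshold:
            noise.extend(frame)
    return noise


def add_noise(clean: Sequence[float], noise: Sequence[float]) -> List[float]:
    """Add noise to a clean utterance, looping the noise to cover its length."""
    if not noise:
        return list(clean)  # no noise found: leave the utterance unchanged
    return [x + noise[i % len(noise)] for i, x in enumerate(clean)]


def simulate_adaptation_data(
    input_speech: Sequence[float],
    clean_set: Sequence[Tuple[Sequence[float], str]],
    frame_len: int = 4,
    threshold: float = 0.05,
) -> List[Tuple[List[float], str]]:
    """Pair each noise-corrupted clean utterance with its known transcript."""
    noise = extract_noise(input_speech, frame_len, threshold)
    return [(add_noise(samples, noise), text) for samples, text in clean_set]


# Toy signals (hypothetical values, not data from the paper).
input_speech = [0.1, -0.1, 0.1, -0.1, 0.9, -0.8, 0.7, -0.9]
clean_set = [([0.5, 0.5, -0.5, -0.5, 0.5, 0.5], "hello")]
simulated = simulate_adaptation_data(input_speech, clean_set)
```

Because the clean utterances carry known transcriptions, the simulated pairs can be used with correct labels, whereas the input speech alone relies on automatically generated phone labels.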

Bibliographic reference.  Thatphithakkul, Nattanun / Kruatrachue, Boontee / Wutiwiwatchai, Chai / Marukatat, Sanparith / Boonpiam, Vataya (2006): "A simulated-data adaptation technique for robust speech recognition", In INTERSPEECH-2006, paper 1157-Tue1A2O.3.