4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

A Unified Spectral Transformation Adaptation Approach for Robust Speech Recognition

Lei Yao, Dong Yu, Taiyi Huang

National Lab of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China

In this paper, Canonical Correlation Based Compensation (CCBC) is proposed as an unified approach to cope with the mismatch between training and test set. The mismatch between training and test conditions can be simply clustered into three classes: differences of speakers, changes of recording channel and effects of noisy environment. In previous work, we had used CCBC approach with some modifications to make our speech recognizer robust to the noisy environment successfully [1]. Recently, the same approach has been extended for speaker and channel adaptation. The results of our experiments show that CCBC approach well compensated all three kinds of distortion source between training and test conditions. In order to compare the performance of CCBC with that of some conventional adaptation approaches, the capacities of the techniques of cepstral mean normalization, RASTA and Lin-Log RASTA are tested. We find that CCBC has better performance than them. As an very important problem in CCBC approach, the selection of appropriate reference speech data is also discussed in this paper.

Full Paper

Bibliographic reference.  Yao, Lei / Yu, Dong / Huang, Taiyi (1996): "A unified spectral transformation adaptation approach for robust speech recognition", In ICSLP-1996, 981-984.