Fourth European Conference on Speech Communication and Technology

Madrid, Spain
September 18-21, 1995

Direct and Joint-Space Approaches to the Use of Spectral Transformation for Speaker Adaptation in Continuous Speech Recognition

H. C. Choi, R. W. King

Speech Technology Research Group, Department of Electrical Engineering, University of Sydney, Australia

This paper describes three different approaches to the use of spectral transformation for supervised speaker adaptation in continuous speech recognition. Each approach may involve transforming feature vectors of the speech of a speaker, transforming mean vectors of the HMMs of a reference recognition system, or transforming both feature vectors and HMMs. A comparison of these approaches is investigated using the ARPA 1000-word Resource Management (RM1) continuous speech corpus. Using the average speaker-independent (SI) test result as a reference point, it is found that the best adaptation approach can achieve an error reduction of 23.5% by using 10 sentences as adaptation speech.

Full Paper

Bibliographic reference.  Choi, H. C. / King, R. W. (1995): "Direct and joint-space approaches to the use of spectral transformation for speaker adaptation in continuous speech recognition", In EUROSPEECH-1995, 1151-1154.