5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

Using Formant Frequencies in Speech Recognition

John N. Holmes (1), Wendy J. Holmes (2), Philip N. Garner (2)

(1) Speech Technology Consultant, Uxbridge, U.K. (2) Speech Research Unit, DRA Malvern, Malvern, Worcs., U.K.

Despite their phonetic significance, formant frequencies have rarely been used as acoustic features for speech recognition. For some speech sounds, one or more of the formants may be so poorly defined that attempting a frequency measurement is not useful. It is also often difficult to decide which formant labels to attach to particular spectral peaks. This paper describes a new method of formant analysis that includes techniques to overcome both difficulties. Using the same data and HMM structure, results are compared between a recognizer using conventional cepstrum features and one using three formant frequencies combined with fewer cepstrum features to represent general spectral trends. For the same total number of features, the results show that including formant features can offer increased accuracy over using cepstrum features alone.
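The equal-dimensionality comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the total of 8 features, the function names, and all numeric values are hypothetical, chosen only to show how three formant frequencies can displace three cepstrum coefficients while the frame length stays fixed. The formant analysis itself, including the handling of poorly defined formants and of label assignment, is not modelled here.

```python
from typing import List, Sequence

N_FORMANTS = 3  # the abstract states that three formant frequencies are used


def cepstrum_only(cepstra: Sequence[float], total: int) -> List[float]:
    """Baseline frame: the first `total` cepstrum coefficients."""
    if len(cepstra) < total:
        raise ValueError("not enough cepstrum coefficients")
    return list(cepstra[:total])


def formants_plus_cepstrum(
    formants: Sequence[float], cepstra: Sequence[float], total: int
) -> List[float]:
    """Formant frame: three formant frequencies followed by as many
    low-order cepstrum coefficients as still fit within `total`."""
    if len(formants) != N_FORMANTS:
        raise ValueError("expected exactly three formant frequencies")
    n_cepstra = total - N_FORMANTS
    if n_cepstra < 0 or len(cepstra) < n_cepstra:
        raise ValueError("cannot build a frame of the requested size")
    return list(formants) + list(cepstra[:n_cepstra])


# Hypothetical values for a single analysis frame (not taken from the paper).
TOTAL = 8                                                # assumed feature count
cepstra = [1.2, -0.4, 0.3, 0.1, -0.2, 0.05, 0.0, -0.1]   # assumed coefficients
formants = [500.0, 1500.0, 2500.0]                       # illustrative F1-F3 in Hz

baseline_frame = cepstrum_only(cepstra, TOTAL)
formant_frame = formants_plus_cepstrum(formants, cepstra, TOTAL)
```

Both frames have the same length, so a recognizer with an unchanged HMM structure could be trained on either representation and the results compared directly. In practice the formant values would likely need scaling to a range comparable with the cepstrum coefficients; that step is omitted here because the abstract does not describe it.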

Bibliographic reference. Holmes, John N. / Holmes, Wendy J. / Garner, Philip N. (1997): "Using formant frequencies in speech recognition", in EUROSPEECH-1997, 2083-2086.