4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

The Use of Wavelet Transforms in Phoneme Recognition

Beng T. Tan (1), Minyue Fu (1), Andrew Spray (1), Phillip Dermody (2)

(1) Dept. of Electrical and Computer Engineering, The University of Newcastle, NSW, Australia
(2) National Acoustic Laboratories, NSW, Australia

This study investigates the usefulness of wavelet transforms in phoneme recognition. Both discrete wavelet transforms (DWT) and sampled continuous wavelet transforms (SCWT) are tested. The wavelet transform is used as part of the front-end processor, which extracts feature vectors for a speaker-independent HMM-based phoneme recognizer. The results are evaluated on a portion of the TIMIT corpus consisting of 30293 phoneme tokens for training and 14489 phoneme tokens for testing. The test results suggest that SCWT gives a considerably better recognition rate than DWT. On the other hand, the improvement of SCWT over Mel-scale cepstral coefficients appears to be marginal.
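
As a rough illustration of how a DWT can serve as a front end, the sketch below turns one speech frame into a small feature vector of subband log-energies. It is not the authors' method: the abstract does not state the wavelet, the decomposition depth, the frame length, or the exact feature definition, so the Haar wavelet, the three levels, the 64-sample frame, and the function names `haar_dwt_step` and `dwt_log_energy_features` are all assumptions made for brevity. The SCWT variant is not shown.

```python
import math


def haar_dwt_step(signal):
    """One level of the orthonormal Haar DWT.

    Returns (approximation, detail), each half the length of the input.
    The Haar wavelet is an assumption; the abstract does not name one.
    """
    scale = 1.0 / math.sqrt(2.0)
    pairs = range(0, len(signal) - 1, 2)
    approx = [(signal[i] + signal[i + 1]) * scale for i in pairs]
    detail = [(signal[i] - signal[i + 1]) * scale for i in pairs]
    return approx, detail


def dwt_log_energy_features(frame, levels=3, floor=1e-10):
    """Hypothetical feature vector: log-energy of each detail subband,
    followed by the log-energy of the final approximation subband."""
    if len(frame) % (1 << levels) != 0:
        raise ValueError("frame length must be divisible by 2**levels")
    features = []
    approx = list(frame)
    for _ in range(levels):
        approx, detail = haar_dwt_step(approx)
        # The small floor avoids log(0) for silent subbands.
        features.append(math.log(sum(d * d for d in detail) + floor))
    features.append(math.log(sum(a * a for a in approx) + floor))
    return features


# Hypothetical 64-sample frame: a sinusoid with 4 cycles across the frame.
frame = [math.sin(2.0 * math.pi * 4.0 * n / 64.0) for n in range(64)]
features = dwt_log_energy_features(frame, levels=3)
```

In a recognizer of the kind described, one such vector per frame would be passed to the HMM stage. Because the Haar transform used here is orthonormal, the subband energies sum to the frame energy, which gives a simple sanity check on the decomposition.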

Bibliographic reference.  Tan, Beng T. / Fu, Minyue / Spray, Andrew / Dermody, Phillip (1996): "The use of wavelet transforms in phoneme recognition", In ICSLP-1996, 2431-2434.