4th International Conference on Spoken Language Processing
Philadelphia, PA, USA
This study investigates the usefulness of wavelet transforms in phoneme recognition. Both discrete wavelet transforms (DWT) and sampled continuous wavelet transforms (SCWT) are tested. The wavelet transform is used as a part of the front-end processor which extracts feature vectors for a speaker-independent HMM-based phoneme recognizer. The results are evaluated on a portion of TIMIT corpus consisting of 30293 phoneme tokens for training and 14489 phoneme tokens for testing. The test results suggest that SCWT gives considerably better recognition rate than DWT. On the other hand, the improvement of SCWT over Mel-scale cepstral coefficients appears to be marginal.
Bibliographic reference. Tan, Beng T. / Fu, Minyue / Spray, Andrew / Dermody, Phillip (1996): "The use of wavelet transforms in phoneme recognition", In ICSLP-1996, 2431-2434.