4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Dynamic Features for Segmental Speech Recognition

Naomi Harte (1), Saeed V. Vaseghi (1), Ben Milner (2)

(1) School of Electrical Engineering and Computer Science, The Queen's University of Belfast, Belfast, N. Ireland, UK
(2) British Telecom Research Laboratories, UK

Speech models and features that emphasise the dynamic aspects of speech can improve speech recognition. The cepstral time matrix is an established and successful method of encoding these dynamics. This paper extends this set of dynamic features by considering cepstral time features at both the segmental and the subsegmental level. This makes it possible to use a conditional probability density function (pdf) for the state observation within an HMM and to incorporate it into the training stage. Linear discriminant analysis is then applied to the new feature set to identify the subset of features that contributes most to recognition.
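
The abstract does not define the cepstral time matrix, so the following is only a minimal sketch under a commonly used formulation: the cepstral vectors of a segment are stacked in time, and a DCT is taken along the time axis of each cepstral trajectory. The function names (`dct_matrix`, `cepstral_time_matrix`, `subsegment_features`), the unnormalised DCT-II basis, and the equal-length split into subsegments are illustrative assumptions, not the authors' implementation.

```python
import math


def dct_matrix(n_out, n_in):
    """Unnormalised DCT-II basis: n_out coefficients over n_in samples."""
    return [
        [math.cos(math.pi * k * (2 * t + 1) / (2 * n_in)) for t in range(n_in)]
        for k in range(n_out)
    ]


def cepstral_time_matrix(cepstra, n_time_coeffs):
    """Encode the dynamics of a segment of cepstral vectors.

    cepstra: list of M frames, each a list of N cepstral coefficients.
    Returns an n_time_coeffs x N matrix whose row k holds the k-th DCT
    coefficient, taken along time, of every cepstral trajectory.
    Row 0 is proportional to the segment mean; higher rows capture
    progressively faster variation over the segment.
    """
    m = len(cepstra)
    n = len(cepstra[0])
    basis = dct_matrix(n_time_coeffs, m)
    return [
        [sum(basis[k][t] * cepstra[t][j] for t in range(m)) for j in range(n)]
        for k in range(n_time_coeffs)
    ]


def subsegment_features(cepstra, n_sub, n_time_coeffs):
    """Assumed subsegmental variant: one matrix per contiguous subsegment.

    The segment is split into n_sub contiguous pieces of near-equal length
    (requires len(cepstra) >= n_sub so that no piece is empty).
    """
    m = len(cepstra)
    bounds = [(i * m) // n_sub for i in range(n_sub + 1)]
    return [
        cepstral_time_matrix(cepstra[bounds[i]:bounds[i + 1]], n_time_coeffs)
        for i in range(n_sub)
    ]
```

Calling `cepstral_time_matrix(frames, 3)` on a whole segment gives one segmental feature matrix, while `subsegment_features(frames, 2, 3)` gives one matrix per half of the segment. In a setup like the one the abstract describes, such matrices would then be flattened into feature vectors before any discriminant analysis is applied.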

Bibliographic reference. Harte, Naomi / Vaseghi, Saeed V. / Milner, Ben (1996): "Dynamic features for segmental speech recognition", in ICSLP-1996, 933-936.