Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

State-Level Variable Modeling for Phoneme Classification

Hao-Zheng Li, Douglas O'Shaughnessy

Universite du Quebec, Canada

In HMM-based pattern recognition, the structure of the HMM is often predetermined according to some prior knowledge. In the recognition process, we usually make our judgment based on the maximum likelihood of the HMM, without considering the time-varying property of state-level variables, which unfortunately may lead to incorrect results. In this paper, we analyze the property of state-level variables in the HMM and show it is possible to significantly enhance the performance of speech recognition systems when using the state-level variable time-varying property. We propose four methods to model state-level variable trajectories and then test them on a phoneme classification task on the TIMIT speech corpus, 11.95% error rate reduction is achieved and some empirical conclusions are drawn.

Full Paper

Bibliographic reference.  Li, Hao-Zheng / O'Shaughnessy, Douglas (2006): "State-level variable modeling for phoneme classification", In INTERSPEECH-2006, paper 1332-Mon3BuP.5.