INTERSPEECH 2006 - ICSLP
In HMM-based pattern recognition, the structure of the HMM is often predetermined according to some prior knowledge. In the recognition process, we usually make our judgment based on the maximum likelihood of the HMM, without considering the time-varying property of state-level variables, which unfortunately may lead to incorrect results. In this paper, we analyze the property of state-level variables in the HMM and show it is possible to significantly enhance the performance of speech recognition systems when using the state-level variable time-varying property. We propose four methods to model state-level variable trajectories and then test them on a phoneme classification task on the TIMIT speech corpus, 11.95% error rate reduction is achieved and some empirical conclusions are drawn.
Bibliographic reference. Li, Hao-Zheng / O'Shaughnessy, Douglas (2006): "State-level variable modeling for phoneme classification", In INTERSPEECH-2006, paper 1332-Mon3BuP.5.