5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

A Nonstationary Autoregressive HMM and its Application to Speech Enhancement

Ki Yong Lee (1), Jae Yeol Rheem (2)

(1) Dept. of Electronics Engr., Changwon National University, Korea (2) Dept. of Electronics Engr., Korea Institute of Technology and Education, Chonan, Korea

Speech sounds such as fricatives, glides, liquids, diphthongs, and the transition regions between phones are markedly nonstationary. To model this behavior, we propose a nonstationary autoregressive (AR) HMM with state-dependent polynomial functions, in which the parameters of the nonstationary AR model depend on the states of the Markov chain. The model is designed to handle the speech signal at the frame level, where each frame is represented by the signal itself rather than by feature vectors. We also propose a new speech enhancement method, based on the nonstationary AR HMM and the interacting multiple model (IMM) algorithm, for white-noise conditions. The enhanced signal is the weighted sum of the outputs of parallel Kalman filters that interact according to the IMM algorithm. Simulation results show that the proposed method offers performance gains over previous results [7] at slightly increased complexity.
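As a rough illustration of what a nonstationary AR model with polynomial coefficients can look like, the sketch below evaluates time-varying AR coefficients within a single frame and uses them to synthesize a signal. It is not taken from the paper: the function names (`tv_ar_coeffs`, `synthesize`), the normalized time axis on [0, 1], the monomial basis, and the sign convention x[t] = sum_i a_i(t) x[t-i] + e[t] are all assumptions made for this example. In an HMM setting, one such coefficient matrix would be held per state.

```python
import numpy as np

def tv_ar_coeffs(poly, n_samples):
    """Evaluate time-varying AR coefficients over one frame.

    poly has shape (p, d + 1); poly[i, k] multiplies t**k in the i-th
    AR coefficient (hypothetical parameterization, assumed here).
    Returns an array of shape (n_samples, p).
    """
    t = np.linspace(0.0, 1.0, n_samples)           # assumed normalized time
    powers = t[:, None] ** np.arange(poly.shape[1])  # (n_samples, d + 1)
    return powers @ poly.T

def synthesize(poly, excitation):
    """Run the recursion x[t] = sum_i a_i(t) * x[t - i] + e[t]."""
    excitation = np.asarray(excitation, dtype=float)
    n = len(excitation)
    p = poly.shape[0]
    a = tv_ar_coeffs(poly, n)
    x = np.zeros(n)
    for t in range(n):
        acc = excitation[t]
        for i in range(1, p + 1):
            if t - i >= 0:                         # samples before the frame are zero
                acc += a[t, i - 1] * x[t - i]
        x[t] = acc
    return x

# Illustrative second-order model whose coefficients drift linearly in time:
# a_1(t) = 1.2 + 0.2 t, a_2(t) = -0.6 - 0.1 t (values chosen arbitrarily).
example_poly = np.array([[1.2, 0.2],
                         [-0.6, -0.1]])
impulse_response = synthesize(example_poly, [1.0, 0.0, 0.0])
```

With a polynomial of degree zero this reduces to an ordinary stationary AR model, which is one way to see the polynomial terms as the part that captures within-frame variation.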

Bibliographic reference.  Lee, Ki Yong / Rheem, Jae Yeol (1997): "A nonstationary autoregressive HMM and its application to speech enhancement", In EUROSPEECH-1997, 1407-1410.