We propose a novel method of pitch track correction that uses an ensemble Kalman filter to improve the performance of monaural speech segregation. The proposed method considers all reliable pitch streaks for pitch track correction, whereas the conventional segregation approach relies on only the longest streak in a given speech stream. In addition, unreliable pitch streaks are corrected with an ensemble Kalman filter that uses autocorrelation functions as noisy observations for the hidden true pitch values. Our proposed approach provides more accurate pitch estimation, thus improving speech segregation performance for various types of noises, in particular, colored noise. In speech segregation experiments on mixtures of speech and various competing noises, the proposed method demonstrated superior performance to the conventional approach.
Bibliographic reference. Kim, Han-Gyu / Jang, Gil-Jin / Park, Jeong-Sik / Oh, Yung-Hwan (2013): "Monaural speech segregation based on pitch track correction using an ensemble kalman filter", In INTERSPEECH-2013, 813-816.