EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

A Fundamental Frequency Estimation Method for Noisy Speech Based on Instantaneous Amplitude and Frequency

Yuichi Ishimoto (1), Masashi Unoki (2), Masato Akagi (1)

(1) Japan Advanced Institute of Science and Technology, Japan
(2) University of Cambridge, UK

This paper proposes a robust and accurate F0 estimation method for noisy speech. This method uses two different principles: (1) an F0 estimation based on periodicity and harmonicity of instantaneous amplitude for a robust estimation in noisy environments, and (2) an F0 estimation based on stability of instantaneous frequency as an accurate estimation method. The proposed method also uses a comb filter with controllable pass-bands to combine the two estimation methods. Simulations were carried out to estimate F0s from real speech in noisy environments and to compare the proposed method with other methods. The results showed that this method can not only estimate F0s for clean speech with similar accuracy as the method using only instantaneous frequency but also robustly estimate F0s from noisy speech in comparison with the other methods such as the cepstrum method.

Full Paper

Bibliographic reference.  Ishimoto, Yuichi / Unoki, Masashi / Akagi, Masato (2001): "A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency", In EUROSPEECH-2001, 2439-2442.