EUROSPEECH 2001 Scandinavia
This paper proposes a robust and accurate F0 estimation method for noisy speech. The method combines two principles: (1) F0 estimation based on the periodicity and harmonicity of instantaneous amplitude, which provides robustness in noisy environments, and (2) F0 estimation based on the stability of instantaneous frequency, which provides accuracy. A comb filter with controllable pass-bands is used to combine the two estimates. Simulations were carried out to estimate F0s from real speech in noisy environments and to compare the proposed method with other methods. The results showed that the proposed method not only estimates F0s for clean speech with accuracy similar to that of the method using only instantaneous frequency, but also estimates F0s from noisy speech more robustly than other methods such as the cepstrum method.
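The two quantities named in the abstract can be illustrated with a minimal sketch. The abstract does not say how instantaneous amplitude and instantaneous frequency are computed, so this example assumes the common analytic-signal (Hilbert transform) definitions; the function names, the 8 kHz sampling rate, and the 200 Hz test tone are all hypothetical choices made for illustration, not the authors' implementation. It does not cover the periodicity and harmonicity analysis or the comb filter.

```python
import cmath
import math


def analytic_signal(frame):
    """Return the analytic signal of a real frame using a naive O(N^2) DFT.

    Assumption: the analytic-signal definition is used here for illustration;
    the abstract does not specify the authors' analysis front end.
    """
    n = len(frame)
    spectrum = [
        sum(frame[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
    for k in range(n):
        if k == 0 or 2 * k == n:
            continue  # DC and Nyquist bins are kept unchanged
        # Double the positive-frequency bins and zero the negative ones.
        spectrum[k] *= 2.0 if 2 * k < n else 0.0
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def instantaneous_amplitude(z):
    """Magnitude of the analytic signal at each sample."""
    return [abs(v) for v in z]


def instantaneous_frequency(z, fs):
    """Phase advance between consecutive samples, converted to Hz."""
    return [
        cmath.phase(z[t + 1] * z[t].conjugate()) * fs / (2.0 * math.pi)
        for t in range(len(z) - 1)
    ]


# Hypothetical test tone: 200 Hz cosine, 20 ms frame at 8 kHz.
fs = 8000.0
f0 = 200.0
frame = [math.cos(2.0 * math.pi * f0 * t / fs) for t in range(160)]

z = analytic_signal(frame)
amp = instantaneous_amplitude(z)
freq = instantaneous_frequency(z, fs)
```

For a single stationary tone such as this one, the instantaneous frequency is constant over the frame, which is the kind of stability the second principle relies on. Real speech would first need to be split into narrow bands so that each band is dominated by one harmonic.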
Bibliographic reference. Ishimoto, Yuichi / Unoki, Masashi / Akagi, Masato (2001): "A fundamental frequency estimation method for noisy speech based on instantaneous amplitude and frequency", In EUROSPEECH-2001, 2439-2442.