Second International Conference on Spoken Language Processing (ICSLP'92)

Banff, Alberta, Canada
October 13-16, 1992

Robust Pitch Detection by Narrow Band Spectrum Analysis

Hiroshi Shimodaira (1), Mitsuru Nakai (2)

(1) School of Information Science, Japan Advanced Institute of Science and Technology, Ishikawa, Japan
(2) Dept. of Information Eng., Faculty of Engineering, Tohoku University, Sendai-shi, Japan

This paper proposes a new technique for detecting pitch patterns, useful for automatic speech recognition, based on narrow band spectrum analysis. The motivation for this approach is that humans perceive some kind of pitch in whispers, where no fundamental frequency can be observed, while most pitch determination algorithms (PDAs) fail to detect such perceptual pitch. Narrow band spectrum analysis enables us to find pitch structure distributed locally in the frequency domain. The technique is incorporated into PDAs by applying it to the lag window based PDA. Experimental results show that pitch detection performance could be improved by 4% for voiced sounds and 8% for voiceless sounds.
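
The abstract does not spell out the algorithm, so the following is only an illustrative sketch of the general idea of looking for pitch structure inside a limited frequency band; it is not the authors' method, and it does not reproduce the lag window based PDA. It assumes a simple band-limited autocorrelation: the power spectrum is masked to one band and transformed back to the lag domain, where the strongest peak in a plausible pitch range is taken as the pitch period. The function name `band_autocorr_pitch`, its parameters, and the synthetic test signal (harmonics of 200 Hz with the fundamental absent) are all hypothetical choices made for illustration.

```python
import numpy as np

def band_autocorr_pitch(x, fs, band, f0_range=(60.0, 400.0)):
    """Estimate pitch (Hz) from the power spectrum restricted to one band.

    Hypothetical helper for illustration; not taken from the paper.
    """
    n = len(x)
    nfft = 2 * n  # zero-pad so the autocorrelation does not wrap around
    spec = np.fft.rfft(x * np.hanning(n), nfft)
    power = np.abs(spec) ** 2
    freqs = np.fft.rfftfreq(nfft, d=1.0 / fs)

    # Keep only the spectral components inside the chosen band.
    mask = (freqs >= band[0]) & (freqs <= band[1])

    # Inverse transform of the masked power spectrum gives the
    # autocorrelation of the band-limited signal (Wiener-Khinchin).
    acf = np.fft.irfft(power * mask, nfft)

    # Search for the strongest peak among lags in the pitch range.
    lag_min = int(fs / f0_range[1])
    lag_max = int(fs / f0_range[0])
    lag = lag_min + int(np.argmax(acf[lag_min:lag_max + 1]))
    return fs / lag

# Synthetic signal: harmonics of 200 Hz between 1000 and 2000 Hz only,
# so there is no energy at the fundamental itself.
fs = 8000
t = np.arange(1024) / fs
x = sum(np.cos(2 * np.pi * f * t) for f in range(1000, 2001, 200))

estimate = band_autocorr_pitch(x, fs, band=(900.0, 2100.0))
print(estimate)
```

In this toy case the harmonic spacing inside the band is enough to recover a pitch estimate even though the fundamental is missing, which is the kind of locally distributed pitch structure the abstract refers to. Real whispered speech is far noisier than this signal, so the sketch says nothing about the reported accuracy gains.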


Bibliographic reference.  Shimodaira, Hiroshi / Nakai, Mitsuru (1992): "Robust pitch detection by narrow band spectrum analysis", In ICSLP-1992, 1597-1600.