Speech Prosody 2006

Dresden, Germany
May 2-5, 2006

Evaluation of Pitch Detection Algorithms in Adverse Conditions

Bojan Kotnik (1), Harald Höge (2), Zdravko Kacic (1)

(1) University of Maribor, Slovenia
(2) Siemens AG, Corporate Technology, Munich, Germany

Robust fundamental frequency estimation in adverse conditions is important in various speech processing applications. In this paper a new pitch detection algorithm (PDA) based on the autocorrelation of the Hilbert envelope of the LP residual [1] is compared to another well established algorithm from Goncharoff [2]. A set of evaluation criteria is collected on which the two PDA algorithms are compared. In order to evaluate the algorithms in adverse conditions a suited reference database was constructed. This reference database consists of parts of the SPEECON speech database [3] where recordings of 60 speakers were selected and manually pitch marked. The recordings cover several adverse conditions as noise in the car cabin and reverberations of office rooms. The evaluation highlights the good performance of the new algorithm in comparison but shows, that low SNR conditions and strong reverberation are still a demanding challenge for future pitch detection algorithms.

References

  1. Mahadeva, S. R., and Yegnanarayana, B., 2004. Extraction of Pitch in Adverse Conditions. Proc. ICASSP 2004.
  2. Goncharoff, V., and Gries, P., 1998. An Algorithm for Accurately Marking Pitch Pulses in Speech Signals. IASTED International conference SIP ’98, Nevada, USA.
  3. Iskra, D. J. et al., 2002. SPEECON - Speech Databases for Consumer Devices: Database Specification and Validation. Proc. LREC'2002.

Full Paper

Bibliographic reference.  Kotnik, Bojan / Höge, Harald / Kacic, Zdravko (2006): "Evaluation of pitch detection algorithms in adverse conditions", In SP-2006, paper 083.