EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

Entropy Based Voice Activity Detection in Very Noisy Conditions

Philippe Renevey (1), Andrzej Drygajlo (2)

(1) CSEM, Switzerland; (2) EPFL, Switzerland

This paper addresses the problem of robust voice activity detection (VAD) capable of working at very low signal-to-noise ratios (SNR < 10 dB). The proposed algorithm is based on entropy measures estimated from the time-frequency magnitude spectrum. The problem of estimating the noise distribution in the detected non-speech segments of the analysed signal is also addressed. It is shown that the new entropy-based VAD significantly outperforms the commonly used energy-based algorithms in all noise conditions (stationary, non-stationary, white and coloured) at SNRs from 10 dB down to -10 dB and below. One of the main advantages of the proposed method is that it is not very sensitive to changes in the noise level.
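
The general idea of an entropy measure on the magnitude spectrum can be illustrated with a minimal sketch. This is not the authors' algorithm, whose details (including the noise distribution estimation) are not given in the abstract; it is a generic spectral-entropy detector written under stated assumptions. The function names and the parameters `frame_len`, `hop`, `noise_frames` and `margin` are hypothetical, and the sketch assumes that the first few frames of the signal contain noise only.

```python
import numpy as np

def spectral_entropy(frame, eps=1e-12):
    """Shannon entropy (bits) of one frame's normalised magnitude spectrum."""
    mag = np.abs(np.fft.rfft(frame * np.hanning(len(frame))))
    p = mag / (mag.sum() + eps)  # treat the spectrum as a probability mass function
    return float(-np.sum(p * np.log2(p + eps)))

def entropy_vad(signal, frame_len=256, hop=128, noise_frames=10, margin=0.5):
    """Flag frames whose entropy lies more than `margin` bits below a noise baseline.

    Assumption (not from the paper): the first `noise_frames` frames are
    speech-free and define the baseline entropy of the noise.
    """
    n_frames = 1 + (len(signal) - frame_len) // hop
    entropy = np.array([
        spectral_entropy(signal[i * hop:i * hop + frame_len])
        for i in range(n_frames)
    ])
    baseline = entropy[:noise_frames].mean()
    return entropy < baseline - margin, entropy

# Synthetic demonstration: white noise with a strong tone in the middle second.
rng = np.random.default_rng(0)
fs = 8000
x = rng.standard_normal(3 * fs)
t = np.arange(fs) / fs
x[fs:2 * fs] += 10.0 * np.sin(2 * np.pi * 500.0 * t)
flags, entropy = entropy_vad(x)
```

Because each spectrum is normalised before the entropy is computed, scaling the whole signal by a constant leaves the measure essentially unchanged, which is in line with the abstract's remark about low sensitivity to the noise level. The synthetic tone here is deliberately strong; the sketch makes no claim about performance at the low SNRs studied in the paper.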


Bibliographic reference. Renevey, Philippe / Drygajlo, Andrzej (2001): "Entropy based voice activity detection in very noisy conditions", in EUROSPEECH-2001, 1887-1890.