5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

Processing Linear Prediction Residual for Speech Enhancement

B. Yegnanarayana (1), Carlos Avendano (2), Hynek Hermansky (2), P. Satyanarayana Murthy (1)

(1) Department of Computer Science and Engineering Indian Institute of Technology, Madras, India (2)Department of Electrical Engineering Oregon Graduate Institute of Science & Technology, Portland, Oregon, USA

In this paper we propose a method for enhancement of speech in the presence of additive noise. The objective is to selectively enhance the high SNR regions in the noisy speech in the temporal and spectral domains, without causing significant distortion in the resulting enhanced speech. This is proposed to be done at three different levels: (a) At the gross level, by identifying the regions of speech and noise in the temporal domain, (b) At the finer level, by identifying the regions of high and low SNR portions in the noisy speech, and (c) At the short-time spectrum level, by enhancing the spectral peaks over spectral valleys. Processing of noisy speech for enhancement involves mostly weighting the LP residual samples. The weighted residual samples are used to excite the time- varying LP filter to produce enhanced speech.

Full Paper

Bibliographic reference.  Yegnanarayana, B. / Avendano, Carlos / Hermansky, Hynek / Murthy, P. Satyanarayana (1997): "Processing linear prediction residual for speech enhancement", In EUROSPEECH-1997, 1399-1402.