ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing

ICC Jeju, Korea
October 3, 2004

MAP Estimation of Speech Spectral Component Under GGD a Priori

Rajkishore Prasad, Hiroshi Saruwatari, Kiyohiro Shikano

Graduate School of Information Science, Nara Institute of Science and Technology, Nara, Japan

This paper presents Maximum A Posteriori (MAP) estimation of the spectral components of clean speech from the observed data noised by the additive background noise having Gaussian or non-Gaussian statistical distribution. In the proposed algorithm MAP estimator for the spectral components of clean signal is derived using Generalized Gaussian Distribution (GGD) function as a priori statistical models for the spectral components of speech as well as noise. Since the spikiness of the GGD can be controlled by the shape parameter, it is possible to model Gaussian as well as non-Gaussian noise, corrupting the speech signal. The enhancement results for the speech signal corrupted by the Gaussian noise and non- Gaussian noise are presented to show the usefulness of the estimator. Denoising performance for the Laplacian noise and white Gaussian noise have also been compared with that of the conventional Wiener filtering, which assumes Gaussian distributions for both the speech and noise.

Full Paper

Bibliographic reference.  Prasad, Rajkishore / Saruwatari, Hiroshi / Shikano, Kiyohiro (2004): "MAP estimation of speech spectral component under GGD a priori", In SAPA-2004, paper 115.