14th Annual Conference of the International Speech Communication Association

Lyon, France
August 25-29, 2013

A Two-Step Technique for MRI Audio Enhancement Using Dictionary Learning and Wavelet Packet Analysis

Colin Vaz, Vikram Ramanarayanan, Shrikanth Narayanan

University of Southern California, USA

We present a method for speech enhancement of data collected in extremely noisy environments, such as those found during magnetic resonance imaging (MRI) scans. We propose a two-step algorithm to perform this noise suppression. First, we use probabilistic latent component analysis to learn dictionaries of the noise and speech+noise portions of the data and use these to factor the noisy spectrum into estimated speech and noise components. Second, we apply a wavelet packet analysis in conjunction with a wavelet threshold that minimizes the KL divergence between the estimated speech and noise to achieve further noise suppression. Based on both objective and subjective assessments, we find that our algorithm significantly outperforms traditional techniques such as nLMS, while not requiring prior knowledge or periodicity of the noise waveforms that current state-of-the-art algorithms require.
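The first step described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes magnitude spectrograms are already available as non-negative arrays, and it swaps in KL-divergence non-negative matrix factorization (multiplicative updates) for probabilistic latent component analysis, a closely related factorization. The function names `kl_nmf` and `separate`, the dictionary sizes, and the soft-mask reconstruction are all hypothetical choices made for illustration.

```python
import numpy as np

def kl_nmf(V, n_components, W_fixed=None, n_iter=200, seed=0):
    """KL-divergence NMF via multiplicative updates (a stand-in for PLCA).

    V: non-negative (n_freq, n_frames) magnitude spectrogram.
    W_fixed: optional pre-learned dictionary whose columns are held constant.
    Returns W (fixed columns first, then n_components learned ones) and H.
    """
    rng = np.random.default_rng(seed)
    eps = 1e-12
    n_freq, n_frames = V.shape
    n_fixed = 0 if W_fixed is None else W_fixed.shape[1]
    W = rng.random((n_freq, n_fixed + n_components)) + eps
    if n_fixed:
        W[:, :n_fixed] = W_fixed
    H = rng.random((W.shape[1], n_frames)) + eps
    for _ in range(n_iter):
        # Update activations for all dictionary atoms.
        R = V / (W @ H + eps)
        H *= (W.T @ R) / (W.sum(axis=0)[:, None] + eps)
        # Update only the free (non-fixed) dictionary atoms.
        R = V / (W @ H + eps)
        W_new = W * (R @ H.T) / (H.sum(axis=1)[None, :] + eps)
        W[:, n_fixed:] = W_new[:, n_fixed:]
        # Normalize free atoms to sum to one and move the scale into H.
        scale = W[:, n_fixed:].sum(axis=0) + eps
        W[:, n_fixed:] /= scale
        H[n_fixed:, :] *= scale[:, None]
    return W, H

def separate(V_noisy, V_noise_only, n_noise=4, n_speech=8):
    """Split a noisy spectrogram into speech and noise estimates.

    The dictionary sizes are illustrative assumptions, not values from the paper.
    """
    # Learn a noise dictionary from a noise-only portion of the recording.
    W_noise, _ = kl_nmf(V_noise_only, n_noise)
    # Explain the speech+noise portion with fixed noise atoms plus new atoms.
    W, H = kl_nmf(V_noisy, n_speech, W_fixed=W_noise)
    noise_part = W[:, :n_noise] @ H[:n_noise]
    speech_part = W[:, n_noise:] @ H[n_noise:]
    # Soft mask, so the two estimates sum back to the noisy input.
    mask = speech_part / (speech_part + noise_part + 1e-12)
    return mask * V_noisy, (1.0 - mask) * V_noisy
```

Calling `separate(V_noisy, V_noise_only)` returns a pair of arrays with the same shape as `V_noisy`. Holding the noise atoms fixed while learning additional atoms on the speech+noise portion is what lets the extra atoms absorb the speech energy. The wavelet packet thresholding of the second step would then operate on these two estimates.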

Bibliographic reference. Vaz, Colin / Ramanarayanan, Vikram / Narayanan, Shrikanth (2013): "A two-step technique for MRI audio enhancement using dictionary learning and wavelet packet analysis", in INTERSPEECH-2013, 1312-1315.