5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

Noisy Speech Enhancement by Fusion of Auditory and Visual Information: A Study of Vowel Transitions

Laurent Girin, Gang Feng, Jean-Luc Schwartz

Institut de la Communication Parlée, UPRESA 5009 INPG/ENSERG/Université Stendhal, Grenoble Cedex 09, France

This paper presents a noisy speech enhancement technique based on the fusion of auditory and visual information. We first present the global structure of the system, then focus on the tool used to fuse the two sources of information. The complete noise reduction system is implemented for vowel transitions corrupted by white noise. A full evaluation of the system in this context is presented, including distance measures, Gaussian classification scores, and a perceptual test. The results are very promising.


Bibliographic reference. Girin, Laurent / Feng, Gang / Schwartz, Jean-Luc (1997): "Noisy speech enhancement by fusion of auditory and visual information: a study of vowel transitions", in EUROSPEECH-1997, 2555-2558.