4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Increasing Robustness in GMM Speaker Recognition Systems for Noisy and Reverberant Speech with Low Complexity Microphone Arrays

Joaquín González-Rodríguez (1), Javier Ortega-García (1), César Martin (2), Luis Hernández (2)

(1) DIAC - EUIT Telecomunicación, Universidad Politécnica de Madrid, Spain
(2) GAPS - SSR - ETSIT, UPM, Madrid, España

In this paper we describe the additive robustness obtained through the combined use of a first acoustic processing step based on a low complexity microphone array, followed by a spectral normalization step. Microphone arrays have shown to provide good results in reducing different sources of acoustic degradation. However, microphone arrays produce linear filtering effects that need to be compensated in order to obtain a minimal spectral distortion. In this contribution we will present the combination of a microphone array together with different well known spectral normalization techniques as preprocessing stages to a Gaussian Mixture Models (GMM) based text-independent speaker recognition system. We will show that the combination of these extensively used techniques in the fields of speech enhancement and robust speaker recognition respectively, greatly improves the results obtained when the system is tested in noisy reverberant environments with short utterances from unconstrained conversational speech.

Full Paper

Bibliographic reference.  González-Rodríguez, Joaquín / Ortega-García, Javier / Martin, César / Hernández, Luis (1996): "Increasing robustness in GMM speaker recognition systems for noisy and reverberant speech with low complexity microphone arrays", In ICSLP-1996, 1333-1336.