12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

Factor Analysis Back Ends for MLLR Transforms in Speaker Recognition

Nicolas Scheffer, Yun Lei, Luciana Ferrer

SRI International, USA

The purpose of this work is to show how recent developments in cepstral-based systems for speaker recognition can be leveraged for Maximum Likelihood Linear Regression (MLLR) transforms. Speaker recognition systems based on MLLR transforms have been shown to be greatly beneficial in combination with standard systems, but most of the advances in speaker modeling techniques have been implemented for cepstral features. We show how these advances, which are based on Factor Analysis and include eigenchannel and i-vector modeling, can be easily employed to achieve very high accuracy. We show that they outperform the current state-of-the-art MLLR-SVM system that SRI submitted to the NIST SRE 2010 evaluation. The advantages of leveraging the new approaches are manifold: the ability to process a large amount of data, working in a reduced-dimensional space, importing any advances made for cepstral systems to the MLLR features, and the potential for system combination at the i-vector level.

Bibliographic reference. Scheffer, Nicolas / Lei, Yun / Ferrer, Luciana (2011): "Factor analysis back ends for MLLR transforms in speaker recognition", in INTERSPEECH-2011, 257-260.