13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Effect of Relevance Factor of Maximum a Posteriori Adaptation for GMM-SVM in Speaker and Language Recognition

Changhuai You, Haizhou Li, Bin Ma, Kong Aik Lee

Institute for Infocomm Research (I2R), A*STAR, Singapore

Gaussian mixture model - support vector machine (GMM-SVM) with nuisance attribute projection (NAP) has been found to be effective and reliable for speaker and language recognition. In maximum a posteriori (MAP) adaptation of GMM, the relevance factor is the parameter that regulates how much the adaptation data affect the base model, which impacts the final recognition performance. In our previous work, the data-dependent relevance factor and adaptive relevance factor have been introduced. In this paper, we provide insights into different types of relevance factor for MAP in the context of application as formulated under Speaker Recognition Evaluation (SRE) and Language Recognition Evaluation (LRE) by the National Institute of Standards and Technology (NIST).

Index Terms: maximum a posteriori, supervector, Gaussian mixture model, support vector machine

Full Paper

Bibliographic reference.  You, Changhuai / Li, Haizhou / Ma, Bin / Lee, Kong Aik (2012): "Effect of relevance factor of maximum a posteriori adaptation for GMM-SVM in speaker and language recognition", In INTERSPEECH-2012, 2065-2068.