ISCA Archive Interspeech 2013
ISCA Archive Interspeech 2013

Improving the PLDA based speaker verification in limited microphone data conditions

A. Kanagasundaram, D. Dean, Javier Gonzalez-Dominguez, S. Sridharan, D. Ramos, Joaquin Gonzalez-Rodriguez

A significant amount of speech data is required to develop a robust speaker verification system, but it is difficult to find enough development speech to match all expected conditions. In this paper we introduce a new approach to Gaussian probabilistic linear discriminant analysis (GPLDA) to estimate reliable model parameters as a linearly weighted model taking more input from the large volume of available telephone data and smaller proportional input from limited microphone data. In comparison to a traditional pooled training approach, where the GPLDA model is trained over both telephone and microphone speech, this linear-weighted GPLDA approach is shown to provide better EER and DCF performance in microphone and mixed conditions in both the NIST 2008 and NIST 2010 evaluation corpora. Based upon these results, we believe that linear-weighted GPLDA will provide a better approach than pooled GPLDA, allowing for the further improvement of GPLDA speaker verification in conditions with limited development data.

doi: 10.21437/Interspeech.2013-689

Cite as: Kanagasundaram, A., Dean, D., Gonzalez-Dominguez, J., Sridharan, S., Ramos, D., Gonzalez-Rodriguez, J. (2013) Improving the PLDA based speaker verification in limited microphone data conditions. Proc. Interspeech 2013, 3674-3678, doi: 10.21437/Interspeech.2013-689

  author={A. Kanagasundaram and D. Dean and Javier Gonzalez-Dominguez and S. Sridharan and D. Ramos and Joaquin Gonzalez-Rodriguez},
  title={{Improving the PLDA based speaker verification in limited microphone data conditions}},
  booktitle={Proc. Interspeech 2013},