13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Study of the Effect of I-vector Modeling on Short and Mismatch Utterance Duration for Speaker Verification

Achintya Kumar Sarkar, Driss Matrouf, Pierre Michel Bousquet, Jean-François Bonastre

Université d'Avignon, LIA, Avignon, France

It is well known that state-of-the-art speaker verification systems based on the i-vector concept perform well when the target speakers' training and test utterances share a fixed condition, such as the long-long condition of the NIST evaluations. However, most real-time applications of speaker verification must cope with training and test speech segments of differing durations. A state-of-the-art speaker verification system also requires several statistical model parameters to be estimated. The aim of this paper is to explore how to train these statistical model parameters when the speakers' training and test data have mismatched durations. Experimental results are reported on NIST 2008 SRE for various durations of target training and test speech segments: 5 seconds, 10 seconds and full (5 minutes).

Index Terms: short segment, i-vector, length normalization, PLDA, speaker verification


Bibliographic reference.  Sarkar, Achintya Kumar / Matrouf, Driss / Bousquet, Pierre Michel / Bonastre, Jean-François (2012): "Study of the effect of i-vector modeling on short and mismatch utterance duration for speaker verification", In INTERSPEECH-2012, 2662-2665.