EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


Speaker Recognition based on Idiolectal Differences between Speakers

George Doddington

National Institute of Standards and Technology, USA

"Familiar" speaker information is explored using non-acoustic features in NIST's new "extended data" speaker detection task. Word unigrams and bigrams, used in a traditional target/background likelihood ratio framework, are shown to give surprisingly good performance. Performance continues to improve with additional training and/or test data. Bigram performance is also found to be a function of target/model sex and age difference. These initial experiments strongly suggest that further exploration of "familiar" speaker characteristics is likely to be an interesting and valuable research direction for recognition of speakers in conversational speech.
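
As an illustration of the target/background likelihood ratio idea applied to word bigrams, the following is a minimal sketch and not the paper's exact scoring formula. The function names (train_counts, llr_score), the probability floor used for unseen bigrams, and the toy transcripts are all assumptions introduced here for illustration. Each test bigram contributes the log of its relative frequency under the target speaker's model divided by its relative frequency under a background model, and the contributions are averaged over the test segment.

```python
import math
from collections import Counter


def bigrams(words):
    """Return the list of adjacent word pairs in a token sequence."""
    return list(zip(words, words[1:]))


def train_counts(transcripts):
    """Accumulate bigram counts over a list of tokenized transcripts."""
    counts = Counter()
    for words in transcripts:
        counts.update(bigrams(words))
    return counts


def llr_score(test_words, target_counts, background_counts, floor=1e-6):
    """Average log likelihood ratio of the test bigrams.

    Probabilities are relative frequencies, floored at `floor` so that
    unseen bigrams do not produce log(0). The floor is an assumption of
    this sketch, not a value taken from the paper.
    """
    target_total = sum(target_counts.values())
    background_total = sum(background_counts.values())
    test = bigrams(test_words)
    if not test or target_total == 0 or background_total == 0:
        return 0.0
    score = 0.0
    for pair in test:
        p_target = max(target_counts[pair] / target_total, floor)
        p_background = max(background_counts[pair] / background_total, floor)
        score += math.log(p_target / p_background)
    return score / len(test)


# Toy data (hypothetical): the target speaker favors "you know" and "i mean".
target_model = train_counts([["you", "know", "i", "mean", "you", "know"]])
background_model = train_counts([
    ["well", "i", "think", "so"],
    ["you", "know", "i", "think", "so"],
])

score_match = llr_score(["you", "know", "i", "mean"], target_model, background_model)
score_mismatch = llr_score(["well", "i", "think", "so"], target_model, background_model)
```

In this toy setup the segment that reuses the target speaker's habitual phrases receives a positive score and the other segment a negative one. A detection decision would compare such a score against a threshold; how that threshold is chosen is outside the scope of this sketch.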

Bibliographic reference. Doddington, George (2001): "Speaker recognition based on idiolectal differences between speakers", in EUROSPEECH-2001, 2521-2524.