EUROSPEECH 2001 Scandinavia
We discuss the multi-speaker tasks of detection, tracking, and segmentation of speakers as included in recent NIST Speaker Recognition Evaluations. We consider how performance for the two-speaker detection task is related to that for the corresponding one-speaker task. We examine the effects of target speaker speech duration and the gender mix within test segments on results for these tasks. We also relate performance results for the tracking and segmentation tasks, and look at factors affecting segmentation performance.
Bibliographic reference. Martin, Alvin F. / Przybocki, Mark A. (2001): "Speaker recognition in a multi-speaker environment", In EUROSPEECH-2001, 787-790.