EUROSPEECH 2001 Scandinavia
This paper describes a text-prompted speaker verification system which is robust to imposture using synthetic speech generated by an HMM-based speech synthesis system. In the verification system, text and speaker are verified separately. Text verification is based on phoneme recognition using HMM, and speaker verification is based on GMM. To discriminate synthetic speech from natural speech, an average of inter-frame difference of the log likelihood is calculated, and input speech is judged to be synthetic when this value is smaller than a decision threshold. Experimental results show that the false acceptance rate for synthetic speech was reduced drastically without significant increase of the false acceptance and rejection rates for natural speech.
Bibliographic reference. Satoh, Takayuki / Masuko, Takashi / Kobayashi, Takao / Tokuda, Keiichi (2001): "A robust speaker verification system against imposture using an HMM-based speech synthesis system", In EUROSPEECH-2001, 759-762.