EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


A Robust Speaker Verification System against Imposture Using an HMM-based Speech Synthesis System

Takayuki Satoh (1), Takashi Masuko (1), Takao Kobayashi (1), Keiichi Tokuda (2)

(1) Tokyo Institute of Technology, Japan
(2) Nagoya Institute of Technology, Japan

This paper describes a text-prompted speaker verification system that is robust to imposture using synthetic speech generated by an HMM-based speech synthesis system. In the verification system, the text and the speaker are verified separately: text verification is based on phoneme recognition using HMMs, and speaker verification is based on GMMs. To discriminate synthetic speech from natural speech, the average inter-frame difference of the log likelihood is calculated, and the input speech is judged to be synthetic when this value is smaller than a decision threshold. Experimental results show that the false acceptance rate for synthetic speech was reduced drastically without a significant increase in the false acceptance and false rejection rates for natural speech.
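
The synthetic-speech test described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that a per-frame log likelihood sequence is already available (for example, from scoring each frame against a speaker model), that the inter-frame difference is taken as an absolute value, and that the threshold is chosen by the user. The abstract specifies none of these details, and the function names and numbers below are hypothetical.

```python
from typing import Sequence


def mean_interframe_difference(frame_log_likelihoods: Sequence[float]) -> float:
    """Average absolute change in log likelihood between consecutive frames.

    Assumption: the absolute difference is used; the abstract only says
    "inter-frame difference of the log likelihood".
    """
    if len(frame_log_likelihoods) < 2:
        raise ValueError("at least two frames are required")
    diffs = [
        abs(current - previous)
        for previous, current in zip(frame_log_likelihoods, frame_log_likelihoods[1:])
    ]
    return sum(diffs) / len(diffs)


def is_synthetic(frame_log_likelihoods: Sequence[float], threshold: float) -> bool:
    """Judge the input as synthetic when the average difference is below the threshold."""
    return mean_interframe_difference(frame_log_likelihoods) < threshold


# Hypothetical per-frame log likelihoods, made up purely for illustration.
smooth_scores = [-50.0, -50.5, -50.0, -50.5]   # small frame-to-frame variation
varied_scores = [-50.0, -54.0, -49.0, -55.0]   # larger frame-to-frame variation
THRESHOLD = 2.0                                # hypothetical decision threshold

print(is_synthetic(smooth_scores, THRESHOLD))  # True
print(is_synthetic(varied_scores, THRESHOLD))  # False
```

In this sketch a low average difference flags the input as synthetic, matching the decision rule stated in the abstract; how the threshold is tuned in the actual system is not described here.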


Bibliographic reference. Satoh, Takayuki / Masuko, Takashi / Kobayashi, Takao / Tokuda, Keiichi (2001): "A robust speaker verification system against imposture using an HMM-based speech synthesis system", in EUROSPEECH-2001, 759-762.