5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997

Integrating Acoustic and Labial Information for Speaker Identification and Verification

Pierre Jourlin (1,2), Juergen Luettin (1), Dominique Genoud (1), Hubert Wassner (1)

(1) IDIAP, Martigny, Switzerland (2) LIA, Avignon Cedex 9, France

This paper describes a multimodal approach to speaker verification. The system consists of two classifiers, one using visual features and the other using acoustic features. A lip tracker extracts visual information from the speaking face, providing shape and intensity features. We describe an approach for normalizing the different modalities and mapping them onto a common confidence interval. We also describe a novel method for integrating the scores of multiple classifiers. Verification experiments are reported for the individual modalities and for the combined classifier. The integrated system outperformed each sub-system and reduced the false acceptance rate of the acoustic sub-system from 2.3% to 0.5%.
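
The abstract does not specify the normalization or the integration rule, so the following Python sketch only illustrates the general idea of mapping two classifier scores onto a common [0, 1] confidence interval and combining them. The linear mapping, the weighted sum, the function names and every numeric constant are assumptions made for illustration; they are not the method or the values used in the paper.

```python
# Illustrative sketch only. The linear mapping, the weighted-sum fusion and
# all constants below are assumptions, not the method described in the paper.

def to_confidence(score, impostor_mean, client_mean):
    """Map a raw classifier score linearly onto [0, 1].

    A score equal to impostor_mean maps to 0, a score equal to client_mean
    maps to 1, and anything outside that range is clipped.
    """
    span = client_mean - impostor_mean
    if span == 0:
        raise ValueError("client_mean and impostor_mean must differ")
    scaled = (score - impostor_mean) / span
    return min(1.0, max(0.0, scaled))


def fuse(acoustic_conf, labial_conf, acoustic_weight=0.7):
    """Weighted sum of two confidences that already lie in [0, 1]."""
    if not 0.0 <= acoustic_weight <= 1.0:
        raise ValueError("acoustic_weight must lie in [0, 1]")
    return acoustic_weight * acoustic_conf + (1.0 - acoustic_weight) * labial_conf


def verify(acoustic_score, labial_score, threshold=0.5):
    """Accept the identity claim if the fused confidence reaches the threshold."""
    # Hypothetical score statistics, as if estimated on held-out data.
    a = to_confidence(acoustic_score, impostor_mean=-60.0, client_mean=-40.0)
    v = to_confidence(labial_score, impostor_mean=-15.0, client_mean=-5.0)
    return fuse(a, v) >= threshold
```

With these made-up statistics, `verify(-45.0, -8.0)` accepts the claim and `verify(-58.0, -14.0)` rejects it. In practice the weight and threshold would be tuned on validation data to trade off false acceptance against false rejection.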

Bibliographic reference.  Jourlin, Pierre / Luettin, Juergen / Genoud, Dominique / Wassner, Hubert (1997): "Integrating acoustic and labial information for speaker identification and verification", In EUROSPEECH-1997, 1603-1606.