4th International Conference on Spoken Language Processing

Philadelphia, PA, USA
October 3-6, 1996

Speaker Identification by Lipreading

Juergen Luettin (1,2), Neil A. Thacker (1), Steve W. Beet (1)

(1) Dept. of Electronic and Electrical Engineering, University of Sheffield, Sheffield, UK
(2) IDIAP, Martigny, Switzerland

This paper describes a new approach for speaker identification based on lipreading. Visual features are extracted from image sequences of the talking face and consist of shape parameters which describe the lip boundary and intensity parameters which describe the grey-level distribution of the mouth area. Intensity information is based on principal component analysis using eigenspaces which deform with the shape model. The extracted parameters account for both, speech dependent and speaker dependent information. We built spatio-temporal speaker models based on these features, using HMMs with mixtures of Gaussians. Promising results were obtained for text dependent and text independent speaker identification tests performed on a small video database.

Full Paper

Bibliographic reference.  Luettin, Juergen / Thacker, Neil A. / Beet, Steve W. (1996): "Speaker identification by lipreading", In ICSLP-1996, 62-65.