EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology

Aalborg, Denmark
September 3-7, 2001


Viseme Recognition Using Multiple Feature Matching

I. Shdaifat (1), R. Grigat (1), Stefan Lütgert (2)

(1) TU Hamburg Harburg, Vision Systems, Germany
(2) Philips Semiconductors, Systems Laboratory Hamburg, Germany

In this paper, we present a technique for extracting the five main visemes produced in natural German speech. The method belongs to the LDA (Linear Discriminant Analysis) family. Intensity, edges, and line segments are used both to locate the lips automatically and to classify the visemes. Using several features for recognition increases the likelihood of correct recognition. The corners of the mouth are used to handle small changes in rotation and scale. An experiment was carried out with different people to understand which part of the speech human observers use. The participants grouped the phonemes into five different visemes. The number of distinguishable visemes is not the same for every speaker; each speaker expresses speech with a different set of visemes. A good recognition rate was achieved across different speakers.
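As a rough illustration of what an LDA-family classifier over several concatenated features might look like, the following minimal sketch fits a Fisher discriminant projection and assigns each sample to the nearest class mean in the projected space. It is not the authors' implementation: the abstract does not specify the LDA variant or the feature layout, so the nine-dimensional feature vector (assumed to stand for intensity, edge, and line-segment measurements), the synthetic data, and all function names are hypothetical.

```python
import numpy as np

N_CLASSES = 5    # five visemes, as stated in the abstract
N_FEATURES = 9   # hypothetical: 3 intensity + 3 edge + 3 line-segment values


def make_toy_data(n_per_class, rng):
    """Synthetic stand-in for concatenated lip feature vectors (not real data)."""
    centers = np.zeros((N_CLASSES, N_FEATURES))
    centers[np.arange(N_CLASSES), np.arange(N_CLASSES)] = 8.0
    y = np.repeat(np.arange(N_CLASSES), n_per_class)
    X = centers[y] + rng.normal(size=(y.size, N_FEATURES))
    return X, y


def fit_lda(X, y, n_components, reg=1e-6):
    """Fisher LDA: find directions that maximize between-class scatter
    relative to within-class scatter."""
    classes = np.unique(y)
    d = X.shape[1]
    mean_all = X.mean(axis=0)
    Sw = np.zeros((d, d))  # within-class scatter
    Sb = np.zeros((d, d))  # between-class scatter
    for c in classes:
        Xc = X[y == c]
        mc = Xc.mean(axis=0)
        Sw += (Xc - mc).T @ (Xc - mc)
        diff = (mc - mean_all)[:, None]
        Sb += len(Xc) * (diff @ diff.T)
    Sw += reg * np.eye(d)  # small ridge term keeps Sw invertible

    # Whiten with Sw^(-1/2), then solve a symmetric eigenproblem for Sb.
    w_vals, w_vecs = np.linalg.eigh(Sw)
    whiten = w_vecs @ np.diag(w_vals ** -0.5) @ w_vecs.T
    b_vals, b_vecs = np.linalg.eigh(whiten @ Sb @ whiten)
    top = np.argsort(b_vals)[::-1][:n_components]
    W = whiten @ b_vecs[:, top]

    # Class means in the projected space, used for nearest-mean classification.
    means = np.stack([(X[y == c] @ W).mean(axis=0) for c in classes])
    return W, classes, means


def predict_lda(model, X):
    """Assign each row of X to the class with the nearest projected mean."""
    W, classes, means = model
    Z = X @ W
    dists = np.linalg.norm(Z[:, None, :] - means[None, :, :], axis=2)
    return classes[np.argmin(dists, axis=1)]


rng = np.random.default_rng(0)
X_train, y_train = make_toy_data(40, rng)
X_test, y_test = make_toy_data(20, rng)

# With five classes, at most four discriminant directions are informative.
model = fit_lda(X_train, y_train, n_components=N_CLASSES - 1)
accuracy = float(np.mean(predict_lda(model, X_test) == y_test))
print(f"toy accuracy: {accuracy:.2f}")
```

The toy classes are deliberately well separated, so the accuracy printed here says nothing about the recognition rates reported in the paper; the sketch only shows how several feature groups can be stacked into one vector before the discriminant projection.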


Bibliographic reference. Shdaifat, I. / Grigat, R. / Lütgert, Stefan (2001): "Viseme recognition using multiple feature matching", in EUROSPEECH-2001, 2431-2434.