5th European Conference on Speech Communication and Technology

Rhodes, Greece
September 22-25, 1997


Claude Montaciť, Marie-Josť Caraty

LIP6 - Universite Pierre et Marie Curie - CNRS, Paris Cedex 5, France

We present in this paper preliminary results using speaker recognition and speech recognition techniques, designed at LIP6, to index audio data of video movies. The assumption that only one person is speaking at the same time is made. In a first approach, we work on dialogue unsupervised indexing using speaker recognition techniques. For this purpose, we develop Silence/Noise/Music/Speech detection algorithms in order to cut audio data in segments that we hope to be homogeneous in terms of speaker appartenance. In a second approach, we develop a supervised audio data indexing method knowing the movie script.

Full Paper

Bibliographic reference.  Montaciť, Claude / Caraty, Marie-Josť (1997): "SOUND CHANNEL VIDEO INDEXING", In EUROSPEECH-1997, 2359-2362.