5th International Conference on Spoken Language Processing

Sydney, Australia
November 30 - December 4, 1998

A Silence/Noise/Music/Speech Splitting Algorithm

Claude Montacié, Marie-José Caraty

Laboratoire d'Informatique de Paris 6, France

In this paper, we present techniques to warp (align) the audio track of a film onto its script. To improve this script warping, we developed a new algorithm that splits audio data into silence, noise, music and speech segments without a training step. The segmentation combines several techniques: voiced/unvoiced segmentation, pitch detection, pitch tracking, and speaker and speech recognition. The 102.47 minutes of the film "Contes de Printemps", produced by E. Rohmer, were indexed with these techniques, with an average shift of less than one second between the script time codes and the audio data.
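To make two of the building blocks named in the abstract more concrete, the following is a minimal sketch of frame-level pitch detection and a voiced/unvoiced decision based on a normalized autocorrelation peak. It is not the authors' algorithm, which the abstract does not describe in detail. The function names (`frame_energy`, `estimate_pitch`, `classify_frame`), the 60-400 Hz search range, and both thresholds are assumptions chosen for illustration, and the sketch does not attempt to separate noise from music.

```python
import math


def frame_energy(frame):
    """Mean squared amplitude of one frame of samples."""
    return sum(x * x for x in frame) / len(frame)


def estimate_pitch(frame, sample_rate, fmin=60.0, fmax=400.0):
    """Return (pitch_hz, strength) from the autocorrelation peak.

    strength is the autocorrelation at the best lag divided by the
    frame energy; values near 1 indicate a strongly periodic frame.
    The fmin/fmax search range is an illustrative assumption.
    """
    min_lag = int(sample_rate / fmax)
    max_lag = min(int(sample_rate / fmin), len(frame) - 1)
    energy = sum(x * x for x in frame)
    if energy == 0.0 or max_lag < min_lag:
        return 0.0, 0.0
    best_lag, best_corr = 0, 0.0
    for lag in range(min_lag, max_lag + 1):
        corr = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        corr /= energy
        if corr > best_corr:
            best_lag, best_corr = lag, corr
    if best_lag == 0:
        return 0.0, 0.0
    return sample_rate / best_lag, best_corr


def classify_frame(frame, sample_rate, energy_floor=1e-4, voicing_floor=0.5):
    """Label a frame as 'silence', 'voiced' or 'unvoiced'.

    Both thresholds are hypothetical values, not taken from the paper.
    """
    if frame_energy(frame) < energy_floor:
        return "silence"
    _, strength = estimate_pitch(frame, sample_rate)
    return "voiced" if strength >= voicing_floor else "unvoiced"


if __name__ == "__main__":
    rate = 8000
    # A 50 ms synthetic tone at 200 Hz stands in for a voiced frame.
    tone = [math.sin(2 * math.pi * 200 * n / rate) for n in range(400)]
    print(estimate_pitch(tone, rate))
    print(classify_frame(tone, rate))
    print(classify_frame([0.0] * 400, rate))
```

In a fuller pipeline, such per-frame labels and pitch values would typically be smoothed over time (pitch tracking) before any higher-level decision about speech versus music is attempted.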

Bibliographic reference. Montacié, Claude / Caraty, Marie-José (1998): "A silence/noise/music/speech splitting algorithm", In ICSLP-1998, paper 1141.