ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing

ICC Jeju, Korea
October 3, 2004

Auditory Segmentation Based on Event Detection

Guoning Hu (1), DeLiang Wang (2)

(1) Biophysics Program; (2) Department of Computer Science and Engineering & Center for Cognitive Science, The Ohio State University, Columbus, OH, USA

Acoustic signals from different sources in a natural environment form an auditory scene. Auditory scene analysis (ASA) is the process in which the auditory system segregates an auditory scene into streams corresponding to different sources. Segmentation is an important stage of ASA where an auditory scene is decomposed into segments, each of which contains signal mainly from one source. We propose a system for auditory segmentation based on analyzing onsets and offsets of auditory events. Our system first detects onsets and offsets, and then generates segments by matching corresponding onsets and offsets. This is achieved through a multiscale approach based on scale-space theory. Systematic evaluation shows that much target speech, including unvoiced speech, is correctly segmented, and target speech and interference are well separated into different segments.

Full Paper

Bibliographic reference.  Hu, Guoning / , DeLiang Wang (2004): "Auditory segmentation based on event detection", In SAPA-2004, paper 62.