EUROSPEECH 2001 Scandinavia
7th European Conference on Speech Communication and Technology
2nd INTERSPEECH Event

Aalborg, Denmark
September 3-7, 2001

                 

Extractive Summarization of Voicemail using Lexical and Prosodic Feature Subset Selection

Konstantinos Koumpis, Steve Renals, Mahesan Niranjan

University of Sheffield, UK

This paper presents a novel data-driven approach to summarizing spoken audio transcripts utilizing lexical and prosodic features. The former are obtained from a speech recognizer and the latter are extracted automatically from speech waveforms. We employ a feature subset selection algorithm, based on ROC curves, which examines different combinations of features at different target operating conditions. The approach is evaluated on the IBM Voicemail corpus, demonstrating that it is possible and desirable to avoid complete commitment to a single best classifier or feature set.

Full Paper

Bibliographic reference.  Koumpis, Konstantinos / Renals, Steve / Niranjan, Mahesan (2001): "Extractive summarization of voicemail using lexical and prosodic feature subset selection", In EUROSPEECH-2001, 2377-2380.