Speech Prosody 2010

Chicago, IL, USA
May 10-14, 2010

Using Prosodic Features for Predicting Phrase Boundaries

Caroline Kaufhold, Elmar Nöth

Chair of Pattern Recognition, University of Erlangen-Nuremberg, Germany

Spoken input of address data in modern GPS units is typically done by filling one information slot after another. To fill-in multiple slots at once, the particular slot information contained in the input utterance has to be extracted. We employ phrase boundaries to separate the speech signal into certain slots. In our evaluation, several types of input utterances differing in the number of slot information and their order are thoroughly examined. For each type, a set of twenty strong prosodic features is trained. By incorporating supporting a-priori features, an Fmeasure value of 93:0% is reached for a typical use case.

Index Terms: prosody, phrase boundary detection, multislot input modality

Full Paper

Bibliographic reference.  Kaufhold, Caroline / Nöth, Elmar (2010): "Using prosodic features for predicting phrase boundaries", In SP-2010, paper 872.