This paper uses a crowd-sourced definition of a speech phenomenon we have called "focus". Given sentences, text and speech, in isolation and in context, we asked annotators to identify what we term the "focus" word. We present their consistency in identifying the focused word, when presented with text or speech stimuli. We then build models to show how well we predict that focus word from lexical (and higher) level features. Also, using spectral and prosodic information, we show the differences in these focus words when spoken with and without context. Finally, we show how we can improve speech synthesis of these utterances given focus information.
Bibliographic reference. Hovy, Dirk / Anumanchipalli, Gopala Krishna / Parlikar, Alok / Vaughn, Caroline / Lammert, Adam / Hovy, Eduard / Black, Alan W. (2013): "Analysis and modeling of “focus” in context", In INTERSPEECH-2013, 402-406.