Speech Prosody 2006
In a study of optical cues to the visual perception of stress, three American English talkers spoke words that differed in lexical stress and sentences that differed in phrasal stress, while video and movements of the face were recorded. In a production analysis, stressed vs. unstressed syllables from these utterances were compared along many measures of facial movement, which were generally larger and faster under stress. In a visual perception experiment, 16 perceivers identified the location of stress in forced-choice judgments of video clips of these utterances (without audio). Phrasal stress (54% correct vs. 25% chance) was better-perceived than lexical stress (62% correct vs. 50% chance). The relation of the visual intelligibility of the prosody of these utterances to the optical characteristics of their production is discussed, with analysis of which cues are associated with successful visual perception.
Bibliographic reference. Scarborough, Rebecca / Keating, Patricia / Baroni, Marco / Cho, Taehong / Mattys, Sven / Alwan, Abeer / Auer Jr, Edward / Bernstein, Lynne E. (2006): "Optical cues to the visual perception of lexical and phrasal stress in English", In SP-2006, paper 059.