Auditory-Visual Speech Processing (AVSP) 2011
Spoken face to face interaction is a rich and complex form of communication that includes a wide array of phenomena that are not fully explored or understood. While there has been extensive studies on many aspects in face-to-face interaction, these are traditionally of a qualitative nature, relying on hand annotated corpora, typically rather limited in extent, which is a natural consequence of the labour intensive task of multimodal data annotation. In this paper we present a corpus of 60 hours of unrestricted Swedish face-to-face conversations recorded with audio, video and optical motion capture, and we describe a new project setting out to exploit primarily the kinetic data in this corpus in order to gain quantitative knowledge on human face-to-face interaction.
Index Terms. motion capture, face-to-face conversation, multimodal corpus.
Bibliographic reference. Beskow, Jonas / Alexandersson, Simon / Al Moubayed, Samer / Edlund, Jens / House, David (2011): "Kinetic data for large-scale analysis and modeling of face-to-face conversation", In AVSP-2011, 107-110.