12th Annual Conference of the International Speech Communication Association

Florence, Italy
August 27-31, 2011

Automatic Detection of Speaker Attributes Based on Utterance Text

Wen Wang, Andreas Kathol, Harry Bratt

SRI International, USA

In this paper, we present models for detecting various attributes of a speaker based on uttered text alone. These attributes include whether the speaker is speaking his or her native language, the speaker's age and gender, and the regional information reported by the speaker. We explore various lexical features as well as features inspired by Linguistic Inquiry and Word Count and the Dictionary of Affect in Language. Overall, the results suggest that even when audio data is not available, effective feature sets drawn only from the uttered text, together with system combination of multiple classification algorithms, allow us to build high-quality statistical models for detecting these speaker attributes, with performance comparable to that of systems that can exploit the audio data.


Bibliographic reference. Wang, Wen / Kathol, Andreas / Bratt, Harry (2011): "Automatic detection of speaker attributes based on utterance text", in INTERSPEECH-2011, 2361-2364.