13th Annual Conference of the International Speech Communication Association

Portland, OR, USA
September 9-13, 2012

Integrating Intra-Speaker Topic Modeling and Temporal-Based Inter-Speaker Topic Modeling in Random Walk for Improved Multi-Party Meeting Summarization

Yun-Nung Chen, Florian Metze

Language Technologies Institute, School of Computer Science, Carnegie Mellon University, Pittsburgh, PA, USA

This paper proposes an improved approach to summarizing spoken multi-party interaction, in which intra-speaker and inter-speaker topics are modeled in a graph constructed from topical relations. Each utterance is represented as a node of the graph, and the edge between two nodes is weighted by the topical similarity between the two utterances, evaluated by probabilistic latent semantic analysis (PLSA). We model intra-speaker topics by sharing topics among utterances from the same speaker, and inter-speaker topics by partially sharing topics between adjacent utterances based on temporal information. We conducted experiments on both ASR and manual transcripts. For both types of transcript, the experiments showed that combining intra-speaker and inter-speaker topic modeling helps include important utterances and thereby improves summarization.
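The graph construction and random walk summarized above can be illustrated with a small sketch. This is not the authors' implementation. It assumes each utterance already has a topic distribution (for example from a trained PLSA model), uses cosine similarity as the topical similarity, and stands in for intra-speaker and inter-speaker topic sharing with simple multiplicative boosts. The names `build_transition` and `random_walk` and the parameters `intra_boost`, `inter_boost`, `window`, and `alpha` are all hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two topic vectors (assumed similarity measure)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def build_transition(topics, speakers, intra_boost=1.0, inter_boost=0.5, window=1):
    """Row-normalized transition matrix over utterance nodes.

    Assumption: same-speaker edges are boosted (intra-speaker sharing), and
    edges between temporally adjacent utterances of different speakers get a
    smaller boost (partial inter-speaker sharing).
    """
    n = len(topics)
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            w = cosine(topics[i], topics[j])
            if speakers[i] == speakers[j]:
                w *= 1.0 + intra_boost
            elif abs(i - j) <= window:
                w *= 1.0 + inter_boost
            weights[i][j] = w
    trans = []
    for row in weights:
        total = sum(row)
        # Fall back to a uniform row if a node has no outgoing weight.
        trans.append([w / total for w in row] if total > 0 else [1.0 / n] * n)
    return trans


def random_walk(trans, prior, alpha=0.9, iters=100, tol=1e-9):
    """Interpolate a prior importance score with scores propagated over the graph."""
    n = len(prior)
    z = sum(prior)
    prior = [p / z for p in prior]
    scores = prior[:]
    for _ in range(iters):
        new = [
            (1 - alpha) * prior[j]
            + alpha * sum(scores[i] * trans[i][j] for i in range(n))
            for j in range(n)
        ]
        delta = sum(abs(a - b) for a, b in zip(new, scores))
        scores = new
        if delta < tol:
            break
    return scores


# Toy meeting: four utterances over two latent topics, two speakers.
topics = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.85, 0.15]]
speakers = ["A", "B", "A", "A"]
scores = random_walk(build_transition(topics, speakers), prior=[1, 1, 1, 1])
ranking = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
```

In this toy example the third utterance is topically unlike the rest, so it receives the lowest score. A summary would then be formed by taking utterances from the top of `ranking` until a length budget is reached.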

Index Terms: summarization, multi-party meeting, topic model, probabilistic latent semantic analysis (PLSA), topic transition, temporal information, random walk

Bibliographic reference. Chen, Yun-Nung / Metze, Florian (2012): "Integrating intra-speaker topic modeling and temporal-based inter-speaker topic modeling in random walk for improved multi-party meeting summarization", in INTERSPEECH-2012, 2346-2349.