Ninth International Conference on Spoken Language Processing

Pittsburgh, PA, USA
September 17-21, 2006

A TextTiling Based Approach to Topic Boundary Detection in Meetings

Satanjeev Banerjee, Alexander I. Rudnicky

Carnegie Mellon University, USA

Our goal is to automatically detect boundaries between discussions of different topics in meetings. Towards this end we adapt the TextTiling algorithm [1] to the context of meetings. Our features include not only the overlapped words between adjacent windows, but also overlaps in the amount of speech contributed by each meeting participant. We evaluate our algorithm by comparing the automatically detected boundaries with the true ones, and computing precision, recall and f-measure. We report average precision of 0.85 and recall of 0.59 when segmenting unseen test meetings. Error analysis of our results shows that although the basic idea of our algorithm is sound, it breaks down when participants stray from typical behavior (such as when they monopolize the conversation for too long).

Full Paper

Bibliographic reference.  Banerjee, Satanjeev / Rudnicky, Alexander I. (2006): "A texttiling based approach to topic boundary detection in meetings", In INTERSPEECH-2006, paper 1827-Mon1A3O.3.