Odyssey 2012 - The Speaker and Language Recognition Workshop

June 25-28, 2012

A Global Optimization Framework For Speaker Diarization

Mickael Rouvier, Sylvain Meignier

LIUM, Université du Maine – Le Mans, France

In this paper, we propose a new clustering model for speaker diarization. A major problem with using greedy agglomerative hierarchical clustering for speaker diarization is that they do not guarantee an optimal solution. We propose a new clustering model, by redefining clustering as a problem of Integer Linear Programming (ILP). Thus an ILP solver can be used which searches the solution of speaker clustering over the whole problem. The experiments were conducted on the corpus of French broadcast news ESTER-2. With this new clustering, the DER decreases by 2.43 points.

Full Paper

Bibliographic reference.  Rouvier, Mickael / Meignier, Sylvain (2012): "A global optimization framework for speaker diarization", In Odyssey-2012, 146-150.