First Workshop on Speech, Language and Audio in Multimedia (SLAM 2013)

Marseille, France
August 22-23, 2013

Named Entity Recognition in Speech Transcripts following an Extended Taxonomy

Mohamed Hatmi (1), Christine Jacquin (1), Emmanuel Morin (1), Sylvain Meignier (2)

(1) LINA, University of Nantes, France; (2) LIUM, University of Le Mans, France

In this paper, we present a French named entity recognition (NER) system that was first developed as part of our participation in the ETAPE 2012 evaluation campaign and then extended to cover more entity types. The ETAPE 2012 evaluation campaign considers an hierarchical and compositional taxonomy that makes the NER task more complex. We present a multi-level methodology based on conditional random fields (CRFs). With respect to existing systems, our methodology allows a fine-grained annotation. Experiments were conducted using the manually annotated training and evaluation corpora provided by the organizers of the campaign. The obtained results are presented and discussed.

Index Terms: Named Entity Recognition, Structured Named Entities, CRF model.

Full Paper

Bibliographic reference.  Hatmi, Mohamed / Jacquin, Christine / Morin, Emmanuel / Meignier, Sylvain (2013): "Named entity recognition in speech transcripts following an extended taxonomy", In SLAM-2013, 61-65.