International Workshop on Spoken Language Translation (IWSLT) 2010
This paper describes the MIRACL statistical Machine
Translation system and the improvements that were developed
during the IWSLT 2010 evaluation campaign. We participated
to the Arabic to English BTEC tasks using a phrase-based
statistical machine translation approach. In this paper, we first
discuss some challenges in translating from Arabic to English
and we explore various techniques to improve performances
on a such task.
Next, we present our solution for disambiguating the output of an Arabic morphological analyzer. In fact, The Arabic morphological analyzer used produces all possible morphological structures for each word, with an unique correct proposition. In this work we exploit the Arabic-English alignment to choose the correct segmented form and the correct morpho-syntactic features produced by our morphological analyzer.
Bibliographic reference. Khemakhem, Ines Turki / Jamoussi, Salma / Hamadou, Abdelmajid Ben (2010): "The MIRACL Arabic-English statistical machine translation system for IWSLT 2010", In IWSLT-2010, 119-125.