International Workshop on Spoken Language Translation (IWSLT) 2010

Paris, France
December 2-3, 2010

Factor Templates for Factored Machine Translation Models

Yvette Graham, Josef van Genabith

National Centre for Language Technology, Dublin City University, Dublin, Ireland

In this paper, we present a method of avoiding the combinatorial explosion encountered in Factored Models during the construction of translation options caused by the large number of possible combinations of target language lemmas and morpho-syntactic factors. We automatically extract factor templates froma word-aligned annotated bilingual corpus and use them to distinguish which morpho-syntactic factors should be translated separately from lemmas and in doing so avoid the large number of translation options otherwise considered for generation. Besides Phrase-Based SMT, FactoredModels can be applied to SMT via deep syntactic transfer, which is the focus of our work. We therefore include an experimental evaluation of our method for a SMT via deep syntactic transfer system, comparing the baseline standard Factored Model with one that uses factor templates for translating morpho-syntactic factors, resulting in a large increase in BLEU score.

Full Paper

Bibliographic reference.  Graham, Yvette / Genabith, Josef van (2010): "Factor templates for factored machine translation models", In IWSLT-2010, 275-282.