International Workshop on Spoken Language Translation (IWSLT) 2011
San Francisco, CA, USA
In this paper, we propose structure transformation rules for statistical machine translation that are lexicalized only by function words. Although such rules can be extracted from an aligned parallel corpus simply as original phrase pairs, their structure is hierarchical, so they can be used in a hierarchical translation system. In addition, structure transformation rules can take long-distance reordering into account, allowing more than two phrases to be moved simultaneously. The rule set serves as the core module of our hierarchical model, together with two other modules: a basic reordering module and an optional gap phrase module. Our model is considerably more compact than the original hierarchical phrase-based model and produces slightly higher BLEU scores in Japanese-English translation on the parallel corpus of the NTCIR-7 patent translation task.
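To make the idea of a rule lexicalized only by a function word concrete, the following is a minimal sketch and not the authors' implementation. It assumes a single hypothetical rule ("X1 の X2" becomes "X2 of X1"), a hypothetical toy lexicon, and word-by-word translation inside each gap; the names RULE, LEXICON, match, translate_span, and apply_rule are all illustrative and do not come from the paper.

```python
# Hypothetical toy rule: the only lexical items are function words
# ("の" on the source side, "of" on the target side); X1 and X2 are gaps.
RULE = {"source": ["X1", "の", "X2"], "target": ["X2", "of", "X1"]}

# Hypothetical toy lexicon used to translate the content words in the gaps.
LEXICON = {"機械": "machine", "翻訳": "translation", "精度": "accuracy"}


def match(pattern, tokens):
    """Bind the two gaps of a [gap, word, gap] pattern to non-empty spans."""
    left, word, right = pattern
    # The function word must have at least one token on each side.
    for i in range(1, len(tokens) - 1):
        if tokens[i] == word:
            return {left: tokens[:i], right: tokens[i + 1:]}
    return None


def translate_span(tokens):
    """Translate a span word by word (an assumption made for brevity)."""
    return " ".join(LEXICON.get(t, t) for t in tokens)


def apply_rule(rule, tokens):
    """Reorder the gap translations around the target-side function word."""
    bindings = match(rule["source"], tokens)
    if bindings is None:
        # The rule does not apply; fall back to monotone translation.
        return translate_span(tokens)
    out = []
    for sym in rule["target"]:
        out.append(translate_span(bindings[sym]) if sym in bindings else sym)
    return " ".join(out)


print(apply_rule(RULE, ["機械", "翻訳", "の", "精度"]))
# prints "accuracy of machine translation"
```

In this sketch the function word fixes where the source is split and how the two spans are reordered, while the content words stay inside the gaps and are handled separately, which is why a rule of this kind can cover many different phrase pairs.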
Bibliographic reference. Ding, Chenchen / Inui, Takashi / Yamamoto, Mikio (2011): "Long-distance hierarchical structure transformation rules utilizing function words", In IWSLT-2011, 159-166.