International Workshop on Spoken Language Translation (IWSLT) 2011

San Francisco, CA, USA
December 8-9, 2011

Long-Distance Hierarchical Structure Transformation Rules Utilizing Function Words

Chenchen Ding, Takashi Inui, Mikio Yamamoto

Department of Computer Science, University of Tsukuba, Japan

In this paper, we propose structure transformation rules for statistical machine translation that are lexicalized only by function words. Although such rules can be extracted from an aligned parallel corpus simply as original phrase pairs, their structure is hierarchical, so they can be used in a hierarchical translation system. In addition, structure transformation rules can take long-distance reordering into account, allowing more than two phrases to be moved simultaneously. The rule set serves as the core module of our hierarchical model, together with two other modules: a basic reordering module and an optional gap phrase module. Our model is considerably more compact than the original hierarchical phrase-based model and produces slightly higher BLEU scores in Japanese-English translation on the parallel corpus of the NTCIR-7 patent translation task.


Bibliographic reference. Ding, Chenchen / Inui, Takashi / Yamamoto, Mikio (2011): "Long-distance hierarchical structure transformation rules utilizing function words", In IWSLT-2011, 159-166.