International Workshop on Spoken Language Translation (IWSLT) 2007
In this paper, we describe the system and approach used by
Institute for Infocomm Research (I2R) for the IWSLT 2007
spoken language evaluation campaign. A multi-pass approach
is exploited to generate and select best translation. First, we use
two decoders namely the open source Moses and an in-home
syntax-based decoder to generate N-best lists. Next we spawn
new translation entries through a word-based n-gram language
model estimated on the former N-best entries. Finally, we join
the N-best lists from the previous two passes, and select the
best translation by rescoring them with additional feature
In particular, this paper reports our effort on new translation entry generation and system combination. The performance on development and test sets are reported. The system was ranked first with respect to the BLEU measure in Chinese-to-English open data track.
Full Paper Presentation
Bibliographic reference. Chen, Boxing / Sun, Jun / Jiang, Hongfei / Zhang, Min / Aw, Ai Ti (2007): "I2r Chinese-English translation system for IWSLT 2007", In IWSLT-2007, 55-60.