International Workshop on Spoken Language Translation (IWSLT) 2010
This paper describes NICT's participation in the IWSLT
2010 evaluation campaign for the DIALOG translation
(Chinese-English) and the BTEC (French-English) translation
For the DIALOG translation, the main challenge to this task is applying context information during translation. Context information can be used to decide on word choice and also to replace missing information during translation. We applied discriminative reranking using contextual information as additional features. In order to provide more choices for re-ranking, we generated n-best lists from multiple phrase-based statistical machine translation systems that varied in the type of Chinese word segmentation schemes used. We also built a model that merged the phrase tables generated by the different segmentation schemes. Furthermore, we used a lattice-based system combination model to combine the output from different systems. A combination of all of these systems was used to produce the n-best lists for re-ranking.
For the BTEC task, a general approach that used latticebased system combination of two systems, a standard phrasebased system and a hierarchical phrase-based system, was taken. We also tried to process some unknown words by replacing them with the same words but different inflections that are known to the system.
Bibliographic reference. Goh, Chooi-Ling / Watanabe, Taro / Paul, Michael / Finch, Andrew / Sumita, Eiichiro (2010): "The NICT translation system for IWSLT 2010", In IWSLT-2010, 139-146.