International Workshop on Spoken Language Translation (IWSLT) 2011
San Francisco, CA, USA
This paper reports on the participation of FBK at the IWSLT 2011 Evaluation: namely in the English ASR track, the Arabic-English MT track and the English-French MT and SLT tracks. Our ASR system features acoustic models trained on a portion of the TED talk recordings that was automatically selected according to the fidelity of the provided transcriptions. Three decoding steps are performed interleaved by acoustic feature normalization and acoustic model adaptation. Concerning the MT and SLT systems, besides language specific pre-processing and the automatic introduction of punctuation in the ASR output, two major improvements are reported over our last year baselines. First, we applied a fill-up method for phrase-table adaptation; second, we explored the use of hybrid class-based language models to better capture the language style of public speeches.
Bibliographic reference. Ruiz, Nick / Bisazza, Arianna / Brugnara, F. / Falavigna, D. / Giuliani, D. / Jaber, S. / Gretter, R. / Federico, Marcello (2011): "FBK @ IWSLT 2011", In IWSLT-2011, 86-93.