Speech Prosody 2010
Chicago, IL, USA
This article reports the process of building a bilingual (Spanish-Catalan) text corpus balanced in parallel taking into account prosodic features for both languages. We propose an expert guideline for text manipulation that in combination with greedy algorithms significantly improves the quality of the selected corpus. The application of this methodology to a radio news corpus empirically supports the proposed strategy.
Bibliographic reference. Escudero-Mancebo, David / González-Ferreras, C. / Garrido Almiñana, Juan María / Rodero, E. / Aguilar, Lourdes / Bonafonte, Antonio (2010): "Combining greedy algorithms with expert guided manipulation for the definition of a balanced prosodic Spanish-catalan radio news corpus", In SP-2010, paper 061.