Evaluating RBMT output for -ing formsA study of four tar-get languages

  1. Nora Aranberri Monasterio 1
  2. Sharon O'Brien 1
  1. 1 Dublin City University
    info

    Dublin City University

    Dublín, Irlanda

    ROR https://ror.org/04a1a1e81

Revista:
Linguistica Antverpiensia

ISSN: 0304-2294

Año de publicación: 2009

Número: 8

Páginas: 105-122

Tipo: Artículo

Otras publicaciones en: Linguistica Antverpiensia

Resumen

-ing forms in English are reported to be problematic for Machine Transla-tion and are often the focus of rules in Controlled Language rule sets. We investigated how problematic -ing forms are for an RBMT system, translat-ing into four target languages in the IT domain. Constituent-based human evaluation was used and the results showed that, in general, -ing forms do not deserve their bad reputation. A comparison with the results of five automated MT evaluation metrics showed promising correlations. Some issues prevail, however, and can vary from target language to target lan-guage. We propose different strategies for dealing with these problems, such as Controlled Language rules, semi-automatic post-editing, source text tagging and “post-editing” the source text.