Ir al contenido


Learning Lessons from Bilingual Corpora: Benefits for Machine Translation

  • Autores: Oliver Streiter, Irina Sagalova, Leonid Iomdin
  • Localización: International journal of corpus linguistics, ISSN 1384-6655, Vol. 5, Nº 2, 2000, págs. 199-230
  • Idioma: inglés
  • Resumen
    • The research described in this paper is rooted in the endeavors to combine the advantages of corpus-based and rule-based MT approaches in order to improve the performance of MT systems-most importantly, the quality of translation. The authors review the ongoing activities in the field and present a case study, which shows how translation knowledge can be drawn from parallel corpora and compiled into the lexicon of a rule-based MT system. These data are obtained with the help of three procedures: (1) identification of hence unknown one-word translations, (2) statistical rating of the known one-word translations, and (3) extraction of new translations of multiword expressions (MWEs) followed by compilation steps which create new rules for the MT engine. As a result, the lexicon is enriched with translation equivalents attested for different subject domains, which facilitates the tuning of the MT system to a specific subject domain and improves the quality and adequacy of translation.

Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus

Opciones de compartir

Opciones de entorno