Ayuda
Ir al contenido

Dialnet


The influence of corpora on lexicons: corpora use in the creation of COMLEX syntax and NOMLEX

  • Autores: Catherine MacLeod, Ralph Grishman
  • Localización: Proceedings of the Ninth EURALEX International Congress, EURALEX 2000: Stuttgart, Germany, August 8th - 12th, 2000 / Ulrich Heid (ed. lit.), Stefan Evert (ed. lit.), Egbert Lehmann (ed. lit.), Christian Rohrer (ed. lit.), 2000, págs. 141-148
  • Idioma: inglés
  • Enlaces
  • Resumen
    • It is now generally accepted that a text corpus plays an important role in the production of hard-copy dictionaries. In this paper, we discuss the influence a corpus can have on the creation of lexical resources for computer use. In the creation of COMLEX Syntax and NOMLEX, two on-line lexicons produced by the authors at New York University, we used two different corpora, one composed of a small (one million words) balanced corpus (the Brown Corpus) plus a large amount of newspaper data and the other, a large balanced corpus (100 million words) of British English (the British National Corpus). We point out how the use of these two corpora affected the resulting lexicons in different ways and to differing degrees and we suggest what we feel would have been the ideal corpus for our purposes.


Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus

Opciones de compartir

Opciones de entorno