Ayuda
Ir al contenido

Dialnet


Automatic classification of multi-word expressions in print dictionaries

  • Autores: Alexander Geyken, Jordan Boyd-Graber
  • Localización: Linguisticae investigationes: Revue internationale de linguistique française et de linguistique générale, ISSN 0378-4169, Tome 26, Fascicule 2, 2003, págs. 187-202
  • Idioma: inglés
  • Texto completo no disponible (Saber más ...)
  • Resumen
    • Summary This work demonstrates the assignment of multi-word expressions in print dictionaries to POS classes with minimal linguistic resources. In this application, 32,000 entries from the Wörterbuch der deutschen Idiomatik (H. Schemann 1993) were classified using an inductive description of POS sequences in conjunction with a Brill Tagger trained on manually tagged idiomatic entries. This process assigned categories to 86% of entries with 88% accuracy. This classification supplies a meaningful preprocessing step for further applications: the resulting POS-sequences for all idiomatic entries might be used for the automatic recognition of multi-word lexemes in unrestricted text.


Fundación Dialnet

Dialnet Plus

  • Más información sobre Dialnet Plus

Opciones de compartir

Opciones de entorno