Cross-lingual keyword assignment

Please use this identifier to cite or link to this item: http://hdl.handle.net/10045/1820
Item information
Title: Cross-lingual keyword assignment
Authors: Steinberger, Ralf
Keywords: Controlled vocabulary | Keyword assignment | EUROVOC thesaurus | Multilingual
Issue Date: Sep-2001
Publisher: Sociedad Española para el Procesamiento del Lenguaje Natural
Citation: STEINBERGER, Ralf. “Cross-lingual keyword assignment”. Procesamiento del lenguaje natural. Nº 27 (sept. 2001), pp. 273-280
Abstract: This paper presents a language-independent approach to controlled vocabulary keyword assignment using the EUROVOC thesaurus. Due to the multilingual nature of EUROVOC, the keywords for a document written in one language can be displayed in all eleven official European Union languages. The mapping of documents written in different languages to the same multilingual thesaurus furthermore allows cross-language document comparison. The assignment of the controlled vocabulary thesaurus descriptors is achieved by applying a statistical method that uses a collection of manually indexed documents to identify, for each thesaurus descriptor, a large number of lemmas that are statistically associated with the descriptor. These associated words are then used during the assignment procedure to identify a ranked list of those EUROVOC terms that are most likely to be good keywords for a given document. The paper also describes the challenges of this task and discusses the results achieved with the fully functional prototype.
URI: http://hdl.handle.net/10045/1820
ISSN: 1135-5948
Language: eng
Type: info:eu-repo/semantics/article
Appears in Collections: Procesamiento del Lenguaje Natural - Nº 27 (septiembre 2001)

Files in This Item:
File: PLN_27_32.pdf
Size: 46,27 kB
Format: Adobe PDF


Items in RUA are protected by copyright, with all rights reserved, unless otherwise indicated.