Multilingual acquisition of large scale knowledge resources

Montserrat Cuadros Oller

Ayuda

Multilingual acquisition of large scale knowledge resources

Autores: Montserrat Cuadros Oller
Directores de la Tesis: Lluís Padró Cirera (dir. tes.), Germán Rigau Claramunt (dir. tes.)
Lectura: En la Universitat Politècnica de Catalunya (UPC) ( España ) en 2011
Idioma: inglés
Tribunal Calificador de la Tesis: Horacio Rodríguez Hontoria (presid.), Irene Castellón Masalles (secret.), Roberto Navigli (voc.), Arantza Díaz de Ilarraza Sánchez (voc.), Piek Vossen (voc.)
Materias:
- Matemáticas
  - Ciencia de los ordenadores
    - Inteligencia artificial
    - Informática
Texto completo no disponible (Saber más ...)
Resumen
- Natural Language Processing (NLP) is a subfield of Artificial Intelligence (AI) that attempts to automatically process human language. Nowadays, NLP systems seem to have reached an upper-bound using existing resources and techniques. There is a broad consensus in the research community that systems need to integrate larger amounts of semantic and world knowledge in order to improve the quality of the current results.
  
  Nevertheless, building adequate semantic resources is a very difficult and an open research problem. Many efforts have been devoted to build knowledge repositories in the past decades, producing a wide range of knowledge bases, which offer different levels of granularity or approach different aspects of knowledge representation. Among them, Princeton WordNet[Fellbaum98] (WN) is by far the most widely-used semantic resource in the NLP area.
  
  The main goal of the research presented in this thesis is to devise new methods and tools to automatically create new semantic relations between WordNet senses. That is, to accurately increase by automatic means the knowledge represented in WordNet.
  
  The proposed process uses the current content of WordNet as the {\it minimal} knowledge base required to start a cycling acquisition approach. First, the process acquires from corpora relevant terms associated to each WordNet sense. Second, the identification stage uses the knowledge present in WordNet to establish the appropriate sense of each of these terms, obtaining as a result large amounts of new semantic relations among WordNet.
  
  In particular, our research focuses on devising new methods and tools for:
  
  * Acquiring relevant words from general or domain corpora for an specific WordNet word-sense.
  
  * Identifying the {\it implicit} word-senses of the acquired relevant words with respect to an {\it existing} knowledge base (in particular, WordNet).
  
  * Empirically evaluating the quality of the resulting {\it new} semantic relations in a controlled multilingual evaluation framework.
  
  Thus, our research goals cover the automatic acquisition, identification, integration, and evaluation of large amounts of semantic relations among WordNet senses captured from general or domain-specific corpora. In this way, the resulting knowledge net or KnowNet (KN), should be an extensible, large, accurate and useful knowledge base, derived automatically from text collections.
  
  Furthermore, being represented at a semantic level, we also expect that the new semantic knowledge acquired from text in one language can be of utility in other languages.

Acceso de usuarios registrados

¿Olvidó su contraseña?

¿Es nuevo? Regístrese

Ventajas de registrarse

Dialnet Plus

Opciones de compartir

Opciones de entorno

Sugerencia / Errata

Coordinado por: