Corpus and exploitation tool: IULACT and bwanaNet

  • Authors: Jorge Vivaldi Palatresi
  • Source: A survey of corpus-based research [Electronic resource] / Pascual Cantos Gómez (ed.), Aquilino Sánchez Pérez (ed.), 2009, ISBN 978-84-692-2198-3, pp. 224-239
  • Language: English
  • Abstract
    • Over recent decades, corpus linguistics methods have found ever-increasing use in almost all linguistics-related studies, mainly because they are useful for obtaining and validating results.

      IULACT is a project of the Applied Linguistics Institute that aims to compile a corpus of LSP (language for specific purposes) texts. The corpus includes documents from a variety of domains, registers and languages. In contrast to resources captured directly from the web, the IULACT texts have been selected under supervision, converted to a clean SGML format and processed with a set of specific tools. The overall goal of the project is to provide not only an infrastructure for studying LSP but also a resource for computational linguistics research.

      This article describes the IULACT corpus, its architecture and its processing tools. It starts from the design criteria and analyzes each of the necessary processing stages and the related software tools. Finally, bwanaNet, the corpus browser tool, is described.