Combining text and link analysis for focused crawling¿An application for vertical search engines

Autores: G. Almpanidis, C. Kotropoulos, I. Pitas
Localización: Information systems, ISSN 0306-4379, Vol. 32, Nº 6, 2007, págs. 886-908
Idioma: inglés
Texto completo no disponible (Saber más ...)
Resumen
- The number of vertical search engines and portals has rapidly increased over the last years, making the importance of a topic-driven (focused) crawler self-evident. In this paper, we develop a latent semantic indexing classifier that combines link analysis with text content in order to retrieve and index domain-specific web documents. Our implementation presents a different approach to focused crawling and aims to overcome the limitations imposed by the need to provide initial data for training, while maintaining a high recall/precision ratio. We compare its efficiency with other well-known web information retrieval techniques

Acceso de usuarios registrados

¿Es nuevo? Regístrese

Coordinado por: