ICAPR
2005
Springer

Combining Text and Link Analysis for Focused Crawling

The number of vertical search engines and portals has increased rapidly in recent years, making the importance of a topic-driven (focused) crawler evident. In this paper, we develop a latent semantic indexing classifier that combines link analysis with text content in order to retrieve and index domain-specific web documents. We compare its efficiency with that of other well-known web information retrieval techniques. Our implementation presents a different approach to focused crawling and aims to overcome the need to provide initial training data while maintaining a high recall/precision ratio.
George Almpanidis, Constantine Kotropoulos
Added: 27 Jun 2010
Updated: 27 Jun 2010
Type: Conference
Year: 2005
Where: ICAPR
Authors: George Almpanidis, Constantine Kotropoulos