Sciweavers

171 search results - page 1 / 35
» Focused crawling: experiences in a real world project
Sort
View
WWW
2006
ACM
14 years 6 months ago
Focused crawling: experiences in a real world project
Antonio Badia, Tulay Muezzinoglu, Olfa Nasraoui
CN
1999
242views more  CN 1999»
13 years 5 months ago
Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery
The rapid growth of the World-Wide Web poses unprecedented scaling challenges for general-purpose crawlers and search engines. In this paper we describe a new hypertext resource d...
Soumen Chakrabarti, Martin van den Berg, Byron Dom
WWW
2005
ACM
14 years 6 months ago
Focused crawling by exploiting anchor text using decision tree
Focused crawlers are considered as a promising way to tackle the scalability problem of topic-oriented or personalized search engines. To design a focused crawler, the choice of s...
Jun Li, Kazutaka Furuse, Kazunori Yamaguchi
ICWSM
2010
13 years 3 months ago
Coping With Noise in a Real-World Weblog Crawler and Retrieval System
In this paper we examine the effects of noise when creating a real-world weblog corpus for information retrieval. We focus on the DiffPost (Lee et al. 2008) approach to noise remo...
James Lanagan, Paul Ferguson, Neil O'Hare, Alan F....
IAT
2009
IEEE
14 years 3 days ago
Intelligent Crawling in Virtual Worlds
—We present an intelligent agent crawler designed to collect user-generated content in Second Life and related virtual worlds. The agents navigate autonomously through the world ...
Josh Eno, Susan Gauch, Craig W. Thompson