Sciweavers

874 search results - page 2 / 175
» Evaluation Methods for Focused Crawling
CN
1999
13 years 4 months ago
Focused Crawling: A New Approach to Topic-Specific Web Resource Discovery
The rapid growth of the World-Wide Web poses unprecedented scaling challenges for general-purpose crawlers and search engines. In this paper we describe a new hypertext resource d...
Soumen Chakrabarti, Martin van den Berg, Byron Dom
IR
2008
13 years 4 months ago
Focused web crawling in the acquisition of comparable corpora
CLIR resources, such as dictionaries and parallel corpora, are scarce for special domains. Obtaining comparable corpora automatically for such domains could be an answer to this p...
Tuomas Talvensaari, Ari Pirkola, Kalervo Järv...
SAC
2003
ACM
13 years 10 months ago
Ontology-Focused Crawling of Web Documents
The Web, the largest unstructured database of the world, has greatly improved access to documents. However, documents on the Web are largely disorganized. Due to the distributed n...
Marc Ehrig, Alexander Maedche
WWW
2005
ACM
14 years 5 months ago
Focused crawling by exploiting anchor text using decision tree
Focused crawlers are considered a promising way to tackle the scalability problem of topic-oriented or personalized search engines. To design a focused crawler, the choice of s...
Jun Li, Kazutaka Furuse, Kazunori Yamaguchi
WIDM
2004
ACM
13 years 10 months ago
Probabilistic models for focused web crawling
A focused crawler must use information gleaned from previously crawled page sequences to estimate the relevance of a newly seen URL. Therefore, good performance depends on powerfu...
Hongyu Liu, Evangelos E. Milios, Jeannette Janssen