Sciweavers

17 search results - page 1 / 4
» Evolving Strategies for Focused Web Crawling
ICML
2003
IEEE
15 years 10 months ago
Evolving Strategies for Focused Web Crawling
Judy Johnson, Kostas Tsioutsiouliklis, C. Lee Gile...
CIKM
2011
Springer
13 years 9 months ago
Focusing on novelty: a crawling strategy to build diverse language models
Word prediction performed by language models plays an important role in many tasks, such as word sense disambiguation, speech recognition, handwriting recognition, query spelling an...
Luciano Barbosa, Srinivas Bangalore
WWW
2006
ACM
15 years 10 months ago
Geographically focused collaborative crawling
A collaborative crawler is a group of crawling nodes, in which each crawling node is responsible for a specific portion of the web. We study the problem of collecting geographical...
Weizheng Gao, Hyun Chul Lee, Yingbo Miao
SAC
2003
ACM
15 years 2 months ago
Ontology-Focused Crawling of Web Documents
The Web, the largest unstructured database in the world, has greatly improved access to documents. However, documents on the Web are largely disorganized. Due to the distributed n...
Marc Ehrig, Alexander Maedche
WIDM
2004
ACM
15 years 3 months ago
Probabilistic models for focused web crawling
A focused crawler must use information gleaned from previously crawled page sequences to estimate the relevance of a newly seen URL. Therefore, good performance depends on powerfu...
Hongyu Liu, Evangelos E. Milios, Jeannette Janssen