Sciweavers

» User-centric Web crawling
ICDM
2008
IEEE
14 years 13 days ago
xCrawl: A High-Recall Crawling Method for Web Mining
Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...
Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...
COOPIS
2004
IEEE
13 years 9 months ago
Minimizing the Network Distance in Distributed Web Crawling
Distributed crawling has shown that it can overcome important limitations of the centralized crawling paradigm. However, the distributed nature of current distributed cra...
Odysseas Papapetrou, George Samaras
INTR
2002
13 years 5 months ago
Methodologies for crawler based Web surveys
There have been many attempts to study the content of the web, either through human or automatic agents. Five different previously used web survey methodologies are described and ...
Mike Thelwall
WWW
2006
ACM
14 years 6 months ago
Geographically focused collaborative crawling
A collaborative crawler is a group of crawling nodes, in which each crawling node is responsible for a specific portion of the web. We study the problem of collecting geographical...
Weizheng Gao, Hyun Chul Lee, Yingbo Miao
DMKD
2004
ACM
13 years 9 months ago
Discovery of ads web hosts through traffic data analysis
One of the most pressing problems in web crawling
V. Bacarella, Fosca Giannotti, Mirco Nanni, Dino P...