Search Sciweavers | Sciweavers

20

ICDM
2008
IEEE

186views Data Mining» more ICDM 2008»

xCrawl: A High-Recall Crawling Method for Web Mining

14 years 13 days ago

Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The ﬁrst step in the Information Extract...

Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...

claim paper

Read More »

22

click to vote

COOPIS
2004
IEEE

108views Information Technology» more COOPIS 2004»

Minimizing the Network Distance in Distributed Web Crawling

13 years 9 months ago

Download softsys.cs.uoi.gr

Abstract. Distributed crawling has shown that it can overcome important limitations of the centralized crawling paradigm. However, the distributed nature of current distributed cra...

Odysseas Papapetrou, George Samaras

claim paper

Read More »

16

click to vote

INTR
2002

50views more INTR 2002»

Methodologies for crawler based Web surveys

13 years 5 months ago

Download cybermetrics.wlv.ac.uk

There have been many attempts to study the content of the web, either through human or automatic agents. Five different previously used web survey methodologies are described and ...

Mike Thelwall

claim paper

Read More »

15

click to vote

WWW
2006
ACM

138views Internet Technology» more WWW 2006»

Geographically focused collaborative crawling

14 years 6 months ago

Download www2006.org

A collaborative crawler is a group of crawling nodes, in which each crawling node is responsible for a specific portion of the web. We study the problem of collecting geographical...

Weizheng Gao, Hyun Chul Lee, Yingbo Miao

claim paper

Read More »

13

click to vote

DMKD
2004
ACM

121views Data Mining» more DMKD 2004»

Discovery of ads web hosts through traffic data analysis

13 years 9 months ago

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers