Search Sciweavers | Sciweavers

48 search results - page 7 / 10

» A Comparison of Techniques for Sampling Web Pages

ICDM
2008
IEEE

186 views | Data Mining

xCrawl: A High-Recall Crawling Method for Web Mining

16 years 8 days ago

Download ls13-www.cs.uni-dortmund.de

Web Mining Systems exploit the redundancy of data published on the Web to automatically extract information from existing web documents. The first step in the Information Extract...

Kostyantyn M. Shchekotykhin, Dietmar Jannach, Gerh...

VLDB
2011
ACM

251 views | Database

Harvesting relational tables from lists on the web

15 years 23 days ago

Download www.vldb.org

A large number of web pages contain data structured in the form of “lists”. Many such lists can be further split into multi-column tables, which can then be used in more seman...

Hazem Elmeleegy, Jayant Madhavan, Alon Y. Halevy

CIKM
2009
Springer

140 views | Information Technology

Compact full-text indexing of versioned document collections

16 years 12 days ago

Download cis.poly.edu

We study the problem of creating highly compressed full-text index structures for versioned document collections, that is, collections that contain multiple versions of each docume...

Jinru He, Hao Yan, Torsten Suel

IJSI
2008

115 views

Towards Knowledge Acquisition from Semi-Structured Content

15 years 5 months ago

Download www.ijsi.org

A rich family of generic Information Extraction (IE) techniques have been developed by researchers nowadays. This paper proposes WebKER, a system for automatically extract...

Xi Bai, Jigui Sun, Haiyan Che, Lian Shi

COMCOM
2007

84 views

A user-focused evaluation of web prefetching algorithms

15 years 5 months ago

Download www.gii.upv.es

Web prefetching mechanisms have been proposed to benefit web users by hiding the download latencies. Nevertheless, to the knowledge of the authors, there is no attempt to compare...

Josep Domènech, Ana Pont, Julio Sahuquillo,...
