Search Sciweavers | Sciweavers

2677 search results - page 10 / 536

» Extracting Structured Data from Web Pages

152


AAAI
2000

116views Intelligent Agents» more AAAI 2000»

Learning the Common Structure of Data

15 years 8 months ago

Download blondie.cs.byu.edu

The proliferation of online information sources has accentuated the need for tools that automatically validate and recognize data. We present an efficient algorithm that learns st...

Kristina Lerman, Steven Minton


154


CN
2007

108views more CN 2007»

On the peninsula phenomenon in web graph and its implications on web search

15 years 6 months ago

Download net.pku.edu.cn

Web masters usually place certain web pages such as home pages and index pages in front of others. Under such a design, it is necessary to go through some pages to reach the desti...

Tao Meng, Hong-Fei Yan


155


KES
2006
Springer

137views Information Technology» more KES 2006»

Web Site Off-Line Structure Reconfiguration: A Web User Browsing Analysis

15 years 6 months ago

Download wi.dii.uchile.cl

The correct web site text content must help visitors find what they are looking for. However, the reality is quite different; many times the web page text content is a...

Sebastián A. Ríos, Juan D. Velá...


174


VLDB
2004
ACM

121views Database» more VLDB 2004»

An Automatic Data Grabber for Large Web Sites

16 years 1 day ago

Download www.vldb.org

We demonstrate a system to automatically grab data from data-intensive web sites. The system first infers a model that describes at the intensional level the web site as a collec...

Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...


271


ICDE
2004
IEEE

117views Database» more ICDE 2004»

Probe, Cluster, and Discover: Focused Extraction of QA-Pagelets from the Deep Web

16 years 8 months ago

Download www.cc.gatech.edu

In this paper, we introduce the concept of a QA-Pagelet to refer to the content region in a dynamic page that contains query matches. We present THOR, a scalable and efficient min...

James Caverlee, Ling Liu, David Buttler

