Sciweavers

94 search results - page 1 / 19
» Using the Structure of Web Sites for Automatic Segmentation ...
Sort
View
SIGMOD
2004
ACM
92views Database» more  SIGMOD 2004»
14 years 4 months ago
Using the Structure of Web Sites for Automatic Segmentation of Tables
Kristina Lerman, Lise Getoor, Steven Minton, Craig...
WWW
2005
ACM
14 years 5 months ago
Web data extraction based on partial tree alignment
This paper studies the problem of extracting data from a Web page that contains several structured data records. The objective is to segment these data records, extract data items...
Yanhong Zhai, Bing Liu
LWA
2008
13 years 6 months ago
Capturing the needs of amateur web designers by means of examples
Many sites are created by people who lack professional training in web design. We present `SiteGuide', a tool that helps amateur web designers to decide which information wil...
Vera Hollink, Viktor de Boer, Maarten van Someren
PVLDB
2008
141views more  PVLDB 2008»
13 years 4 months ago
WebTables: exploring the power of tables on the web
The World-Wide Web consists of a huge number of unstructured documents, but it also contains structured data in the form of HTML tables. We extracted 14.1 billion HTML tables from...
Michael J. Cafarella, Alon Y. Halevy, Daisy Zhe Wa...
VLDB
2004
ACM
121views Database» more  VLDB 2004»
13 years 10 months ago
An Automatic Data Grabber for Large Web Sites
We demonstrate a system to automatically grab data from data intensive web sites. The system first infers a model that describes at the intensional level the web site as a collec...
Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...