Sciweavers

236 search results - page 3 / 48
» Automatic Sales Lead Generation from Web Data
Sort
View
CIKM
2003
Springer
13 years 10 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
ER
2001
Springer
148views Database» more  ER 2001»
13 years 10 months ago
On the Automatic Extraction of Data from the Hidden Web
An increasing amount of Web data is accessible only by filling out HTML forms to query an underlying data source. While this is most welcome from a user perspective (queries are e...
Stephen W. Liddle, Sai Ho Yau, David W. Embley
ICML
2007
IEEE
14 years 6 months ago
Dynamic hierarchical Markov random fields and their application to web data extraction
Hierarchical models have been extensively studied in various domains. However, existing models assume fixed model structures or incorporate structural uncertainty generatively. In...
Jun Zhu, Zaiqing Nie, Bo Zhang, Ji-Rong Wen
VLDB
2004
ACM
121views Database» more  VLDB 2004»
13 years 10 months ago
An Automatic Data Grabber for Large Web Sites
We demonstrate a system to automatically grab data from data intensive web sites. The system first infers a model that describes at the intensional level the web site as a collec...
Valter Crescenzi, Giansalvatore Mecca, Paolo Meria...
WEBI
2005
Springer
13 years 11 months ago
Automatically Generating Labeled Examples for Web Wrapper Maintenance
In order to let software programs gain full benefit from semi-structured web sources, wrapper programs must be built to provide a “machine-readable” view over them. A signific...
Juan Raposo, Alberto Pan, Manuel Álvarez, J...