Sciweavers

» Query-related data extraction of hidden web documents
ER
2001
Springer
13 years 9 months ago
On the Automatic Extraction of Data from the Hidden Web
An increasing amount of Web data is accessible only by filling out HTML forms to query an underlying data source. While this is most welcome from a user perspective (queries are e...
Stephen W. Liddle, Sai Ho Yau, David W. Embley
FLAIRS
2001
13 years 6 months ago
Syntactic Folding and its Application to the Information Extraction from Web Pages
The paper deals with investigations concerning potential structures of documents that will be subject to automated information extraction. The focus is on folding principles and the...
Jörg Herrmann
ISEC
2001
Springer
13 years 9 months ago
i-Cube: A Tool-Set for the Dynamic Extraction and Integration of Web Data Content
Over the past decade the Internet has evolved into the largest public community in the world. It provides a wealth of data content and services in almost every field of science, t...
Frankie Poon, Kostas Kontogiannis
CIKM
2003
Springer
13 years 10 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
EMNLP
2007
13 years 6 months ago
Bootstrapping Information Extraction from Field Books
We present two machine learning approaches to information extraction from semi-structured documents that can be used if no annotated training data are available, but there does ex...
Sander Canisius, Caroline Sporleder