Sciweavers

820 search results - page 22 / 164
» Deep web data extraction
Sort
View
WWW
2009
ACM
15 years 10 months ago
Incorporating site-level knowledge to extract structured data from web forums
Web forums have become an important data resource for many web applications, but extracting structured data from unstructured web forum pages is still a challenging task due to bo...
Jiang-Ming Yang, Rui Cai, Yida Wang, Jun Zhu, Lei ...
66
Voted
CIKM
2003
Springer
15 years 2 months ago
Extracting unstructured data from template generated web documents
We propose a novel approach that identifies web page templates and extracts the unstructured data. Extracting only the body of the page and eliminating the template increases the ...
Ling Ma, Nazli Goharian, Abdur Chowdhury, Misun Ch...
PKDD
2007
Springer
143views Data Mining» more  PKDD 2007»
15 years 3 months ago
Using the Web to Reduce Data Sparseness in Pattern-Based Information Extraction
Textual patterns have been used effectively to extract information from large text collections. However they rely heavily on textual redundancy in the sense that facts have to be m...
Sebastian Blohm, Philipp Cimiano
SIGMOD
2003
ACM
190views Database» more  SIGMOD 2003»
15 years 2 months ago
Extracting Structured Data from Web Pages
Many web sites contain large sets of pages generated using a common template or layout. For example, Amazon lays out the author, title, comments, etc. in the same way in all its b...
Arvind Arasu, Hector Garcia-Molina
SIGMOD
2002
ACM
118views Database» more  SIGMOD 2002»
15 years 9 months ago
A Brief Survey of Web Data Extraction Tools
Alberto H. F. Laender, Berthier A. Ribeiro-Neto, A...