Sciweavers

146 search results - page 11 / 30
» RoadRunner: Towards Automatic Data Extraction from Large Web...
COMPSAC
2003
IEEE
15 years 2 months ago
A Supervised Visual Wrapper Generator for Web-Data Extraction
Extracting data from Web pages using wrappers is a fundamental problem arising in a large variety of applications of vast practical interest. In this paper, we propose a novel sch...
Xiaofeng Meng, Haiyan Wang, Dongdong Hu, Chen Li
KDD
2012
ACM
13 years 2 days ago
Harnessing the wisdom of the crowds for accurate web page clipping
Clipping Web pages, namely extracting the informative clips (areas) from Web pages, has many applications, such as Web printing and e-reading on small handheld devices. Although m...
Lei Zhang, Linpeng Tang, Ping Luo, Enhong Chen, Li...
SIGMOD
2002
ACM
15 years 9 months ago
COMMIX: towards effective web information extraction, integration and query answering
As the WWW becomes more and more popular and powerful, how to search for information on the Web in a database-like way has become an important research topic. COMMIX, which is developed in the DB g...
Tengjiao Wang, Shiwei Tang, Dongqing Yang, Jun Gao...
WEBDB
2009
Springer
15 years 4 months ago
Extracting Route Directions from Web Pages
Linguists and geographers are more and more interested in route direction documents because they contain interesting motion descriptions and language patterns. A large number of s...
Xiao Zhang, Prasenjit Mitra, Sen Xu, Anuj R. Jaisw...
EMNLP
2009
14 years 7 months ago
Toward Completeness in Concept Extraction and Classification
Many algorithms extract terms from text together with some kind of taxonomic classification (is-a) link. However, the general approaches used today, and specifically the methods o...
Eduard H. Hovy, Zornitsa Kozareva, Ellen Riloff