Sciweavers

» Adaptive record extraction from web pages
KDD
2002
ACM
Discovering informative content blocks from Web documents
In this paper, we propose a new approach to discover informative contents from a set of tabular documents (or Web pages) of a Web site. Our system, InfoDiscoverer, first partition...
Shian-Hua Lin, Jan-Ming Ho
KDD
2012
ACM
Harnessing the wisdom of the crowds for accurate web page clipping
Clipping Web pages, namely extracting the informative clips (areas) from Web pages, has many applications, such as Web printing and e-reading on small handheld devices. Although m...
Lei Zhang, Linpeng Tang, Ping Luo, Enhong Chen, Li...
WWW
2011
ACM
HyLiEn: a hybrid approach to general list extraction on the web
We consider the problem of automatically extracting general lists from the web. Existing approaches are mostly dependent upon either the underlying HTML markup or the visual struc...
Fabio Fumarola, Tim Weninger, Rick Barber, Donato ...
CORR
2004
Springer
Unsupervised Topic Adaptation for Lecture Speech Retrieval
We are developing a cross-media information retrieval system, in which users can view specific segments of lecture videos by submitting text queries. To produce a text index, the ...
Atsushi Fujii, Katunobu Itou, Tomoyosi Akiba, Tets...
WISE
2010
Springer
Towards Flexible Mashup of Web Applications Based on Information Extraction and Transfer
Mashup combines information or functionality from two or more existing Web sources to create a new Web page or application. The Web sources that are used to build mashup applicatio...
Junxia Guo, Hao Han, Takehiro Tokuda