Search Sciweavers | Sciweavers

609 search results - page 9 / 122

» Adaptive record extraction from web pages

128

click to vote

SYNASC
2006
IEEE

211views Algorithms» more SYNASC 2006»

HTML Pattern Generator--Automatic Data Extraction from Web Pages

15 years 7 months ago

Download www.informatik.tu-cottbus.de

Existing methods of information extraction from HTML documents include manual approach, supervised learning and automatic techniques. The manual method has high precision and reca...

Mirel Cosulschi, Adrian Giurca, Bogdan Udrescu, Ni...

claim paper

Read More »

132

click to vote

WEBDB
2009
Springer

149views Database» more WEBDB 2009»

Extracting Route Directions from Web Pages

15 years 8 months ago

Download webdb09.cse.buffalo.edu

Linguists and geographers are more and more interested in route direction documents because they contain interesting motion descriptions and language patterns. A large number of s...

Xiao Zhang, Prasenjit Mitra, Sen Xu, Anuj R. Jaisw...

claim paper

Read More »

129

click to vote

WWW
2001
ACM

187views Internet Technology» more WWW 2001»

IEPAD: information extraction based on pattern discovery

16 years 2 months ago

Download www10.org

The research in information extraction (IE) regards the generation of wrappers that can extract particular information from semistructured Web documents. Similar to compiler gener...

Chia-Hui Chang, Shao-Chen Lui

claim paper

Read More »

108

Voted

SOFSEM
2007
Springer

156views Theoretical Computer Science» more SOFSEM 2007»

Creating Permanent Test Collections of Web Pages for Information Extraction Research

15 years 8 months ago

Download www.dbai.tuwien.ac.at

In the research area of automatic web information extraction, there is a need for permanent and annotated web page collections enabling objective performance evaluation of differen...

Bernhard Pollak, Wolfgang Gatterbauer

claim paper

Read More »

197

Voted

ICDE
2004
IEEE

117views Database» more ICDE 2004»

Probe, Cluster, and Discover: Focused Extraction of QA-Pagelets from the Deep Web

16 years 3 months ago

Download www.cc.gatech.edu

In this paper, we introduce the concept of a QA-Pagelet to refer to the content region in a dynamic page that contains query matches. We present THOR, a scalable and efficient min...

James Caverlee, Ling Liu, David Buttler

claim paper

Read More »

« Prev « First page 9 / 122 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers