Sciweavers

17 search results - page 2 / 4
» Automatically Generating Labeled Examples for Web Wrapper Ma...
Sort
View
AAAI
2000
13 years 6 months ago
Learning the Common Structure of Data
The proliferation of online information sources has accentuated the need for tools that automatically validate and recognize data. We present an efficient algorithm that learns st...
Kristina Lerman, Steven Minton
IRI
2009
IEEE
13 years 11 months ago
Ontology Guided Autonomous Label Assignment in Wrapper Induced Tables with Missing Column Names
Formulating and executing queries over distributed, autonomous and heterogeneous resources is an important research area. The advent of the Internet and the Web and their inherent...
Mohammad Shafkat Amin, Hasan M. Jamil
WWW
2005
ACM
14 years 5 months ago
Fully automatic wrapper generation for search engines
When a query is submitted to a search engine, the search engine returns a dynamically generated result page containing the result records, each of which usually consists of a link...
Hongkun Zhao, Weiyi Meng, Zonghuan Wu, Vijay Ragha...
ICWE
2009
Springer
13 years 11 months ago
A Layout-Independent Web News Article Contents Extraction Method Based on Relevance Analysis
Abstract. The traditional Web news article contents extraction methods are time-costly and need much maintenance because they analyze the layout of news pages to generate the wrapp...
Hao Han, Takehiro Tokuda
WWW
2005
ACM
14 years 5 months ago
Web data extraction based on partial tree alignment
This paper studies the problem of extracting data from a Web page that contains several structured data records. The objective is to segment these data records, extract data items...
Yanhong Zhai, Bing Liu