Sciweavers

609 search results - page 3 / 122
» Adaptive record extraction from web pages
EJC
2007
15 years 1 month ago
A Personal Web Information/Knowledge Retrieval System
The Web is the richest source of information and knowledge. Unfortunately the current structure of Web pages makes it difficult for users to retrieve the information or knowledge ...
Hao Han, Takehiro Tokuda
WWW
2005
ACM
16 years 9 days ago
Web data extraction based on partial tree alignment
This paper studies the problem of extracting data from a Web page that contains several structured data records. The objective is to segment these data records, extract data items...
Yanhong Zhai, Bing Liu
AAAI
2006
15 years 1 month ago
Automatic Wrapper Generation Using Tree Matching and Partial Tree Alignment
This paper is concerned with the problem of structured data extraction from Web pages. The objective of the research is to automatically segment data records in a page, extract da...
Yanhong Zhai, Bing Liu
APWEB
2003
Springer
15 years 4 months ago
Extracting Content Structure for Web Pages Based on Visual Representation
A new web content structure based on visual representation is proposed in this paper. Many web applications such as information retrieval, information extraction and auto...
Deng Cai, Shipeng Yu, Ji-Rong Wen, Wei-Ying Ma
AAAI
2008
15 years 1 month ago
An Unsupervised Approach for Product Record Normalization across Different Web Sites
An unsupervised probabilistic learning framework for normalizing product records across different retailer Web sites is presented. Our framework decomposes the problem into two ta...
Tak-Lam Wong, Tik-Shun Wong, Wai Lam