Sciweavers

27 search results - page 5 / 6
» A Heterogeneous Field Matching Method for Record Linkage
Sort
View
WWW
2005
ACM
14 years 7 months ago
Web data extraction based on partial tree alignment
This paper studies the problem of extracting data from a Web page that contains several structured data records. The objective is to segment these data records, extract data items...
Yanhong Zhai, Bing Liu
ICDE
2007
IEEE
99views Database» more  ICDE 2007»
14 years 7 months ago
Source-aware Entity Matching: A Compositional Approach
Entity matching (a.k.a. record linkage) plays a crucial role in integrating multiple data sources, and numerous matching solutions have been developed. However, the solutions have...
Warren Shen, Pedro DeRose, Long Vu, AnHai Doan, Ra...
NIPS
2004
13 years 7 months ago
Conditional Models of Identity Uncertainty with Application to Noun Coreference
Coreference analysis, also known as record linkage or identity uncertainty, is a difficult and important problem in natural language processing, databases, citation matching and m...
Andrew McCallum, Ben Wellner
DIAL
2004
IEEE
173views Image Analysis» more  DIAL 2004»
13 years 10 months ago
Citation Recognition for Scientific Publications in Digital Libraries
In this paper, a method based on part-of-speech tagging (PoS) is used for bibliographic reference structure. This method operates on a roughly structured ASCII file, produced by O...
Dominique Besagni, Abdel Belaïd
IJCAI
1997
13 years 7 months ago
Toward Structured Retrieval in Semi-structured Information Spaces
A semi-structured information space consists of multiple collections of textual documents containing fielded or tagged sections. The space can be highly heterogeneous, because eac...
Scott B. Huffman, Catherine Baudin