Sciweavers

ISCI
2000

Automating the approximate record-matching process

13 years 4 months ago
Automating the approximate record-matching process
Data Quality has many dimensions one of which is accuracy. Accuracy is usually compromised by errors accidentally or intensionally introduced in a database system. These errors result in inconsistent, incomplete, or erroneous data elements. For example, a small variation in the representation of a data object, produces a unique instantiation of the object being represented. In order to improve the accuracy of the data stored in a database system, we need to compare them either with real-world counterparts or with other data stored in the same or a dierent system. In this paper we address the problem of matching records which refer to the same entity by computing their similarity. Exact record matching has limited applicability in this context since even simple errors like character transpositions cannot be captured in the record linking process. Our methodology deploys advanced data mining techniques for dealing with the high computational and inferential complexity of approximate rec...
Vassilios S. Verykios, Ahmed K. Elmagarmid, Elias
Added 18 Dec 2010
Updated 18 Dec 2010
Type Journal
Year 2000
Where ISCI
Authors Vassilios S. Verykios, Ahmed K. Elmagarmid, Elias N. Houstis
Comments (0)