Sciweavers

9 search results - page 1 / 2
CLEANDB
2006
ACM
Structure Aware XML Object Identification
Diego Milano, Monica Scannapieco, Tiziana Catarci
SIGMOD
2007
ACM
Report on the First International VLDB Workshop on Clean Databases (CleanDB 2006)
In this report, we provide a summary of the First Int'l VLDB Workshop on Clean Databases (CleanDB 2006), which took place in Seoul, Korea, on September 11, 2006, in conjunct...
Dongwon Lee, Chen Li
CLEANDB
2006
ACM
Cleansing Databases of Misspelled Proper Nouns
The paper presents a data cleansing technique for string databases. We propose and evaluate an algorithm that identifies a group of strings that consists of (multiple) occurrence...
Arturas Mazeika, Michael H. Böhlen
CLEANDB
2006
ACM
Generic Entity Resolution with Data Confidences
We consider the Entity Resolution (ER) problem (also known as deduplication, or merge-purge), in which records determined to represent the same real-world entity are successively ...
David Menestrina, Omar Benjelloun, Hector Garcia-M...
CLEANDB
2006
ACM
Circumventing Data Quality Problems Using Multiple Join Paths
We propose the Multiple Join Path (MJP) framework for obtaining high quality information by linking fields across multiple databases, when the underlying databases have poor qual...
Yannis Kotidis, Amélie Marian, Divesh Sriva...