Sciweavers

917 search results - page 24 / 184
» Name and Address Data Quality
Sort
View
ICDM
2005
IEEE
138views Data Mining» more  ICDM 2005»
15 years 5 months ago
Labeling Unclustered Categorical Data into Clusters Based on the Important Attribute Values
Sampling has been recognized as an important technique to improve the efficiency of clustering. However, with sampling applied, those points which are not sampled will not have t...
Hung-Leng Chen, Kun-Ta Chuang, Ming-Syan Chen
SIGMOD
2003
ACM
119views Database» more  SIGMOD 2003»
15 years 12 months ago
Robust and Efficient Fuzzy Match for Online Data Cleaning
To ensure high data quality, data warehouses must validate and cleanse incoming data tuples from external sources. In many situations, clean tuples must match acceptable tuples in...
Surajit Chaudhuri, Kris Ganjam, Venkatesh Ganti, R...
ICSE
2008
IEEE-ACM
16 years 18 days ago
Tool support for data validation by end-user programmers
End-user programming tools for creating spreadsheets and webforms offer no data types except "string" for storing many kinds of data, such as person names and street add...
Christopher Scaffidi, Brad A. Myers, Mary Shaw
CIKM
2011
Springer
13 years 11 months ago
Mining entity translations from comparable corpora: a holistic graph mapping approach
This paper addresses the problem of mining named entity translations from comparable corpora, specifically, mining English and Chinese named entity translation. We first observe...
Jinhan Kim, Long Jiang, Seung-won Hwang, Young-In ...
ISCAS
2005
IEEE
127views Hardware» more  ISCAS 2005»
15 years 5 months ago
Wire-driven microarchitectural design space exploration
— In this paper, we propose an interconnect-driven framework that performs an efficient and effective design space exploration for deep submicron processor architecture design. ...
Mongkol Ekpanyapong, Chinnakrishnan S. Ballapuram,...