Sciweavers

36 search results - page 7 / 8
» Distribution-Based Synthetic Database Generation Techniques ...
AUSDM
2007
Springer
Data Mining
15 years 2 months ago
A Two-Step Classification Approach to Unsupervised Record Linkage
Linking or matching databases is becoming increasingly important in many data mining projects, as linked data can contain information that is not available otherwise, or that woul...
Peter Christen
SACMAT
2009
ACM
15 years 4 months ago
Evaluating role mining algorithms
While many role mining algorithms have been proposed in recent years, no comprehensive study has compared them. These role mining algorithms have been evaluat...
Ian Molloy, Ninghui Li, Tiancheng Li, Ziqing Mao, ...
SAC
2010
ACM
15 years 5 months ago
Background knowledge in formal concept analysis: constraints via closure operators
The aim of this short paper is to present a general method of using background knowledge to impose constraints in conceptual clustering of object-attribute relational data. The pr...
Radim Belohlávek, Vilém Vychodil
ICDE
1999
IEEE
Database
15 years 11 months ago
ROCK: A Robust Clustering Algorithm for Categorical Attributes
Clustering, in data mining, is useful for discovering distribution patterns in the underlying data. Clustering algorithms usually employ a distance-metric-based (e.g., Euclidean) simi...
Sudipto Guha, Rajeev Rastogi, Kyuseok Shim
DAWAK
2005
Springer
15 years 3 months ago
DWEB: A Data Warehouse Engineering Benchmark
Data warehouse architectural choices and optimization techniques are critical to decision support query performance. To facilitate these choices, the performance of the d...
Jérôme Darmont, Omar Boussaid, Fadila...