Sciweavers

2497 search results - page 182 / 500
» A Partial-Repeatability Approach to Data Mining
Sort
View
187
Voted
ICDE
2008
IEEE
195views Database» more  ICDE 2008»
15 years 11 months ago
LOCUST: An Online Analytical Processing Framework for High Dimensional Classification of Data Streams
Abstract-- In recent years, data streams have become ubiquitous because of advances in hardware and software technology. The ability to adapt conventional mining problems to data s...
Charu C. Aggarwal, Philip S. Yu
125
Voted
KDD
2007
ACM
178views Data Mining» more  KDD 2007»
15 years 10 months ago
Density-based clustering for real-time stream data
Existing data-stream clustering algorithms such as CluStream are based on k-means. These clustering algorithms are incompetent to find clusters of arbitrary shapes and cannot hand...
Yixin Chen, Li Tu
KDD
2003
ACM
109views Data Mining» more  KDD 2003»
15 years 10 months ago
Generative model-based clustering of directional data
High dimensional directional data is becoming increasingly important in contemporary applications such as analysis of text and gene-expression data. A natural model for multivaria...
Arindam Banerjee, Inderjit S. Dhillon, Joydeep Gho...
79
Voted
KDD
2002
ACM
157views Data Mining» more  KDD 2002»
15 years 10 months ago
Exploiting unlabeled data in ensemble methods
An adaptive semi-supervised ensemble method, ASSEMBLE, is proposed that constructs classification ensembles based on both labeled and unlabeled data. ASSEMBLE alternates between a...
Kristin P. Bennett, Ayhan Demiriz, Richard Maclin
76
Voted
ICDM
2005
IEEE
138views Data Mining» more  ICDM 2005»
15 years 3 months ago
Labeling Unclustered Categorical Data into Clusters Based on the Important Attribute Values
Sampling has been recognized as an important technique to improve the efficiency of clustering. However, with sampling applied, those points which are not sampled will not have t...
Hung-Leng Chen, Kun-Ta Chuang, Ming-Syan Chen