Sciweavers

65 search results - page 1 / 13
» Distributed Data Mining vs. Sampling Techniques: A Compariso...
Sort
View
AI
2004
Springer
13 years 10 months ago
Distributed Data Mining vs. Sampling Techniques: A Comparison
To address the of mining a huge volume of geographically distributed databases, we propose two approaches. The first one is to download only a sample of each database. The second ...
Mohamed Aounallah, Sébastien Quirion, Guy W...
VLDB
2005
ACM
196views Database» more  VLDB 2005»
13 years 10 months ago
Summarizing and Mining Inverse Distributions on Data Streams via Dynamic Inverse Sampling
Emerging data stream management systems approach the challenge of massive data distributions which arrive at high speeds while there is only small storage by summarizing and minin...
Graham Cormode, S. Muthukrishnan, Irina Rozenbaum
KDD
2006
ACM
149views Data Mining» more  KDD 2006»
14 years 5 months ago
Regularized discriminant analysis for high dimensional, low sample size data
Linear and Quadratic Discriminant Analysis have been used widely in many areas of data mining, machine learning, and bioinformatics. Friedman proposed a compromise between Linear ...
Jieping Ye, Tie Wang
ICAI
2004
13 years 6 months ago
A Comparison of Resampling Methods for Clustering Ensembles
-- Combination of multiple clusterings is an important task in the area of unsupervised learning. Inspired by the success of supervised bagging algorithms, we propose a resampling ...
Behrouz Minaei-Bidgoli, Alexander P. Topchy, Willi...
ICPR
2008
IEEE
13 years 11 months ago
MCS-based balancing techniques for skewed classes: An empirical comparison
The class imbalance is a critical problem in classification tasks related to many real world applications. A large number of solutions were proposed in literature, both at the al...
Maria Teresa Ricamato, Claudio Marrocco, Francesco...