Sciweavers

» Techniques of Cluster Algorithms in Data Mining
SDM
2003
SIAM
15 years 3 months ago
Approximate Query Answering by Model Averaging
In earlier work we have introduced and explored a variety of different probabilistic models for the problem of answering selectivity queries posed to large sparse binary data set...
Dmitry Pavlov, Padhraic Smyth
KDD
1999
ACM
15 years 6 months ago
The Application of AdaBoost for Distributed, Scalable and On-Line Learning
We propose to use AdaBoost to efficiently learn classifiers over very large and possibly distributed data sets that cannot fit into main memory, as well as on-line learning wher...
Wei Fan, Salvatore J. Stolfo, Junxin Zhang
FLAIRS
2008
15 years 4 months ago
Building Useful Models from Imbalanced Data with Sampling and Boosting
Building useful classification models can be a challenging endeavor, especially when training data is imbalanced. Class imbalance presents a problem when traditional classificatio...
Chris Seiffert, Taghi M. Khoshgoftaar, Jason Van H...
SIGMOD
2008
ACM
16 years 2 months ago
Efficient aggregation for graph summarization
Graphs are widely used to model real world objects and their relationships, and large graph datasets are common in many application domains. To understand the underlying character...
Yuanyuan Tian, Richard A. Hankins, Jignesh M. Pate...
IVC
2007
15 years 1 months ago
Vector quantization and fuzzy ranks for image reconstruction
The problem of clustering is often addressed with techniques based on a Voronoi partition of the data space. Vector quantization is based on a similar principle, but it is a diffe...
Stefano Rovetta, Francesco Masulli