Sciweavers

2497 search results - page 118 / 500
» A Partial-Repeatability Approach to Data Mining
Sort
View
KDD
2006
ACM
117views Data Mining» more  KDD 2006»
15 years 10 months ago
Efficient multidimensional data representations based on multiple correspondence analysis
In the On Line Analytical Processing (OLAP) context, exploration of huge and sparse data cubes is a tedious task which does not always lead to efficient results. In this paper, we...
Omar Boussaid, Riadh Ben Messaoud, Sabine Loudcher...
ICDM
2008
IEEE
110views Data Mining» more  ICDM 2008»
15 years 4 months ago
Start Globally, Optimize Locally, Predict Globally: Improving Performance on Imbalanced Data
Class imbalance is a ubiquitous problem in supervised learning and has gained wide-scale attention in the literature. Perhaps the most prevalent solution is to apply sampling to t...
David A. Cieslak, Nitesh V. Chawla
ICDM
2010
IEEE
130views Data Mining» more  ICDM 2010»
14 years 8 months ago
Using Taxonomies to Perform Aggregated Querying over Imprecise Data
-- In this paper, we put forward our approach for answering aggregated queries over imprecise data using domain specific taxonomies. A new concept we call the weighted hierarchical...
Atanu Roy, Chandrima Sarkar, Rafal A. Angryk
SDM
2009
SIAM
343views Data Mining» more  SDM 2009»
15 years 7 months ago
Change-Point Detection in Time-Series Data by Direct Density-Ratio Estimation.
Change-point detection is the problem of discovering time points at which properties of time-series data change. This covers a broad range of real-world problems and has been acti...
Masashi Sugiyama, Yoshinobu Kawahara
ICIC
2005
Springer
15 years 3 months ago
Borderline-SMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning
In recent years, mining with imbalanced data sets receives more and more attentions in both theoretical and practical aspects. This paper introduces the importance of imbalanced da...
Hui Han, Wenyuan Wang, Binghuan Mao