Sciweavers

1256 search results - page 194 / 252
» Experiences with the DEVStone benchmark
Sort
View
SDM
2008
SIAM
177views Data Mining» more  SDM 2008»
15 years 2 months ago
Roughly Balanced Bagging for Imbalanced Data
Imbalanced class problems appear in many real applications of classification learning. We propose a novel sampling method to improve bagging for data sets with skewed class distri...
Shohei Hido, Hisashi Kashima
SDM
2008
SIAM
165views Data Mining» more  SDM 2008»
15 years 2 months ago
On the Dangers of Cross-Validation. An Experimental Evaluation
Cross validation allows models to be tested using the full training set by means of repeated resampling; thus, maximizing the total number of points used for testing and potential...
R. Bharat Rao, Glenn Fung
NIPS
2007
15 years 2 months ago
Discriminative K-means for Clustering
We present a theoretical study on the discriminative clustering framework, recently proposed for simultaneous subspace selection via linear discriminant analysis (LDA) and cluster...
Jieping Ye, Zheng Zhao, Mingrui Wu
104
Voted
AVI
2006
15 years 2 months ago
Task taxonomy for graph visualization
Our goal is to define a list of tasks for graph visualization that has enough detail and specificity to be useful to designers who want to improve their system and to evaluators w...
Bongshin Lee, Catherine Plaisant, Cynthia Sims Par...
82
Voted
ECIR
2006
Springer
15 years 2 months ago
Intrinsic Plagiarism Detection
Current research in the field of automatic plagiarism detection for text documents focuses on algorithms that compare plagiarized documents against potential original documents. Th...
Sven Meyer zu Eissen, Benno Stein