Sciweavers

1577 search results - page 51 / 316
» Data Mining: Machine Learning, Statistics, and Databases
Sort
View
ICML
2001
IEEE
16 years 2 months ago
Smoothed Bootstrap and Statistical Data Cloning for Classifier Evaluation
This work is concerned with the estimation of a classifier's accuracy. We first review some existing methods for error estimation, focusing on cross-validation and bootstrap,...
Gregory Shakhnarovich, Ran El-Yaniv, Yoram Baram
KDD
2010
ACM
265views Data Mining» more  KDD 2010»
15 years 5 months ago
Combining predictions for accurate recommender systems
We analyze the application of ensemble learning to recommender systems on the Netflix Prize dataset. For our analysis we use a set of diverse state-of-the-art collaborative filt...
Michael Jahrer, Andreas Töscher, Robert Legen...
KDD
2009
ACM
219views Data Mining» more  KDD 2009»
16 years 2 months ago
Structured correspondence topic models for mining captioned figures in biological literature
A major source of information (often the most crucial and informative part) in scholarly articles from scientific journals, proceedings and books are the figures that directly pro...
Amr Ahmed, Eric P. Xing, William W. Cohen, Robert ...
KDD
2004
ACM
117views Data Mining» more  KDD 2004»
16 years 1 months ago
Predicting customer shopping lists from point-of-sale purchase data
This paper describes a prototype that predicts the shopping lists for customers in a retail store. The shopping list prediction is one aspect of a larger system we have developed ...
Chad M. Cumby, Andrew E. Fano, Rayid Ghani, Marko ...
ICDM
2005
IEEE
187views Data Mining» more  ICDM 2005»
15 years 7 months ago
Parallel Algorithms for Distance-Based and Density-Based Outliers
An outlier is an observation that deviates so much from other observations as to arouse suspicion that it was generated by a different mechanism. Outlier detection has many applic...
Elio Lozano, Edgar Acuña