Sciweavers

1227 search results - page 37 / 246
» Learning Rates for Q-Learning
Sort
View
EUROCOLT
1999
Springer
15 years 2 months ago
Regularized Principal Manifolds
Many settings of unsupervised learning can be viewed as quantization problems — the minimization of the expected quantization error subject to some restrictions. This allows the ...
Alex J. Smola, Robert C. Williamson, Sebastian Mik...
CEC
2005
IEEE
14 years 11 months ago
A note on the population based incremental learning with infinite population size
In this paper, we study the dynamical properties of the population based incremental learning (PBIL) algorithm when it uses truncation, proportional, and Boltzmann selection schema...
Reza Rastegar, Mohammad Reza Meybodi
COLT
2010
Springer
14 years 7 months ago
Nonparametric Bandits with Covariates
We consider a bandit problem which involves sequential sampling from two populations (arms). Each arm produces a noisy reward realization which depends on an observable random cov...
Philippe Rigollet, Assaf Zeevi
ICDM
2009
IEEE
188views Data Mining» more  ICDM 2009»
14 years 7 months ago
Binomial Matrix Factorization for Discrete Collaborative Filtering
Matrix factorization (MF) models have proved efficient and well scalable for collaborative filtering (CF) problems. Many researchers also present the probabilistic interpretation o...
Jinlong Wu
CVPR
2012
IEEE
13 years 4 days ago
Icon scanning: Towards next generation QR codes
Undoubtedly, a key feature in the popularity of smartmobile devices is the numerous applications one can install. Frequently, we learn about an application we desire by seeing it ...
Itamar Friedman, Lihi Zelnik-Manor