Sciweavers

1227 search results - page 141 / 246
» Learning Rates for Q-Learning
ML
2007
ACM
127 views · Machine Learning
14 years 9 months ago
Density estimation with stagewise optimization of the empirical risk
We consider multivariate density estimation with identically distributed observations. We study a density estimator which is a convex combination of functions in a dictionary and ...
Jussi Klemelä
ML
2007
ACM
192 views · Machine Learning
14 years 9 months ago
Annealing stochastic approximation Monte Carlo algorithm for neural network training
We propose a general-purpose stochastic optimization algorithm, the so-called annealing stochastic approximation Monte Carlo (ASAMC) algorithm, for neural network training. ASAMC c...
Faming Liang
COLT
2010
Springer
14 years 8 months ago
Open Loop Optimistic Planning
We consider the problem of planning in a stochastic and discounted environment with a limited numerical budget. More precisely, we investigate strategies exploring the set of poss...
Sébastien Bubeck, Rémi Munos
ICDE
2008
IEEE
146 views · Database
15 years 11 months ago
Explaining and Reformulating Authority Flow Queries
Authority flow is an effective ranking mechanism for answering queries on a broad class of data. Systems have been developed to apply this principle on the Web (PageRank and topic ...
Ramakrishna Varadarajan, Vagelis Hristidis, Louiqa...
CASON
2009
IEEE
15 years 4 months ago
Social Network - An Autonomous System Designed for Radio Recommendation
This paper describes the functions of a system proposed for the music tube recommendation from social network data base. Such a system enables the automatic collection, evaluation...
Grzegorz Dziczkowski, Lamine Bougueroua, Katarzyna...