Sciweavers

226 search results - page 14 / 46
» Linear Bayesian Reinforcement Learning
ICML
2006
IEEE
15 years 5 months ago
Automatic basis function construction for approximate dynamic programming and reinforcement learning
We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...
Philipp W. Keller, Shie Mannor, Doina Precup
ICML
2009
IEEE
16 years 18 days ago
Convex variational Bayesian inference for large scale generalized linear models
We show how variational Bayesian inference can be implemented for very large generalized linear models. Our relaxation is proven to be a convex problem for any log-concave model. ...
Hannes Nickisch, Matthias W. Seeger
CG
2006
Springer
15 years 1 month ago
Feature Construction for Reinforcement Learning in Hearts
Temporal difference (TD) learning has been used to learn strong evaluation functions in a variety of two-player games. TD-Gammon illustrated how the combination of game tree search...
Nathan R. Sturtevant, Adam M. White
ICRA
1995
IEEE
15 years 3 months ago
Vision-Based Reinforcement Learning for Purposive Behavior Acquisition
This paper presents a method of vision-based reinforcement learning by which a robot learns to shoot a ball into a goal, and discusses several issues in applying the reinforcement...
Minoru Asada, Shoichi Noda, Sukoya Tawaratsumida, ...
ICML
1995
IEEE
16 years 18 days ago
Residual Algorithms: Reinforcement Learning with Function Approximation
A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...
Leemon C. Baird III