Sciweavers

4 search results - page 1 / 1
» A Generalized Kalman Filter for Fixed Point Approximation an...
Sort
View
EWRL
2008
13 years 6 months ago
Bayesian Reward Filtering
A wide variety of function approximation schemes have been applied to reinforcement learning. However, Bayesian filtering approaches, which have been shown efficient in other field...
Matthieu Geist, Olivier Pietquin, Gabriel Fricout
ICML
1999
IEEE
14 years 5 months ago
Least-Squares Temporal Difference Learning
Excerpted from: Boyan, Justin. Learning Evaluation Functions for Global Optimization. Ph.D. thesis, Carnegie Mellon University, August 1998. (Available as Technical Report CMU-CS-...
Justin A. Boyan
ATAL
2008
Springer
13 years 7 months ago
Sigma point policy iteration
In reinforcement learning, least-squares temporal difference methods (e.g., LSTD and LSPI) are effective, data-efficient techniques for policy evaluation and control with linear v...
Michael H. Bowling, Alborz Geramifard, David Winga...