Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

33

ICML
2007
IEEE

favoriteEmaildiscussreport

180views Machine Learning» more ICML 2007»

Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation

14 years 10 months ago

Tracking value function dynamics to improve reinforcement learning with piecewise linear function approximation

Download www.machinelearning.org

Reinforcement learning algorithms can become unstable when combined with linear function approximation. Algorithms that minimize the mean-square Bellman error are guaranteed to converge, but often do so slowly or are computationally expensive. In this paper, we propose to improve the convergence speed of piecewise linear function approximation by tracking the dynamics of the value function with the Kalman filter using a random-walk model. We cast this as a general framework in which we implement the TD, Q-Learning and MAXQ algorithms for different domains, and report empirical results demonstrating improved learning speed over previous methods.

Chee Wee Phua, Robert Fitch

Real-time Traffic

ICML 2007 | Linear Function Approximation | Machine Learning | Piecewise Linear Function | Reinforcement Learning Algorithms |

claim paper

Related Content

» Automatic basis function construction for approximate dynamic programming and reinforcemen...

» Improving reinforcement learning function approximators via neuroevolution

» DynaStyle Planning with Linear Function Approximation and Prioritized Sweeping

» Adaptive bases for Qlearning

» ModelBased Average Reward Reinforcement Learning

» Multidimensional Triangulation and Interpolation for Reinforcement Learning

» Planning with predictive state representations

» BayesAdaptive POMDPs

» LeastSquares Temporal Difference Learning

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2007
Where	ICML
Authors	Chee Wee Phua, Robert Fitch

Comments (0)