ATAL
2005
Springer

Improving reinforcement learning function approximators via neuroevolution

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of taking each action in each state. In most problems of real-world interest, learning this value function requires a function approximator, which represents the mapping from state-action pairs to values via a concise, parameterized function and uses supervised learning methods to set its parameters. Function approximators make it possible to use temporal difference methods on large problems but, in practice, the feasibility of doing so depends on the ability of the human designer to select an appropriate representation for the value function. My thesis presents a new approach to function approximation that automates some of these difficult design choices by coupling temporal difference methods with policy search methods such as evolutionary computation. It also presents a particular implementation which combines N...
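As a rough illustration of the first two sentences of the abstract, the sketch below runs a temporal difference method (Q-learning) with a linear function approximator on a tiny chain task. It is not the approach proposed in the thesis and involves no evolutionary component. The task, the one-hot feature map, the constants, and every function name (`features`, `q_value`, `step`, `train`, `greedy_policy`) are assumptions made up for this example.

```python
import random

N_STATES = 5        # hypothetical toy chain: states 0..4, state 4 is terminal
ACTIONS = (0, 1)    # 0 = move left, 1 = move right
GAMMA = 0.9         # discount factor (assumed)
ALPHA = 0.1         # step size (assumed)


def features(state, action):
    """One-hot features over state-action pairs (the simplest linear basis)."""
    phi = [0.0] * (N_STATES * len(ACTIONS))
    phi[state * len(ACTIONS) + action] = 1.0
    return phi


def q_value(weights, state, action):
    """Linear approximator: Q(s, a) = w . phi(s, a)."""
    return sum(w * f for w, f in zip(weights, features(state, action)))


def step(state, action):
    """Deterministic chain dynamics; reward 1 on reaching the last state."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    return next_state, (1.0 if done else 0.0), done


def train(episodes=500, seed=0):
    """Q-learning with a semi-gradient update of the linear weights."""
    rng = random.Random(seed)
    weights = [0.0] * (N_STATES * len(ACTIONS))
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            action = rng.choice(ACTIONS)  # uniformly random behaviour policy
            next_state, reward, done = step(state, action)
            # TD target: sampled reward plus discounted bootstrap estimate.
            best_next = 0.0 if done else max(
                q_value(weights, next_state, a) for a in ACTIONS)
            td_error = reward + GAMMA * best_next - q_value(weights, state, action)
            # For a linear model the gradient of Q with respect to w is phi.
            phi = features(state, action)
            weights = [w + ALPHA * td_error * f for w, f in zip(weights, phi)]
            state = next_state
    return weights


def greedy_policy(weights):
    """Greedy action for each non-terminal state under the learned values."""
    return [max(ACTIONS, key=lambda a: q_value(weights, s, a))
            for s in range(N_STATES - 1)]


if __name__ == "__main__":
    w = train()
    print(greedy_policy(w))
```

The representation here (`features`) is fixed by hand, which is exactly the kind of design choice the abstract says a human designer normally has to make; swapping in a different feature map changes what the approximator can represent without touching the update rule.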

Shimon Whiteson

ATAL 2005 | Function Approximators | Reinforcement Learning Problems | Temporal Difference Method

Related Content

» Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

» Tracking value function dynamics to improve reinforcement learning with piecewise linear f...

» Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian p...

» Gradient-Based Learning Updates Improve XCS Performance in Multistep Problems

» Asymmetric Multiagent Reinforcement Learning

» Feature selection and policy optimization for distributed instruction placement using rein...

» Basis Function Construction in Reinforcement Learning Using Cascade-Correlation Learning Ar...

» Rollout Sampling Approximate Policy Iteration

» Action Selection in Bayesian Reinforcement Learning

Post Info

Added	26 Jun 2010
Updated	26 Jun 2010
Type	Conference
Year	2005
Where	ATAL
Authors	Shimon Whiteson