Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

14

AAAI
2010

favoriteEmaildiscussreport

191views Intelligent Agents» more AAAI 2010»

Relative Entropy Policy Search

13 years 6 months ago

Relative Entropy Policy Search

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature convergence and implausible solutions. As first suggested in the context of covariant policy gradients (Bagnell and Schneider 2003), many of these problems may be addressed by constraining the information loss. In this paper, we continue this path of reasoning and suggest the Relative Entropy Policy Search (REPS) method. The resulting method differs significantly from previous policy gradient approaches and yields an exact update step. It works well on typical reinforcement learning benchmark problems.

Jan Peters, Katharina Mülling, Yasemin Altun

Real-time Traffic

AAAI 2010 | Intelligent Agents | Policy | Policy Gradient | Policy Search |

claim paper

Related Content

» Hierarchical Relative Entropy Policy Search

» The Cross Entropy Method for Fast Policy Search

» CrossEntropy Optimization of Control Policies With Adaptive Basis Functions

» The CrossEntropy Method for Policy Search in Decentralized POMDPs

» EntropyBased Authorship Search in Large Document Collections

» Study on interaction between entropy pruning and kneserney smoothing

» Encountering stronger password requirements user attitudes and behaviors

» EntropyBased Modeling and Simulation of Evolution in Biological Systems

» Covariant Policy Search

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2010
Where	AAAI
Authors	Jan Peters, Katharina Mülling, Yasemin Altun

Comments (0)