Model-free reinforcement learning as mixture learning

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizon cases. We describe a Stochastic Approximation EM algorithm for likelihood maximization that, in the tabular case, is equivalent to a non-bootstrapping optimistic policy iteration algorithm, such as Sarsa(1), that can be applied in both MDPs and POMDPs. On the theoretical side, by relating the proposed stochastic EM algorithm to the family of optimistic policy iteration algorithms, we provide new tools that permit the design and analysis of algorithms in that family. On the practical side, preliminary experiments on a POMDP problem demonstrate encouraging results.
Nikos Vlassis, Marc Toussaint
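
To give a rough, concrete picture of the tabular connection mentioned in the abstract, the sketch below implements a non-bootstrapping update of the Sarsa(1) flavor: Q-values are moved toward full Monte Carlo returns rather than bootstrapped targets, with actions drawn from a softmax behaviour policy. This is only an illustration of that algorithm family, not the paper's Stochastic Approximation EM formulation; the environment interface (env.reset() / env.step() returning (next_state, reward, done)), the tabular Q array, and all parameter names are assumptions made for the example.

# Illustrative sketch only: tabular, non-bootstrapping (full-return) updates
# in the spirit of Sarsa(1). The env interface and hyperparameters are
# assumptions for the example, not the paper's exact formulation.
import numpy as np

def softmax_policy(q_row, temperature=1.0):
    """Sample an action from a Boltzmann (softmax) distribution over Q-values."""
    prefs = np.asarray(q_row, dtype=float) / temperature
    prefs -= prefs.max()                      # numerical stability
    probs = np.exp(prefs)
    probs /= probs.sum()
    return np.random.choice(len(probs), p=probs)

def monte_carlo_sarsa_episode(env, Q, gamma=0.95, alpha=0.1):
    """Run one episode and update each visited Q(s, a) toward the full return."""
    trajectory = []                           # (state, action, reward) triples
    state, done = env.reset(), False
    while not done:
        action = softmax_policy(Q[state])
        next_state, reward, done = env.step(action)
        trajectory.append((state, action, reward))
        state = next_state

    # Backward pass: accumulate the discounted Monte Carlo return G and move
    # each visited Q(s, a) toward it (no bootstrapping from Q itself).
    G = 0.0
    for s, a, r in reversed(trajectory):
        G = r + gamma * G
        Q[s, a] += alpha * (G - Q[s, a])
    return Q

Repeated over many episodes, this kind of full-return update improves the policy without ever bootstrapping from current value estimates, which is the defining trait of the optimistic policy iteration family the abstract relates to the proposed stochastic EM algorithm.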
Added: 17 Nov 2009
Updated: 17 Nov 2009
Type: Conference
Year: 2009
Where: ICML
Authors: Nikos Vlassis, Marc Toussaint