Finite-sample Analysis of Bellman Residual Minimization

13 years 4 months ago

Download jmlr.csail.mit.edu

We consider the Bellman residual minimization approach for solving discounted Markov decision problems, where we assume that a generative model of the dynamics and rewards is available. At each policy iteration step, an approximation of the value function for the current policy is obtained by minimizing an empirical Bellman residual defined on a set of n states drawn i.i.d. from a distribution

Odalric-Ambrym Maillard, Rémi Munos, Alessa

Real-time Traffic

Bellman Residual Minimization | Empirical Bellman | JMLR 2010 | Value Functions |

claim paper

Post Info
More Details (n/a)

Added	19 May 2011
Updated	19 May 2011
Type	Journal
Year	2010
Where	JMLR
Authors	Odalric-Ambrym Maillard, Rémi Munos, Alessandro Lazaric, Mohammad Ghavamzadeh

Comments (0)

Sciweavers

Finite-sample Analysis of Bellman Residual Minimization

Bellman Residual Minimization | Empirical Bellman | JMLR 2010 | Value Functions |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers