Nonparametric Return Distribution Approximation for Reinforcement Learning

Standard Reinforcement Learning (RL) aims to optimize decision-making rules in terms of the expected return. However, especially for risk-management purposes, other criteria such as the expected shortfall are sometimes preferred. Here, we describe a method of approximating the distribution of returns, which allows us to derive various kinds of information about the returns. We first show that the Bellman equation, a recursive formula for the expected return, can be extended to the cumulative return distribution. We then derive a nonparametric return distribution estimator with particle smoothing based on this extended Bellman equation. A key aspect of the proposed algorithm is that the recursion in the extended Bellman equation is represented by a simple procedure that replaces the particles associated with a state with those of its successor state. We show that our algorithm leads to a risk-sensitive RL paradigm. The usefulness of the proposed approach is demonstrated through ...
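
The particle-replacement idea behind the extended Bellman equation can be sketched as follows. The snippet below is a minimal illustrative approximation only, not the authors' exact particle-smoothing algorithm: it assumes a tabular MDP with sampling access to transitions, and the names `update_particles`, `sample_transition`, and `expected_shortfall` are hypothetical helpers introduced for this sketch.

```python
import numpy as np

def update_particles(particles, policy, sample_transition, gamma=0.95):
    """One sweep of a particle-replacement update (illustrative sketch).

    The step mirrors the extended Bellman recursion Z(s) =d R(s, a) + gamma * Z(s'):
    each particle of a state is replaced by a reward-shifted, discounted copy of a
    randomly chosen particle of the sampled successor state.

    particles: dict state -> np.ndarray of return samples for that state
    policy: dict state -> action (fixed policy being evaluated)
    sample_transition: callable (s, a) -> (reward, next_state)
    """
    new_particles = {}
    for s, z in particles.items():
        a = policy[s]
        updated = np.empty_like(z)
        for i in range(len(z)):
            r, s_next = sample_transition(s, a)
            z_next = np.random.choice(particles[s_next])
            updated[i] = r + gamma * z_next
        new_particles[s] = updated
    return new_particles

def expected_shortfall(z, alpha=0.1):
    """Risk measure read off the particle set: mean of the worst alpha-fraction
    of return samples (conditional value-at-risk)."""
    z_sorted = np.sort(z)
    k = max(1, int(np.ceil(alpha * len(z_sorted))))
    return z_sorted[:k].mean()

if __name__ == "__main__":
    # Toy two-state chain with noisy rewards, used only to exercise the sketch.
    rng = np.random.default_rng(0)
    policy = {0: "a", 1: "a"}
    particles = {s: np.zeros(100) for s in (0, 1)}

    def sample_transition(s, a):
        reward = rng.normal(loc=1.0 if s == 0 else -1.0, scale=0.5)
        next_state = int(rng.integers(0, 2))
        return reward, next_state

    for _ in range(200):
        particles = update_particles(particles, policy, sample_transition)

    print("mean return from state 0:", particles[0].mean())
    print("10% expected shortfall  :", expected_shortfall(particles[0], 0.1))
```

Because the particle set approximates the whole return distribution rather than only its mean, risk criteria such as the expected shortfall can be evaluated alongside the ordinary expected return, which is what makes the approach risk-sensitive.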
Added 09 Nov 2010
Updated 09 Nov 2010
Type Conference
Year 2010
Where ICML
Authors Tetsuro Morimura, Masashi Sugiyama, Hisashi Kashima, Hirotaka Hachiya, Toshiyuki Tanaka