On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning

Download www.informs-sim.org

Reinforcement Learning (RL) is a simulation-based technique for solving Markov decision processes when their transition probabilities are not easily obtainable or when the problems have a very large number of states. We present an empirical study of (i) the effect of step sizes (learning rules) on the convergence of RL algorithms, (ii) stochastic shortest paths in solving average reward problems via RL, and (iii) the notion of survival probabilities (downside risk) in RL. We also study the impact of step sizes when function approximation is combined with RL. Our experiments yield some interesting insights that will be useful in practice when RL algorithms are implemented within simulators.
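
To make the role of a step size concrete, here is a minimal sketch of how a step-size rule enters a tabular RL update. It is not taken from the paper: the three decay rules, the constants 150 and 300, the discounted Q-learning form, and the names `step_size` and `q_update` are all assumptions chosen for illustration.

```python
import math


def step_size(rule: str, k: int) -> float:
    """Return the learning rate for the k-th update (k >= 1) under a named rule.

    The rule names and constants are illustrative assumptions, not the paper's.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    if rule == "harmonic":
        # alpha_k = 1 / k: decays quickly
        return 1.0 / k
    if rule == "ratio":
        # alpha_k = A / (B + k), with hypothetical constants A = 150, B = 300
        return 150.0 / (300.0 + k)
    if rule == "log":
        # alpha_k = log(k) / k, which is 0 at k = 1, so fall back to 1.0 there
        return math.log(k) / k if k >= 2 else 1.0
    raise ValueError(f"unknown rule: {rule}")


def q_update(q, state, action, reward, next_state, alpha, gamma=0.9):
    """Apply one discounted tabular Q-learning update in place and return the new value."""
    target = reward + gamma * max(q[next_state])
    q[state][action] += alpha * (target - q[state][action])
    return q[state][action]


if __name__ == "__main__":
    # Two states, two actions; the values are arbitrary starting points.
    q = [[0.0, 0.0], [1.0, 2.0]]
    for rule in ("harmonic", "ratio", "log"):
        print(rule, [round(step_size(rule, k), 4) for k in (1, 10, 100, 1000)])
    print(q_update(q, state=0, action=1, reward=1.0, next_state=1, alpha=0.5))
```

The step size only scales how far each update moves the stored value toward its target, so the choice of rule changes how quickly early samples stop mattering.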

Abhijit Gosavi

Average Reward Problems | Modeling And Simulation | RL Algorithms | Stochastic Shortest Paths | WSC 2008 |

Post Info

Added	02 Oct 2010
Updated	02 Oct 2010
Type	Conference
Year	2008
Where	WSC
Authors	Abhijit Gosavi
