Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

15

ICML
2001
IEEE

favoriteEmaildiscussreport

172views Machine Learning» more ICML 2001»

Continuous-Time Hierarchical Reinforcement Learning

14 years 5 months ago

Continuous-Time Hierarchical Reinforcement Learning

Download www.cs.ualberta.ca

Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Prior work in hierarchical RL, such as the MAXQ method, has been limited to the discrete-time discounted reward semiMarkov decision process (SMDP) model. This paper generalizes the MAXQ method to continuous-time discounted and average reward SMDP models. We describe two hierarchical reinforcement learning algorithms: continuous-time discounted reward MAXQ and continuous-time average reward MAXQ. We apply these algorithms to a complex multiagent AGV scheduling problem, and compare their performance and speed with each other, as well as several well-known AGV scheduling heuristics.

Mohammad Ghavamzadeh, Sridhar Mahadevan

Real-time Traffic

Average Reward Maxq | Discrete-time Discounted Reward | ICML 2001 | Machine Learning | MAXQ Method |

claim paper

Related Content

» Efficient ContinuousTime Reinforcement Learning with Adaptive State Graphs

» Reinforcement Learning in Continuous Time and Space

» TeXDYNA Hierarchical Reinforcement Learning in Factored MDPs

» Learning to Fly An Application of Hierarchical Reinforcement Learning

» Policy Gradient in Continuous Time

» Hierarchical reinforcement learning with subpolicies specializing for learned subgoals

» The MAXQ Method for Hierarchical Reinforcement Learning

» Intrusion Detection using Continuous Time Bayesian Networks

» Using ILP to Improve Planning in Hierarchical Reinforcement Learning

Post Info
More Details (n/a)

Added	17 Nov 2009
Updated	17 Nov 2009
Type	Conference
Year	2001
Where	ICML
Authors	Mohammad Ghavamzadeh, Sridhar Mahadevan

Comments (0)