ATAL
2009
Springer

Generalized model learning for reinforcement learning in factored domains

Download userweb.cs.utexas.edu

Improving the sample efficiency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-based methods use experiential data more efficiently than model-free approaches but often require exhaustive exploration to learn an accurate model of the domain. We present an algorithm, Reinforcement Learning with Decision Trees (RL-DT), that uses supervised learning techniques to learn the model by generalizing the relative effect of actions across states. Specifically, RL-DT uses decision trees to model the relative effects of actions in the domain. The agent explores the environment exhaustively in early episodes when its model is inaccurate. Once it believes it has developed an accurate model, it exploits its model, taking the optimal action at each step. The combination of the learning approach with the targeted exploration policy enables fast learning of the model. The sample efficiency of the algorit...
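The idea of generalizing the relative effect of actions across states can be illustrated with a small sketch. This is not the authors' implementation: the 10-cell corridor domain, the hand-rolled Gini-based tree, and all names (`build_tree`, `predict_delta`, `predict_next`, `true_step`) are assumptions made for illustration only. The tree is trained to predict the change in state (next state minus current state) rather than the absolute next state, so experience from a few visited cells can carry over to cells the agent has never seen.

```python
from collections import Counter

LEFT, RIGHT = 0, 1  # hypothetical action encoding for a 10-cell corridor


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def build_tree(rows, labels):
    """Grow a small classification tree with greedy threshold splits."""
    if len(set(labels)) == 1:
        return labels[0]  # pure leaf: a single relative effect
    best = None
    for f in range(len(rows[0])):
        # Try every observed value except the largest as a threshold.
        for t in sorted({r[f] for r in rows})[:-1]:
            lo = [i for i, r in enumerate(rows) if r[f] <= t]
            hi = [i for i, r in enumerate(rows) if r[f] > t]
            score = (len(lo) * gini([labels[i] for i in lo])
                     + len(hi) * gini([labels[i] for i in hi])) / len(rows)
            if best is None or score < best[0]:
                best = (score, f, t, lo, hi)
    if best is None:  # identical inputs with conflicting labels: majority vote
        return Counter(labels).most_common(1)[0][0]
    _, f, t, lo, hi = best
    return {
        "feature": f,
        "threshold": t,
        "low": build_tree([rows[i] for i in lo], [labels[i] for i in lo]),
        "high": build_tree([rows[i] for i in hi], [labels[i] for i in hi]),
    }


def predict_delta(tree, row):
    """Walk the tree to the leaf that holds the predicted state change."""
    while isinstance(tree, dict):
        branch = "low" if row[tree["feature"]] <= tree["threshold"] else "high"
        tree = tree[branch]
    return tree


def true_step(x, action):
    """Toy environment (an assumption): move one cell, walls at 0 and 9."""
    nxt = x + (1 if action == RIGHT else -1)
    return min(max(nxt, 0), 9)


# Experience from only five of the ten cells, both actions in each.
visited = [0, 1, 2, 8, 9]
rows = [(x, a) for x in visited for a in (LEFT, RIGHT)]
deltas = [true_step(x, a) - x for x, a in rows]  # relative effects as labels
tree = build_tree(rows, deltas)


def predict_next(tree, x, action):
    """Predicted next state = current state + predicted relative effect."""
    return x + predict_delta(tree, (x, action))


# Cell 5 was never visited; the learned relative effect still applies there.
print(predict_next(tree, 5, RIGHT), predict_next(tree, 5, LEFT))
```

Had the labels been absolute next states, the tree could only ever output cells it had already observed as outcomes, so a prediction for an unvisited cell would be wrong by construction. Predicting the change in state is what lets the model extrapolate here.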

Todd Hester, Peter Stone

Artificial Intelligence | ATAL 2009 | Reinforcement Learning | Reinforcement Learning Algorithms | Supervised Learning

Related Content

» Anticipatory Learning Classifier Systems and Factored Reinforcement Learning

» A hierarchical approach to efficient reinforcement learning in deterministic domains

» Model-Based Bayesian Reinforcement Learning in Large Structured Domains

» Generalized model learning for Reinforcement Learning on a humanoid robot

» Comparing evolutionary and temporal difference methods in a reinforcement learning domain

» Scaling Model-Based Average-Reward Reinforcement Learning for Product Delivery

» Automatic Feature Selection for Model-Based Reinforcement Learning in Factored MDPs

» Fast Learning in an Actor-Critic Architecture with Reward and Punishment

» Learning the structure of Factored Markov Decision Processes in reinforcement learning pro...

Post Info

Added	26 May 2010
Updated	26 May 2010
Type	Conference
Year	2009
Where	ATAL
Authors	Todd Hester, Peter Stone
