Sciweavers

1233 search results - page 110 / 247
» Reinforcement Learning in MirrorBot
ICML
2010
IEEE
14 years 8 months ago
Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda
Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease of implementation and use of "bootstrapped" return estimates to make effi...
Carlton Downey, Scott Sanner
ROBOCUP
2005
Springer
15 years 3 months ago
Simultaneous Learning to Acquire Competitive Behaviors in Multi-agent System Based on Modular Learning System
Existing reinforcement learning approaches suffer when other agents change their policies in dynamic multiagent environments. A typical example is a case of RoboCup...
Yasutake Takahashi, Kazuhiro Edazawa, Kentarou Nom...
ATAL
2009
Springer
15 years 4 months ago
Generalized model learning for reinforcement learning in factored domains
Improving the sample efficiency of reinforcement learning algorithms to scale up to larger and more realistic domains is a current research challenge in machine learning. Model-ba...
Todd Hester, Peter Stone
ICML
1998
IEEE
15 years 10 months ago
RL-TOPs: An Architecture for Modularity and Re-Use in Reinforcement Learning
This paper introduces the RL-TOPs architecture for robot learning, a hybrid system combining teleo-reactive planning and reinforcement learning techniques. The aim of this system ...
Malcolm R. K. Ryan, Mark D. Pendrith
ICML
2005
IEEE
15 years 10 months ago
Learning strategies for story comprehension: a reinforcement learning approach
This paper describes the use of machine learning to improve the performance of natural language question answering systems. We present a model for improving story comprehension th...
Eugene Grois, David C. Wilkins