Sciweavers

267 search results - page 23 / 54
» The Dynamics of Multi-Agent Reinforcement Learning
Sort
View
ICML
1996
IEEE
16 years 17 days ago
Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning
Research in reinforcementlearning (RL)has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the averagereward frame...
Sridhar Mahadevan
ICES
2003
Springer
125views Hardware» more  ICES 2003»
15 years 5 months ago
Evolving Reinforcement Learning-Like Abilities for Robots
Abstract. In [8] Yamauchi and Beer explored the abilities of continuous time recurrent neural networks (CTRNNs) to display reinforcementlearning like abilities. The investigated ta...
Jesper Blynel
NIPS
2007
15 years 1 months ago
Online Linear Regression and Its Application to Model-Based Reinforcement Learning
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
Alexander L. Strehl, Michael L. Littman
ICRA
2007
IEEE
110views Robotics» more  ICRA 2007»
15 years 6 months ago
A Reinforcement Learning Approach to Lift Generation in Flapping MAVs: Experimental Results
— In [17] we proposed an RL framework for control of flapping-wing MAVs. The algorithm has been discussed and simulation results using a quasi-steady model showed initial promis...
Mehran Motamed, Joseph Yan
ATAL
2007
Springer
15 years 6 months ago
IFSA: incremental feature-set augmentation for reinforcement learning tasks
Reinforcement learning is a popular and successful framework for many agent-related problems because only limited environmental feedback is necessary for learning. While many algo...
Mazda Ahmadi, Matthew E. Taylor, Peter Stone