Sciweavers

267 search results - page 23 / 54
» The Dynamics of Multi-Agent Reinforcement Learning
ICML
1996
IEEE
15 years 10 months ago
Sensitive Discount Optimality: Unifying Discounted and Average Reward Reinforcement Learning
Research in reinforcement learning (RL) has thus far concentrated on two optimality criteria: the discounted framework, which has been very well-studied, and the average reward frame...
Sridhar Mahadevan
ICES
2003
Springer
15 years 2 months ago
Evolving Reinforcement Learning-Like Abilities for Robots
Abstract. In [8] Yamauchi and Beer explored the abilities of continuous time recurrent neural networks (CTRNNs) to display reinforcement learning-like abilities. The investigated ta...
Jesper Blynel
NIPS
2007
14 years 11 months ago
Online Linear Regression and Its Application to Model-Based Reinforcement Learning
We provide a provably efficient algorithm for learning Markov Decision Processes (MDPs) with continuous state and action spaces in the online setting. Specifically, we take a mo...
Alexander L. Strehl, Michael L. Littman
ICRA
2007
IEEE
15 years 4 months ago
A Reinforcement Learning Approach to Lift Generation in Flapping MAVs: Experimental Results
In [17] we proposed an RL framework for control of flapping-wing MAVs. The algorithm has been discussed and simulation results using a quasi-steady model showed initial promis...
Mehran Motamed, Joseph Yan
ATAL
2007
Springer
15 years 3 months ago
IFSA: Incremental Feature-Set Augmentation for Reinforcement Learning Tasks
Reinforcement learning is a popular and successful framework for many agent-related problems because only limited environmental feedback is necessary for learning. While many algo...
Mazda Ahmadi, Matthew E. Taylor, Peter Stone