CDC
2010
IEEE

Adaptive bases for Q-learning

Download webee.technion.ac.il

Abstract: We consider reinforcement learning, and in particular the Q-learning algorithm, in large state and action spaces. To cope with the size of these spaces, the state-action value function must be represented by function approximation. We generalize the classical Q-learning algorithm to one in which the basis of the linear function approximation changes dynamically while the agent interacts with the environment. The motivation is to fit the state-action value function more closely to the problem at hand and thereby obtain better performance. The algorithm is shown to converge using two-time-scale stochastic approximation. Finally, we discuss how this technique can be applied to a rich family of RL algorithms with linear function approximation.
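
To make the idea in the abstract concrete, the following is a minimal sketch, not the authors' algorithm. It assumes a hypothetical six-state chain environment, Gaussian radial basis functions whose centers are the adjustable basis parameters, and illustrative step-size schedules. The weights are updated on a fast time scale and the basis centers on a slow one, so the ratio of slow to fast step size tends to zero. All names (`train`, `features`, `WIDTH`, and so on) are assumptions made for illustration.

```python
import math
import random

# Hypothetical toy problem: a chain of states 0..5, reward 1 on reaching state 5.
N_STATES = 6
ACTIONS = (-1, +1)   # move left / move right
GAMMA = 0.9          # discount factor
WIDTH = 1.0          # fixed width of each Gaussian basis function (assumed)


def features(s, centers):
    """Gaussian basis functions; the centers are the tunable basis parameters."""
    return [math.exp(-((s - c) ** 2) / (2 * WIDTH ** 2)) for c in centers]


def q_value(s, a, w, centers):
    """Linear approximation Q(s, a) = w[a] . phi(s; centers)."""
    return sum(wi * p for wi, p in zip(w[a], features(s, centers)))


def train(steps=5000, seed=0, epsilon=0.2):
    rng = random.Random(seed)
    centers = [0.5, 2.5, 4.5]                      # initial basis parameters
    w = [[0.0] * len(centers) for _ in ACTIONS]    # one weight vector per action
    s = 0
    for t in range(1, steps + 1):
        alpha = 0.2 / t ** 0.6   # fast step size (weights)
        beta = 0.2 / t           # slow step size (basis); beta / alpha -> 0

        # Epsilon-greedy action selection.
        if rng.random() < epsilon:
            a = rng.randrange(len(ACTIONS))
        else:
            a = max(range(len(ACTIONS)),
                    key=lambda i: q_value(s, i, w, centers))

        s_next = min(max(s + ACTIONS[a], 0), N_STATES - 1)
        done = s_next == N_STATES - 1
        r = 1.0 if done else 0.0

        # Q-learning target and temporal-difference error.
        best_next = 0.0 if done else max(
            q_value(s_next, i, w, centers) for i in range(len(ACTIONS)))
        phi = features(s, centers)
        q_sa = sum(wi * p for wi, p in zip(w[a], phi))
        delta = r + GAMMA * best_next - q_sa

        # Derivative of Q(s, a) with respect to each center, taken before
        # the weights change: w_i * phi_i * (s - c_i) / WIDTH^2.
        grads = [w[a][i] * phi[i] * (s - centers[i]) / WIDTH ** 2
                 for i in range(len(centers))]

        for i in range(len(centers)):
            w[a][i] += alpha * delta * phi[i]            # fast time scale
            c = centers[i] + beta * delta * grads[i]     # slow time scale
            centers[i] = min(max(c, 0.0), N_STATES - 1.0)  # keep inside the chain

        s = 0 if done else s_next
    return w, centers


if __name__ == "__main__":
    weights, centers = train()
    print("centers:", centers)
    print("weights:", weights)
```

The semi-gradient update of the centers and the clipping to the state range are choices made for this sketch only; the paper's own update rule and convergence conditions should be taken from the paper itself.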

Dotan Di Castro, Shie Mannor

Algorithms | CDC 2010 | Control Systems | Function Approximation | Linear Function Approximation

Related Content

» Dynamic correlation matrix based multi-Q learning for a multi-robot system

» Extending Q-Learning to General Adaptive Multi-Agent Systems

» The MAXQ Method for Hierarchical Reinforcement Learning

» Development of Open Platform Based Adaptive HCI Concepts for Elderly Users

» The Necessity of Average Rewards in Cooperative Multirobot Learning

» Reinforcement Learning Soccer Teams with Incomplete World Models

» An Anti-Jamming Stochastic Game for Cognitive Radio Networks

» The Adaptation Model of a Runtime Adaptable DBMS

» Adaptive beamforming method based on constrained LMS algorithm for tracking mobile user

» Adapting an Object Detector by Considering the Worst Case a Conservative Approach

Post Info

Added	16 May 2011
Updated	16 May 2011
Type	Journal
Year	2010
Where	CDC
Authors	Dotan Di Castro, Shie Mannor