
AAMAS
2007
Springer


Parallel Reinforcement Learning with Linear Function Approximation


Download www.aamas-conference.org

In this paper, we investigate the use of parallelization in reinforcement learning (RL), with the goal of learning optimal policies for single-agent RL problems more quickly by using parallel hardware. Our approach is based on agents using the SARSA(λ) algorithm, with value functions represented using linear function approximators. In our proposed method, each agent learns independently in a separate simulation of the single-agent problem. The agents periodically exchange information extracted from the weights of their approximators, accelerating convergence towards the optimal policy. We present empirical results for an implementation on a Beowulf cluster.

Categories and Subject Descriptors: I.2.6 [Artificial Intelligence]: Learning

General Terms: Algorithms, Performance, Experimentation

Keywords: Reinforcement learning, value function approximation, parallel algorithms
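The scheme outlined in the abstract (independent SARSA(λ) learners with linear value functions that periodically share weight information) can be illustrated with a rough sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the toy chain environment, the one-hot features, the hyperparameters, the names `SarsaLambdaAgent`, `exchange`, and `train`, and in particular the exchange rule, which is plain weight averaging standing in for whatever information the authors actually extract from the weights. The agents also run one after another in a single process rather than on a cluster.

```python
import random

N_STATES, N_ACTIONS = 6, 2          # hypothetical chain: states 0..5, actions left/right
N_FEATURES = N_STATES * N_ACTIONS   # one-hot (state, action) features


def feature_index(state, action):
    """Index of the single active feature for a (state, action) pair."""
    return state * N_ACTIONS + action


def step(state, action):
    """Toy chain: action 1 moves right, 0 moves left; reward 1 on reaching the right end."""
    nxt = max(0, state + (1 if action == 1 else -1))
    done = nxt == N_STATES - 1
    return nxt, (1.0 if done else 0.0), done


class SarsaLambdaAgent:
    """SARSA(lambda) with a linear approximator and accumulating traces."""

    def __init__(self, seed, alpha=0.1, gamma=0.95, lam=0.8, epsilon=0.1):
        self.w = [0.0] * N_FEATURES
        self.rng = random.Random(seed)
        self.alpha, self.gamma, self.lam, self.epsilon = alpha, gamma, lam, epsilon

    def q(self, state, action):
        # With one-hot features the linear value is just one weight.
        return self.w[feature_index(state, action)]

    def act(self, state):
        # Epsilon-greedy with random tie-breaking.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(N_ACTIONS)
        qs = [self.q(state, a) for a in range(N_ACTIONS)]
        best = max(qs)
        return self.rng.choice([a for a, v in enumerate(qs) if v == best])

    def run_episode(self, max_steps=200):
        z = [0.0] * N_FEATURES  # eligibility traces, reset each episode
        s = 0
        a = self.act(s)
        for _ in range(max_steps):
            s2, r, done = step(s, a)
            a2 = self.act(s2)
            target = r if done else r + self.gamma * self.q(s2, a2)
            delta = target - self.q(s, a)
            z[feature_index(s, a)] += 1.0  # gradient of a linear q is the feature vector
            for i in range(N_FEATURES):
                self.w[i] += self.alpha * delta * z[i]
                z[i] *= self.gamma * self.lam
            if done:
                break
            s, a = s2, a2


def exchange(agents):
    """ASSUMPTION: merge by plain weight averaging (not the paper's rule)."""
    mean = [sum(ag.w[i] for ag in agents) / len(agents) for i in range(N_FEATURES)]
    for ag in agents:
        ag.w = list(mean)


def train(n_agents=4, episodes=200, exchange_every=10):
    agents = [SarsaLambdaAgent(seed=k) for k in range(n_agents)]
    for ep in range(1, episodes + 1):
        for ag in agents:           # sequential stand-in for parallel workers
            ag.run_episode()
        if ep % exchange_every == 0:
            exchange(agents)
    return agents
```

Calling `train()` returns the list of agents; because the default episode count is a multiple of `exchange_every`, they all hold the same averaged weight vector at the end. On real parallel hardware each agent would run in its own process and the averaging step would become a message exchange between workers.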

Matthew Grounds, Daniel Kudenko


AAMAS 2007 | Intelligent Agents | Linear Function Approximators | Single-agent RL Problems | Value Function


Related Content

» Tracking value function dynamics to improve reinforcement learning with piecewise linear f...

» Convergence of synchronous reinforcement learning with linear function approximation

» Rates of Convergence of Performance Gradient Estimates Using Function Approximation and Bi...

» Value Function Approximation in Reinforcement Learning Using the Fourier Basis

» Efficient exploration through active learning for value function approximation in reinforc...

» Automatic basis function construction for approximate dynamic programming and reinforcemen...

» Residual Algorithms Reinforcement Learning with Function Approximation

» Learning Heuristic Functions through Approximate Linear Programming

» Least absolute policy iteration for robust value function approximation

Post Info

Added	08 Dec 2010
Updated	08 Dec 2010
Type	Journal
Year	2007
Where	AAMAS
Authors	Matthew Grounds, Daniel Kudenko
