Exploiting locality of interactions using a policy-gradient approach in multiagent learning

In this paper, we propose a policy gradient reinforcement learning algorithm to address transition-independent Dec-POMDPs. This approach aims at implicitly exploiting the locality of interaction observed in many practical problems. Our algorithm can be described by an actor-critic architecture: the actor component combines natural gradient updates with a varying learning rate; the critic uses only local information to maintain a belief over the joint state space, and evaluates the current policy as a function of this belief using compatible function approximation. To speed up the convergence of the algorithm, we use an optimistic initialization of the policy that relies on a fully observable, single-agent model of the problem. We illustrate our approach on some simple application problems.
Francisco S. Melo
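The two structural ingredients named in the abstract, a critic built on compatible function approximation and a natural-gradient actor update, can be illustrated in isolation. The Python sketch below is not the paper's algorithm (which targets transition-independent Dec-POMDPs with locally maintained beliefs); it is a minimal single-agent version with a softmax policy, where the critic's weights w on the compatible features double as the natural-gradient direction for the actor (Kakade, 2002). The environment (toy_step), state/action sizes, and all step sizes are placeholder assumptions.

```python
import numpy as np

# Minimal natural actor-critic sketch with compatible function approximation.
# Everything about the environment is a toy placeholder, not from the paper.
rng = np.random.default_rng(0)
n_states, n_actions = 5, 3
theta = np.zeros((n_states, n_actions))   # softmax policy parameters
w = np.zeros((n_states, n_actions))       # critic weights on compatible features
alpha_actor, alpha_critic, gamma = 0.01, 0.1, 0.95

def policy(s):
    logits = theta[s]
    p = np.exp(logits - logits.max())
    return p / p.sum()

def compatible_features(s, a):
    # Gradient of log pi(a|s) w.r.t. theta: indicator(s,a) minus pi(.|s) on row s.
    phi = np.zeros_like(theta)
    phi[s] -= policy(s)
    phi[s, a] += 1.0
    return phi

def toy_step(s, a):
    # Placeholder dynamics and reward so the sketch runs end to end.
    s_next = (s + a) % n_states
    return s_next, 1.0 if s_next == 0 else 0.0

s = 0
for t in range(5000):
    a = rng.choice(n_actions, p=policy(s))
    s_next, r = toy_step(s, a)
    a_next = rng.choice(n_actions, p=policy(s_next))

    phi, phi_next = compatible_features(s, a), compatible_features(s_next, a_next)

    # SARSA-style TD error on the compatible critic Q(s,a) ~ <w, phi(s,a)>.
    delta = r + gamma * np.sum(w * phi_next) - np.sum(w * phi)
    w += alpha_critic * delta * phi

    # Natural-gradient actor step: with compatible features, the natural
    # gradient of the expected return is proportional to w.
    theta += alpha_actor * w
    s = s_next
```

In the paper's multiagent setting, each agent would maintain such a policy over its own actions, the critic's belief would be computed from local observations only, and the actor's learning rate would vary over time; none of that machinery is shown here.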
Type Conference
Year 2008
Where ECAI
Authors Francisco S. Melo