Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

88

Voted

IDEAL
2004
Springer

favoriteEmaildiscussreport

94views Intelligent Agents» more IDEAL 2004»

Policy Gradient Method for Team Markov Games

15 years 4 months ago

Policy Gradient Method for Team Markov Games

Download www.cis.hut.fi

The main aim of this paper is to extend the single-agent policy gradient method for multiagent domains where all agents share the same utility function. We formulate these team problems as Markov games endowed with the asymmetric equilibrium concept and based on this formulation, we provide a direct policy gradient learning method. In addition, we test the proposed method with a small example problem.

Ville Könönen

Real-time Traffic

Asymmetric Equilibrium Concept | Gradient Learning Method | IDEAL 2004 | Policy Gradient Method |

claim paper

Related Content

» Adaptive Stepsize Policy Gradients with Average Reward Metric

» Distributed Optimization in Adaptive Networks

» Emerging coordination in infinite team Markov games

» Geometric Variance Reduction in Markov Chains Application to Value Function and Gradient E...

» Solving Deep Memory POMDPs with Recurrent Policy Gradients

» Particle Filterbased Policy Gradient in POMDPs

» Parameterexploring policy gradients

» Predictive representations for policy gradient in POMDPs

» Policy Gradient Critics

Post Info
More Details (n/a)

Added	02 Jul 2010
Updated	02 Jul 2010
Type	Conference
Year	2004
Where	IDEAL
Authors	Ville Könönen

Comments (0)