ICML
1994
IEEE

Markov Games as a Framework for Multi-Agent Reinforcement Learning

Download www.cs.rutgers.edu

In the Markov decision process (MDP) formalization of reinforcement learning, a single adaptive agent interacts with an environment defined by a probabilistic transition function. In this solipsistic view, secondary agents can only be part of the environment and are therefore fixed in their behavior. The framework of Markov games allows us to widen this view to include multiple adaptive agents with interacting or competing goals. This paper considers a step in this direction in which exactly two agents with diametrically opposed goals share an environment. It describes a Q-learning-like algorithm for finding optimal policies and demonstrates its application to a simple two-player game in which the optimal policy is probabilistic.
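
To make the idea of a probabilistic optimal policy concrete, the following is a minimal sketch, not the paper's own code. It assumes two actions per player and a dictionary-based Q-table, and it uses the closed-form solution of a 2x2 zero-sum matrix game in place of a general linear-programming step. The names `solve_2x2_zero_sum` and `minimax_q_update`, and the defaults for `alpha` and `gamma`, are hypothetical choices made for illustration.

```python
def solve_2x2_zero_sum(m):
    """Return (value, p) for the row player (maximizer) of a 2x2 zero-sum game.

    m[a][o] is the row player's payoff when it plays a and the opponent plays o.
    p is the probability of playing row 0 in a maximin strategy.
    """
    # A pure-strategy saddle point exists when maximin equals minimax.
    row_mins = [min(m[0]), min(m[1])]
    col_maxs = [max(m[0][0], m[1][0]), max(m[0][1], m[1][1])]
    maximin, minimax = max(row_mins), min(col_maxs)
    if maximin == minimax:
        p = 1.0 if row_mins[0] >= row_mins[1] else 0.0
        return float(maximin), p
    # Otherwise mix so both opponent columns give the same expected payoff.
    denom = m[0][0] - m[0][1] - m[1][0] + m[1][1]
    p = (m[1][1] - m[1][0]) / denom
    value = (m[0][0] * m[1][1] - m[0][1] * m[1][0]) / denom
    return value, p


def minimax_q_update(q, s, a, o, reward, s_next, alpha=0.1, gamma=0.9):
    """One Q-learning-like update where the next-state value is a game value.

    q[s][a][o] estimates the agent's return for playing a against opponent
    action o in state s (assumed layout, hypothetical for this sketch).
    """
    v_next, _ = solve_2x2_zero_sum(q[s_next])
    q[s][a][o] = (1 - alpha) * q[s][a][o] + alpha * (reward + gamma * v_next)


# Matching pennies has no pure saddle point, so the maximin policy is mixed.
value, p = solve_2x2_zero_sum([[1, -1], [-1, 1]])
```

For matching pennies the solver returns a game value of 0 with each action played half the time, which is the kind of probabilistic policy the abstract refers to. With more than two actions per player, the matrix game would generally need a linear-programming solver instead of this closed form.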

Michael L. Littman

Adaptive Agent Interacts | Adaptive Agents | ICML 1994 | Machine Learning | Markov Decision Process

Related Content

» An Evolutionary Dynamical Analysis of MultiAgent Learning in Iterated Games

» Decentralized Learning in Markov Games

» The Dynamics of MultiAgent Reinforcement Learning

» Product Distribution Theory for Control of MultiAgent Systems

» A selection-mutation model for Q-learning in multi-agent systems

» Networks of Learning Automata and Limiting Games

» The Self Organization of Context for Learning in MultiAgent Games

» Bayesian Policy Search for MultiAgent Role Discovery

» Strategic Foresighted Learning in Competitive MultiAgent Games

Post Info

Added	27 Aug 2010
Updated	27 Aug 2010
Type	Conference
Year	1994
Where	ICML
Authors	Michael L. Littman