Sciweavers


NIPS
2003


Extending Q-Learning to General Adaptive Multi-Agent Systems



Recent multi-agent extensions of Q-Learning require knowledge of other agents’ payoffs and Q-functions, and assume game-theoretic play at all times by all other agents. This paper proposes a fundamentally different approach, dubbed “Hyper-Q” Learning, in which values of mixed strategies rather than base actions are learned, and in which other agents’ strategies are estimated from observed actions via Bayesian inference. Hyper-Q may be effective against many different types of adaptive agents, even if they are persistently dynamic. Against certain broad categories of adaptation, it is argued that Hyper-Q may converge to exact optimal time-varying policies. In tests using Rock-Paper-Scissors, Hyper-Q learns to significantly exploit an Infinitesimal Gradient Ascent (IGA) player, as well as a Policy Hill Climber (PHC) player. Preliminary analysis of Hyper-Q against itself is also presented.
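As an illustration of the strategy-estimation idea in the abstract, the following is a minimal sketch of Bayesian inference over an opponent's mixed strategy in Rock-Paper-Scissors. It is not taken from the paper: the discretized simplex, the uniform prior, and the names `simplex_grid`, `posterior`, and `posterior_mean` are assumptions chosen for illustration. The sketch only covers estimating the opponent's strategy from observed actions; it does not learn values of mixed strategies.

```python
ACTIONS = ("rock", "paper", "scissors")


def simplex_grid(n):
    """Candidate mixed strategies whose probabilities are multiples of 1/n.

    Assumption for illustration: the opponent's strategy space is
    approximated by a uniform grid over the probability simplex.
    """
    return [(i / n, j / n, (n - i - j) / n)
            for i in range(n + 1)
            for j in range(n + 1 - i)]


def posterior(observed, grid):
    """Posterior over grid strategies given observed actions.

    Uses Bayes' rule with a uniform prior (an assumption):
    P(y | history) is proportional to the product of y[a] over observed a.
    """
    weights = []
    for y in grid:
        w = 1.0
        for action in observed:
            w *= y[ACTIONS.index(action)]
        weights.append(w)
    total = sum(weights)
    if total == 0.0:
        raise ValueError("no grid strategy can explain the observations")
    return [w / total for w in weights]


def posterior_mean(observed, grid):
    """Expected opponent strategy under the posterior."""
    post = posterior(observed, grid)
    return tuple(sum(p * y[k] for p, y in zip(post, grid))
                 for k in range(len(ACTIONS)))


if __name__ == "__main__":
    grid = simplex_grid(10)
    history = ["rock"] * 8 + ["paper"] * 2
    print(posterior_mean(history, grid))
```

In a learner of the kind the abstract describes, an estimate like this could play the role of the observed "state" against which the value of one's own mixed strategy is learned; how the paper actually combines the two is not specified in the abstract.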

Gerald Tesauro





Post Info

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2003
Where	NIPS
Authors	Gerald Tesauro
