Search Sciweavers | Sciweavers

16

ATAL
2006
Springer

127views Intelligent Agents» more ATAL 2006»

13 years 8 months ago

We address the problem of learning in repeated N-player (as opposed to 2-player) general-sum games. We describe an extension to existing criteria focusing explicitly on such setti...

Thuc Vu, Rob Powers, Yoav Shoham

claim paper

Read More »

12

click to vote

ICML
2003
IEEE

156views Machine Learning» more ICML 2003»

AWESOME: A General Multiagent Learning Algorithm that Converges in Self-Play and Learns a Best Response Against Stationary Oppon

14 years 5 months ago

Download www-2.cs.cmu.edu

A satisfactory multiagent learning algorithm should, at a minimum, learn to play optimally against stationary opponents and converge to a Nash equilibrium in self-play. The algori...

Vincent Conitzer, Tuomas Sandholm

claim paper

Read More »

13

click to vote

AAAI
2006

120views Intelligent Agents» more AAAI 2006»

Boosting Expert Ensembles for Rapid Concept Recall

13 years 6 months ago

Download www.aaai.org

Many learning tasks in adversarial domains tend to be highly dependent on the opponent. Predefined strategies optimized for play against a specific opponent are not likely to succ...

Achim Rettinger, Martin Zinkevich, Michael H. Bowl...

claim paper

Read More »

15

click to vote

ATAL
2010
Springer

181views Intelligent Agents» more ATAL 2010»

Planning against fictitious players in repeated normal form games

13 years 5 months ago

Download www.aamas-conference.org

Planning how to interact against bounded memory and unbounded memory learning opponents needs different treatment. Thus far, however, work in this area has shown how to design pla...

Enrique Munoz de Cote, Nicholas R. Jennings

claim paper

Read More »

9

click to vote

LAMAS
2005
Springer

124views Intelligent Agents» more LAMAS 2005»

Unifying Convergence and No-Regret in Multiagent Learning

13 years 10 months ago

Download orca.st.usm.edu

We present a new multiagent learning algorithm, RVσ(t), that builds on an earlier version, ReDVaLeR . ReDVaLeR could guarantee (a) convergence to best response against stationary ...

Bikramjit Banerjee, Jing Peng

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers