Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

13

NIPS
2004

favoriteEmaildiscussreport

101views Information Technology» more NIPS 2004»

Convergence and No-Regret in Multiagent Learning

13 years 5 months ago

Convergence and No-Regret in Multiagent Learning

Download books.nips.cc

Learning in a multiagent system is a challenging problem due to two key factors. First, if other agents are simultaneously learning then the environment is no longer stationary, thus undermining convergence guarantees. Second, learning is often susceptible to deception, where the other agents may be able to exploit a learner's particular dynamics. In the worst case, this could result in poorer performance than if the agent was not learning at all. These challenges are identifiable in the two most common evaluation criteria for multiagent learning algorithms: convergence and regret. Algorithms focusing on convergence or regret in isolation are numerous. In this paper, we seek to address both criteria in a single algorithm by introducing GIGA-WoLF, a learning algorithm for normalform games. We prove the algorithm guarantees at most zero average regret, while demonstrating the algorithm converges in many situations of self-play. We prove convergence in a limited setting and give emp...

Michael H. Bowling

Real-time Traffic

Algorithm | Convergence | Learning Algorithm | NIPS 2004 | NIPS 2007 |

claim paper

Related Content

» Unifying Convergence and NoRegret in Multiagent Learning

» A General Class of NoRegret Learning Algorithms and GameTheoretic Equilibria

» AWESOME A General Multiagent Learning Algorithm that Converges in SelfPlay and Learns a Be...

» Multiagent reinforcement learning algorithm converging to Nash equilibrium in generalsum d...

» Convergence Targeted Optimality and Safety in Multiagent Learning

» How groups develop a specialized domain vocabulary A cognitive multiagent model

» Using adaptive consultation of experts to improve convergence rates in multiagent learning

» Convergence Problems of GeneralSum Multiagent Reinforcement Learning

» Regret based dynamics convergence in weakly acyclic games

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2004
Where	NIPS
Authors	Michael H. Bowling

Comments (0)