Search Sciweavers | Sciweavers

14 search results - page 2 / 3

» On the convergence of regret minimization dynamics in concav...

click to vote

ATAL
2010
Springer

175views Intelligent Agents» more ATAL 2010»

Using counterfactual regret minimization to create competitive multiplayer poker agents

13 years 6 months ago

Download webdocs.cs.ualberta.ca

Games are used to evaluate and advance Multiagent and Artificial Intelligence techniques. Most of these games are deterministic with perfect information (e.g. Chess and Checkers)....

Nicholas Abou Risk, Duane Szafron

claim paper

Read More »

click to vote

NIPS
2004

101views Information Technology» more NIPS 2004»

Convergence and No-Regret in Multiagent Learning

13 years 6 months ago

Download books.nips.cc

Learning in a multiagent system is a challenging problem due to two key factors. First, if other agents are simultaneously learning then the environment is no longer stationary, t...

Michael H. Bowling

claim paper

Read More »

click to vote

ICML
2009
IEEE

159views Machine Learning» more ICML 2009»

Efficient learning algorithms for changing environments

14 years 5 months ago

Download www.cs.princeton.edu

We study online learning in an oblivious changing environment. The standard measure of regret bounds the difference between the cost of the online learner and the best decision in...

Elad Hazan, C. Seshadhri

claim paper

Read More »

click to vote

ECCC
2007

180views more ECCC 2007»

Adaptive Algorithms for Online Decision Problems

13 years 4 months ago

Download ftp.cs.princeton.edu

We study the notion of learning in an oblivious changing environment. Existing online learning algorithms which minimize regret are shown to converge to the average of all locally...

Elad Hazan, C. Seshadhri

claim paper

Read More »

click to vote

CORR
2008
Springer

172views Education» more CORR 2008»

Altruism in Congestion Games

13 years 5 months ago

Download algo.rwth-aachen.de

This paper studies the effects of introducing altruistic agents into atomic congestion games. Altruistic behavior is modeled by a trade-off between selfish and social objectives. ...

Martin Hoefer, Alexander Skopalik

claim paper

Read More »

« Prev « First page 2 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers