Search Sciweavers | Sciweavers

161 search results - page 26 / 33

ICML
2010
IEEE

222 views, Machine Learning

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

15 years 5 months ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner

ICML
2001
IEEE

185 views, Machine Learning

Off-Policy Temporal Difference Learning with Function Approximation

16 years 8 months ago

Download www.cs.ualberta.ca

We introduce the first algorithm for off-policy temporal-difference learning that is stable with linear function approximation. Off-policy learning is of interest because it forms...

Doina Precup, Richard S. Sutton, Sanjoy Dasgupta

ATAL
2008
Springer

184 views, Intelligent Agents

Sequential decision making with untrustworthy service providers

15 years 9 months ago

Download www.aamas-conference.org

In this paper, we deal with the sequential decision making problem of agents operating in computational economies, where there is uncertainty regarding the trustworthiness of serv...

W. T. Luke Teacy, Georgios Chalkiadakis, Alex Roge...

AAAI
2010

191 views, Intelligent Agents

Relative Entropy Policy Search

15 years 9 months ago

Download www.kyb.tuebingen.mpg.de

Policy search is a successful approach to reinforcement learning. However, policy improvements often result in the loss of information. Hence, it has been marred by premature conv...

Jan Peters, Katharina Mülling, Yasemin Altun

ATAL
2009
Springer

209 views, Intelligent Agents

Adaptive learning in evolving task allocation networks

16 years 2 months ago

Download ifaamas.org

In this paper, we study multi-agent economic systems using a recent approach to economic modeling called Agent-based Computational Economics (ACE): the application of the Complex ...

Tomas Klos, Bart Nooteboom

