Search Sciweavers | Sciweavers

COLT
2001
Springer

Machine Learning

Learning Rates for Q-Learning

15 years 2 months ago

In this paper we derive convergence rates for Q-learning. We show an interesting relationship between the convergence rate and the learning rate used in Q-learning. For a polynomi...

Eyal Even-Dar, Yishay Mansour

ICML
2004
IEEE

Machine Learning

Convergence of synchronous reinforcement learning with linear function approximation

15 years 10 months ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

LAMAS
2005
Springer

Intelligent Agents

Unifying Convergence and No-Regret in Multiagent Learning

15 years 3 months ago

Download orca.st.usm.edu

We present a new multiagent learning algorithm, RVσ(t), that builds on an earlier version, ReDVaLeR. ReDVaLeR could guarantee (a) convergence to best response against stationary ...

Bikramjit Banerjee, Jing Peng

ATAL
2009
Springer

Intelligent Agents

Multiagent reinforcement learning: algorithm converging to Nash equilibrium in general-sum discounted stochastic games

15 years 4 months ago

Download www.aamas-conference.org

This paper introduces a multiagent reinforcement learning algorithm that converges with a given accuracy to stationary Nash equilibria in general-sum discounted stochastic games. ...

Natalia Akchurina

ECML
2004
Springer

Machine Learning

Convergence and Divergence in Standard and Averaging Reinforcement Learning

15 years 3 months ago

Download igitur-archive.library.uu.nl

Although tabular reinforcement learning (RL) methods have been proved to converge to an optimal policy, the combination of particular conventional reinforcement learning techniques...

Marco Wiering
