Search Sciweavers | Sciweavers

69

SIAMCO
2000

117views more SIAMCO 2000»

The O.D.E. Method for Convergence of Stochastic Approximation and Reinforcement Learning

14 years 9 months ago

It is shown here that stability of the stochastic approximation algorithm is implied by the asymptotic stability of the origin for an associated ODE. This in turn implies convergen...

Vivek S. Borkar, Sean P. Meyn

claim paper

Read More »

82

click to vote

ESANN
2003

151views Neural Networks» more ESANN 2003»

Accelerating the convergence speed of neural networks learning methods using least squares

14 years 11 months ago

Download www.dice.ucl.ac.be

In this work a hybrid training scheme for the supervised learning of feedforward neural networks is presented. In the proposed method, the weights of the last layer are obtained em...

Oscar Fontenla-Romero, Deniz Erdogmus, José...

claim paper

Read More »

93

Voted

ICML
2000
IEEE

192views Machine Learning» more ICML 2000»

Convergence Problems of General-Sum Multiagent Reinforcement Learning

15 years 10 months ago

Download www.cs.ualberta.ca

Stochastic games are a generalization of MDPs to multiple agents, and can be used as a framework for investigating multiagent learning. Hu and Wellman (1998) recently proposed a m...

Michael H. Bowling

claim paper

Read More »

105

click to vote

NIPS
2008

130views Information Technology» more NIPS 2008»

Temporal Difference Based Actor Critic Learning - Convergence and Neural Implementation

14 years 11 months ago

Download eprints.pascal-network.org

Actor-critic algorithms for reinforcement learning are achieving renewed popularity due to their good convergence properties in situations where other approaches often fail (e.g.,...

Dotan Di Castro, Dmitry Volkinshtein, Ron Meir

claim paper

Read More »

69

click to vote

FUZZIEEE
2007
IEEE

132views Fuzzy Logic» more FUZZIEEE 2007»

Fuzzy Approximation for Convergent Model-Based Reinforcement Learning

15 years 4 months ago

Download www.montefiore.ulg.ac.be

— Reinforcement learning (RL) is a learning control paradigm that provides well-understood algorithms with good convergence and consistency properties. Unfortunately, these algor...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers