Search Sciweavers | Sciweavers

162 search results - page 11 / 33

UAI
2008

Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping

We consider the problem of efficiently learning optimal control policies and value functions over large state spaces in an online setting in which estimates must be available afte...

Richard S. Sutton, Csaba Szepesvári, Alborz...

ESANN
2001

Learning fault-tolerance in Radial Basis Function Networks

This paper describes a method of supervised learning based on forward selection branching. This method improves fault tolerance by means of combining information related to general...

Xavier Parra, Andreu Català

IJON
2006

Reinforcement learning of a simple control task using the spike response model

In this work, we propose a variation of a direct reinforcement learning algorithm, suitable for usage with spiking neurons based on the spike response model (SRM). The SRM is a bi...

Murilo Saraiva de Queiroz, Roberto Coelho de Berr...

ICONIP
2009

Tracking in Reinforcement Learning

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

FLAIRS
2003

Learning Opening Strategy in the Game of Go

In this paper, we present an experimental methodology and results for a machine learning approach to learning opening strategy in the game of Go, a game for which the best compute...

Timothy Huang, Graeme Connell, Bryan McQuade
