Search Sciweavers | Sciweavers

2108 search results - page 56 / 422

» Tracking in Reinforcement Learning

223

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

16 years 14 days ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

251

click to vote

ABIALS
2008
Springer

255views Artificial Intelligence» more ABIALS 2008»

Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning

15 years 9 months ago

Download axon.cs.byu.edu

Abstract. In order to establish autonomous behavior for technical systems, the well known trade-off between reactive control and deliberative planning has to be considered. Within ...

Matthias Rungger, Hao Ding, Olaf Stursberg

claim paper

Read More »

180

click to vote

IJON
2006

90views more IJON 2006»

Reinforcement learning of a simple control task using the spike response model

15 years 6 months ago

Download www.xdr.com

In this work, we propose a variation of a direct reinforcement learning algorithm, suitable for usage with spiking neurons based on the spike response model (SRM). The SRM is a bi...

Murilo Saraiva de Queiroz, Roberto Coelho de Berr&...

claim paper

Read More »

168

click to vote

GECCO
2005
Springer

155views Optimization» more GECCO 2005»

Co-evolving recurrent neurons learn deep memory POMDPs

16 years 13 days ago

Download www.idsia.ch

Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...

Faustino J. Gomez, Jürgen Schmidhuber

claim paper

Read More »

203

click to vote

ATAL
2008
Springer

160views Intelligent Agents» more ATAL 2008»

Sequential decision making in repeated coalition formation under uncertainty

15 years 9 months ago

Download www.aamas-conference.org

The problem of coalition formation when agents are uncertain about the types or capabilities of their potential partners is a critical one. In [3] a Bayesian reinforcement learnin...

Georgios Chalkiadakis, Craig Boutilier

claim paper

Read More »

« Prev « First page 56 / 422 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers