Sciweavers

2108 search results - page 56 / 422
» Tracking in Reinforcement Learning
Sort
View
ATAL
2005
Springer
15 years 7 months ago
Improving reinforcement learning function approximators via neuroevolution
Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...
Shimon Whiteson
ABIALS
2008
Springer
15 years 3 months ago
Multiscale Anticipatory Behavior by Hierarchical Reinforcement Learning
Abstract. In order to establish autonomous behavior for technical systems, the well known trade-off between reactive control and deliberative planning has to be considered. Within ...
Matthias Rungger, Hao Ding, Olaf Stursberg
IJON
2006
90views more  IJON 2006»
15 years 1 months ago
Reinforcement learning of a simple control task using the spike response model
In this work, we propose a variation of a direct reinforcement learning algorithm, suitable for usage with spiking neurons based on the spike response model (SRM). The SRM is a bi...
Murilo Saraiva de Queiroz, Roberto Coelho de Berr&...
GECCO
2005
Springer
155views Optimization» more  GECCO 2005»
15 years 7 months ago
Co-evolving recurrent neurons learn deep memory POMDPs
Recurrent neural networks are theoretically capable of learning complex temporal sequences, but training them through gradient-descent is too slow and unstable for practical use i...
Faustino J. Gomez, Jürgen Schmidhuber
ATAL
2008
Springer
15 years 3 months ago
Sequential decision making in repeated coalition formation under uncertainty
The problem of coalition formation when agents are uncertain about the types or capabilities of their potential partners is a critical one. In [3] a Bayesian reinforcement learnin...
Georgios Chalkiadakis, Craig Boutilier