Sciweavers

2108 search results - page 5 / 422
» Tracking in Reinforcement Learning
Sort
View
ICML
1998
IEEE
16 years 3 days ago
Multi-criteria Reinforcement Learning
Csaba Szepesvári, Zoltán Gábo...
ICML
1996
IEEE
16 years 3 days ago
On-Line Adaptation of a Signal Predistorter through Dual Reinforcement Learning
Patrick Goetz, Shailesh Kumar, Risto Miikkulainen
CORR
1998
Springer
164views Education» more  CORR 1998»
14 years 11 months ago
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...
Aristidis Likas, Isaac E. Lagaris
ESANN
2006
15 years 21 days ago
Reducing policy degradation in neuro-dynamic programming
We focus on neuro-dynamic programming methods to learn state-action value functions and outline some of the inherent problems to be faced, when performing reinforcement learning in...
Thomas Gabel, Martin Riedmiller