Sciweavers

2011 search results - page 5 / 403
» Universal Reinforcement Learning
Sort
View
74
Voted
ICML
1998
IEEE
16 years 14 days ago
Multi-criteria Reinforcement Learning
Csaba Szepesvári, Zoltán Gábo...
75
Voted
ICML
1998
IEEE
16 years 14 days ago
An Analysis of Direct Reinforcement Learning in Non-Markovian Domains
Mark D. Pendrith, Michael McGarity
76
Voted
ICML
1996
IEEE
16 years 14 days ago
On-Line Adaptation of a Signal Predistorter through Dual Reinforcement Learning
Patrick Goetz, Shailesh Kumar, Risto Miikkulainen
CORR
1998
Springer
164views Education» more  CORR 1998»
14 years 11 months ago
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...
Aristidis Likas, Isaac E. Lagaris
82
Voted
ESANN
2006
15 years 1 months ago
Reducing policy degradation in neuro-dynamic programming
We focus on neuro-dynamic programming methods to learn state-action value functions and outline some of the inherent problems to be faced, when performing reinforcement learning in...
Thomas Gabel, Martin Riedmiller