Sciweavers

Search results for "Reinforcement learning": 1233 results, page 3 of 247
CORR
1998
Springer
14 years 11 months ago
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...
Aristidis Likas, Isaac E. Lagaris
IAT
2003
IEEE
15 years 5 months ago
Asymmetric Multiagent Reinforcement Learning
A gradient-based method for both symmetric and asymmetric multiagent reinforcement learning is introduced in this paper. Symmetric multiagent reinforcement learning addresses the ...
Ville Könönen
NECO
2002
14 years 11 months ago
Multiple Model-Based Reinforcement Learning
We propose a modular reinforcement learning architecture for non-linear, nonstationary control tasks, which we call multiple model-based reinforcement learning (MMRL). The basic i...
Kenji Doya, Kazuyuki Samejima, Ken-ichi Katagiri, ...
ICML
2000
IEEE
16 years 17 days ago
Algorithms for Inverse Reinforcement Learning
Andrew Y. Ng, Stuart J. Russell