Sciweavers

1799 search results - page 5 / 360
» Filtered Reinforcement Learning
Sort
View
ICML
1998
IEEE
15 years 12 months ago
Multi-criteria Reinforcement Learning
Csaba Szepesvári, Zoltán Gábo...
ICML
1996
IEEE
15 years 12 months ago
On-Line Adaptation of a Signal Predistorter through Dual Reinforcement Learning
Patrick Goetz, Shailesh Kumar, Risto Miikkulainen
AROBOTS
2008
131views more  AROBOTS 2008»
14 years 10 months ago
Active audition using the parameter-less self-organising map
This paper presents a novel method for enabling a robot to determine the position of a sound source in three dimensions using just two microphones and interaction with its environm...
Erik Berglund, Joaquin Sitte, Gordon Wyeth
CORR
1998
Springer
164views Education» more  CORR 1998»
14 years 10 months ago
Training Reinforcement Neurocontrollers Using the Polytope Algorithm
A new training algorithm is presented for delayed reinforcement learning problems that does not assume the existence of a critic model and employs the polytope optimization algorit...
Aristidis Likas, Isaac E. Lagaris