Sciweavers

43 search results - page 9 / 9
» Training Reinforcement Neurocontrollers Using the Polytope A...
Sort
View
NIPS
2001
13 years 7 months ago
Improvisation and Learning
This article presents a 2-phase computational learning model and application. As a demonstration, a system has been built, called CHIME for Computer Human Interacting Musical Enti...
Judy A. Franklin
ICML
2010
IEEE
13 years 6 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
ROBOCUP
2004
Springer
147views Robotics» more  ROBOCUP 2004»
13 years 11 months ago
Learning to Drive and Simulate Autonomous Mobile Robots
We show how to apply learning methods to two robotics problems, namely the optimization of the on-board controller of an omnidirectional robot, and the derivation of a model of the...
Alexander Gloye, Cüneyt Göktekin, Anna E...