Sciweavers

17 search results - page 2 / 4
» Constructing action set from basis functions for reinforceme...
NEUROSCIENCE
2001
Springer
13 years 9 months ago
Role of the Cerebellum in Time-Critical Goal-Oriented Behaviour: Anatomical Basis and Control Principle
The brain is a slow computer, yet humans can skillfully play games such as tennis, where very fast reactions are required. Of particular interest is the evidence for strategic thinki...
Guido Bugmann
ICML
2010
IEEE
13 years 5 months ago
Toward Off-Policy Learning Control with Function Approximation
We present the first temporal-difference learning algorithm for off-policy control with unrestricted linear function approximation whose per-time-step complexity is linear in the ...
Hamid Reza Maei, Csaba Szepesvári, Shalabh ...
IWINAC
2007
Springer
13 years 11 months ago
Evolving Robot Behaviour at Micro (Molecular) and Macro (Molar) Action Level
We investigate how it is possible to shape robot behaviour adopting a molecular or molar point of view. These two ways to approach the issue are inspired by Learning Psychology, wh...
Michela Ponticorvo, Orazio Miglino
IUI
2000
ACM
13 years 9 months ago
Creating an empirical basis for adaptation decisions
How can an adaptive intelligent interface decide what particular action to perform in a given situation, as a function of perceived properties of the user and the situation? Ide...
Anthony Jameson, Barbara Großmann-Hutter, Le...
ATAL
2007
Springer
13 years 11 months ago
Transfer via inter-task mappings in policy search reinforcement learning
The ambitious goal of transfer learning is to accelerate learning on a target task after training on a different, but related, source task. While many past transfer methods have f...
Matthew E. Taylor, Shimon Whiteson, Peter Stone