Sciweavers

24 search results - page 4 / 5
» Technical Update: Least-Squares Temporal Difference Learning
Sort
View
NIPS
2007
13 years 6 months ago
Incremental Natural Actor-Critic Algorithms
We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
ICML
2007
IEEE
14 years 6 months ago
Bayesian actor-critic algorithms
We1 present a new actor-critic learning model in which a Bayesian class of non-parametric critics, using Gaussian process temporal difference learning is used. Such critics model ...
Mohammad Ghavamzadeh, Yaakov Engel
ECTEL
2007
Springer
13 years 11 months ago
Remote Cooperation on Project-centred Learning: a Working Implemented Solution in Academia
The paper aims at illustrating the original technical solution provided within an academic institute in order to manage teaching activities, encompassing the coordination of projec...
Carola Salvioni, Antonio Vincenzo Taddeo
ICALT
2003
IEEE
13 years 10 months ago
Gaining Computational Literacy by Creating Hybrid Aesthetic Learning Spaces
Although the technical skills of pupils are quite high, the current approach to gain media literacy still focusses on updating software applying skills, rather than exploring the ...
Daniela Reimann, Michael Herczeg, Thomas Winkler, ...
JMLR
2006
153views more  JMLR 2006»
13 years 5 months ago
Collaborative Multiagent Reinforcement Learning by Payoff Propagation
In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of c...
Jelle R. Kok, Nikos A. Vlassis