Sciweavers

10 search results - page 2 / 2
» Natural Actor-Critic
Sort
View
MDAI
2010
Springer
13 years 4 months ago
Revisiting Natural Actor-Critics with Value Function Approximation
Actor-critics architectures have become popular during the last decade
Matthieu Geist, Olivier Pietquin
IJCNN
2006
IEEE
14 years 8 days ago
Reinforcement Learning for Parameterized Motor Primitives
Abstract— One of the major challenges in both action generation for robotics and in the understanding of human motor control is to learn the “building blocks of movement genera...
Jan Peters, Stefan Schaal
TSMC
2008
132views more  TSMC 2008»
13 years 6 months ago
Ensemble Algorithms in Reinforcement Learning
This paper describes several ensemble methods that combine multiple different reinforcement learning (RL) algorithms in a single agent. The aim is to enhance learning speed and fin...
Marco A. Wiering, Hado van Hasselt
CSL
2010
Springer
13 years 6 months ago
Bayesian update of dialogue state: A POMDP framework for spoken dialogue systems
This paper describes a statistically motivated framework for performing real-time dialogue state updates and policy learning in a spoken dialogue system. The framework is based on...
Blaise Thomson, Steve Young