Sciweavers

13 search results - page 2 / 3
» Model-free reinforcement learning as mixture learning
ICMLA
2010
13 years 3 months ago
Multimodal Parameter-exploring Policy Gradients
Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...
Frank Sehnke, Alex Graves, Christian Osendorfer, J...
NIPS
2008
13 years 6 months ago
Structure Learning in Human Sequential Decision-Making
We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...
Daniel Acuña, Paul R. Schrater
ML
2002
ACM
13 years 5 months ago
Structure in the Space of Value Functions
Solving in an efficient manner many different optimal control tasks within the same underlying environment requires decomposing the environment into its computationally elemental ...
David J. Foster, Peter Dayan
EWCBR
2008
Springer
13 years 7 months ago
Forgetting Reinforced Cases
To meet time constraints, a CBR system must control the time spent searching in the case base for a solution. In this paper, we present the results of a case study comparing the p...
Houcine Romdhane, Luc Lamontagne
CCIA
2010
Springer
13 years 10 days ago
Learning Force-Based Robot Skills from Haptic Demonstration
Locally weighted as well as Gaussian mixtures learning algorithms are suitable strategies for trajectory learning and skill acquisition, in the context of programming by demonstrat...
Leonel Rozo, Pablo Jiménez, Carme Torras