Search Sciweavers | Sciweavers

13 search results - page 2 / 3

» Model-free reinforcement learning as mixture learning

click to vote

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

13 years 3 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

click to vote

NIPS
2008

129views Information Technology» more NIPS 2008»

Structure Learning in Human Sequential Decision-Making

13 years 6 months ago

Download www-users.cs.umn.edu

We use graphical models and structure learning to explore how people learn policies in sequential decision making tasks. Studies of sequential decision-making in humans frequently...

Daniel Acuña, Paul R. Schrater

claim paper

Read More »

click to vote

ML
2002
ACM

100views Machine Learning» more ML 2002»

Structure in the Space of Value Functions

13 years 5 months ago

Download www.gatsby.ucl.ac.uk

Solving in an efficient manner many different optimal control tasks within the same underlying environment requires decomposing the environment into its computationally elemental ...

David J. Foster, Peter Dayan

claim paper

Read More »

click to vote

EWCBR
2008
Springer

206views Automated Reasoning» more EWCBR 2008»

Forgetting Reinforced Cases

13 years 7 months ago

Download agora.ulaval.ca

To meet time constraints, a CBR system must control the time spent searching in the case base for a solution. In this paper, we presents the results of a case study comparing the p...

Houcine Romdhane, Luc Lamontagne

claim paper

Read More »

click to vote

CCIA
2010
Springer

296views Artificial Intelligence» more CCIA 2010»

Learning Force-Based Robot Skills from Haptic Demonstration

13 years 10 days ago

Download www.mendeley.com

Locally weighted as well as Gaussian mixtures learning algorithms are suitable strategies for trajectory learning and skill acquisition, in the context of programming by demonstrat...

Leonel Rozo, Pablo Jiménez, Carme Torras

claim paper

Read More »

« Prev « First page 2 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers