Search Sciweavers | Sciweavers

69 search results - page 3 / 14

» PAC-Bayesian Policy Evaluation for Reinforcement Learning

155

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 6 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

171

click to vote

ICRA
2010
IEEE

145views Robotics» more ICRA 2010»

Reinforcement learning of motor skills in high dimensions: A path integral approach

15 years 4 months ago

Download www-personal.acfr.usyd.edu.au

— Reinforcement learning (RL) is one of the most general approaches to learning control. Its applicability to complex motor systems, however, has been largely impossible so far d...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

137

click to vote

ICONIP
2009

107views Information Technology» more ICONIP 2009»

Tracking in Reinforcement Learning

15 years 3 months ago

Download www.metz.supelec.fr

Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the...

Matthieu Geist, Olivier Pietquin, Gabriel Fricout

claim paper

Read More »

166

Voted

IJCNN
2006
IEEE

127views Neural Networks» more IJCNN 2006»

Reinforcement Learning for Parameterized Motor Primitives

16 years 3 days ago

Download www-clmc.usc.edu

Abstract— One of the major challenges in both action generation for robotics and in the understanding of human motor control is to learn the “building blocks of movement genera...

Jan Peters, Stefan Schaal

claim paper

Read More »

189

Voted

ECML
2005
Springer

193views Machine Learning» more ECML 2005»

Natural Actor-Critic

15 years 11 months ago

Download www-clmc.usc.edu

This paper investigates a novel model-free reinforcement learning architecture, the Natural Actor-Critic. The actor updates are based on stochastic policy gradients employing Amari...

Jan Peters, Sethu Vijayakumar, Stefan Schaal

claim paper

Read More »

« Prev « First page 3 / 14 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers