Search Sciweavers | Sciweavers

582 search results - page 47 / 117

» Gaussian Processes in Reinforcement Learning

ICML
2006
IEEE

Machine Learning

Using inaccurate models in reinforcement learning

16 years 5 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

ILP
2007
Springer

Automated Reasoning

Building Relational World Models for Reinforcement Learning

15 years 10 months ago

Download ftp.cs.wisc.edu

Abstract. Many reinforcement learning domains are highly relational. While traditional temporal-difference methods can be applied to these domains, they are limited in their capaci...

Trevor Walker, Lisa Torrey, Jude W. Shavlik, Richa...

FUZZIEEE
2007
IEEE

Fuzzy Logic

Fuzzy Approximation for Convergent Model-Based Reinforcement Learning

15 years 10 months ago

Download www.montefiore.ulg.ac.be

Reinforcement learning (RL) is a learning control paradigm that provides well-understood algorithms with good convergence and consistency properties. Unfortunately, these algor...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...

ICRA
2006
IEEE

Robotics

Using Reinforcement Learning to Improve Exploration Trajectories for Error Minimization

15 years 10 months ago

Download mapleleaf.csail.mit.edu

Abstract. The mapping and localization problems have received considerable attention in robotics recently. The exploration problem that drives mapping has started to generate sim...

Thomas Kollar, Nicholas Roy

NIPS
2001

Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 5 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...
