Search Sciweavers | Sciweavers

779 search results - page 21 / 156

» Reinforcement Using Supervised Learning for Policy Generaliz...

116

click to vote

ECML
2007
Springer

192views Machine Learning» more ECML 2007»

Policy Gradient Critics

15 years 8 months ago

Download www.idsia.ch

We present Policy Gradient Actor-Critic (PGAC), a new model-free Reinforcement Learning (RL) method for creating limited-memory stochastic policies for Partially Observable Markov ...

Daan Wierstra, Jürgen Schmidhuber

claim paper

Read More »

130

Voted

ICANNGA
2007
Springer

105views Algorithms» more ICANNGA 2007»

Reinforcement Learning in Fine Time Discretization

15 years 8 months ago

Download staff.elka.pw.edu.pl

Reinforcement Learning (RL) is analyzed here as a tool for control system optimization. State and action spaces are assumed to be continuous. Time is assumed to be discrete, yet th...

Pawel Wawrzynski

claim paper

Read More »

101

Voted

FLAIRS
2006

103views Artificial Intelligence» more FLAIRS 2006»

Using Active Relocation to Aid Reinforcement Learning

15 years 4 months ago

Download www.cs.utexas.edu

We propose a new framework for aiding a reinforcement learner by allowing it to relocate, or move, to a state it selects so as to decrease the number of steps it needs to take in ...

Lilyana Mihalkova, Raymond J. Mooney

claim paper

Read More »

111

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 3 months ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

108

click to vote

MICAI
2009
Springer

188views Artificial Intelligence» more MICAI 2009»

A Two-Stage Relational Reinforcement Learning with Continuous Actions for Real Service Robots

15 years 9 months ago

Download ccc.inaoep.mx

Reinforcement Learning is a commonly used technique in robotics, however, traditional algorithms are unable to handle large amounts of data coming from the robot’s sensors, requi...

Julio H. Zaragoza, Eduardo F. Morales

claim paper

Read More »

« Prev « First page 21 / 156 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers