Search Sciweavers | Sciweavers

397 search results - page 32 / 80

» Reinforcement Learning with Hierarchies of Machines

113

click to vote

ALT
2006
Springer

146views Machine Learning» more ALT 2006»

Probabilistic Generalization of Simple Grammars and Its Application to Reinforcement Learning

15 years 8 months ago

Download www.logos.t.u-tokyo.ac.jp

Abstract. Recently, some non-regular subclasses of context-free grammars have been found to be eﬃciently learnable from positive data. In order to use these eﬃcient algorithms ...

Takeshi Shibata, Ryo Yoshinaka, Takashi Chikayama

claim paper

Read More »

click to vote

ICML
2003
IEEE

104views Machine Learning» more ICML 2003»

The Influence of Reward on the Speed of Reinforcement Learning: An Analysis of Shaping

15 years 5 months ago

Download www.hpl.hp.com

Shaping can be an effective method for improving the learning rate in reinforcement systems. Previously, shaping has been heuristically motivated and implemented. We provide a for...

Adam Laud, Gerald DeJong

claim paper

Read More »

103

click to vote

ICMLA
2010

211views Machine Learning» more ICMLA 2010»

Ensembles of Neural Networks for Robust Reinforcement Learning

14 years 9 months ago

Download ahans.de

Reinforcement learning algorithms that employ neural networks as function approximators have proven to be powerful tools for solving optimal control problems. However, their traini...

Alexander Hans, Steffen Udluft

claim paper

Read More »

click to vote

ICML
2006
IEEE

103views Machine Learning» more ICML 2006»

Using inaccurate models in reinforcement learning

16 years 19 days ago

Download ai.stanford.edu

In the model-based policy search approach to reinforcement learning (RL), policies are found using a model (or "simulator") of the Markov decision process. However, for ...

Pieter Abbeel, Morgan Quigley, Andrew Y. Ng

claim paper

Read More »

click to vote

ICML
2004
IEEE

145views Machine Learning» more ICML 2004»

Convergence of synchronous reinforcement learning with linear function approximation

16 years 19 days ago

Download www.machinelearning.org

Synchronous reinforcement learning (RL) algorithms with linear function approximation are representable as inhomogeneous matrix iterations of a special form (Schoknecht & Merk...

Artur Merke, Ralf Schoknecht

claim paper

Read More »

« Prev « First page 32 / 80 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers