Search Sciweavers | Sciweavers

46 search results - page 2 / 10

» Improve Searching by Reinforcement Learning in Unstructured ...

100

click to vote

IJCAI
2001

163views Artificial Intelligence» more IJCAI 2001»

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

15 years 1 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

click to vote

IJCAI
2003

169views Artificial Intelligence» more IJCAI 2003»

Covariant Policy Search

15 years 29 days ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider

claim paper

Read More »

click to vote

ICMLA
2003

159views Machine Learning» more ICMLA 2003»

A Distributed Reinforcement Learning Approach to Pattern Inference in Go

15 years 29 days ago

Download mysite.verizon.net

— This paper shows that the distributed representation found in Learning Vector Quantization (LVQ) enables reinforcement learning methods to cope with a large decision search spa...

Myriam Abramson, Harry Wechsler

claim paper

Read More »

120

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

15 years 5 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

click to vote

ICML
2005
IEEE

188views Machine Learning» more ICML 2005»

High speed obstacle avoidance using monocular vision and reinforcement learning

16 years 12 days ago

Download ai.stanford.edu

We consider the task of driving a remote control car at high speeds through unstructured outdoor environments. We present an approach in which supervised learning is first used to...

Jeff Michels, Ashutosh Saxena, Andrew Y. Ng

claim paper

Read More »

« Prev « First page 2 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers