Search Sciweavers | Sciweavers

417 search results - page 2 / 84

» Reinforcement Learning Estimation of Distribution Algorithm

click to vote

ICML
1994
IEEE

151views Machine Learning» more ICML 1994»

Learning Without State-Estimation in Partially Observable Markovian Decision Processes

13 years 9 months ago

Download www.eecs.umich.edu

Reinforcement learning (RL) algorithms provide a sound theoretical basis for building learning control architectures for embedded agents. Unfortunately all of the theory and much ...

Satinder P. Singh, Tommi Jaakkola, Michael I. Jord...

claim paper

Read More »

click to vote

IJCAI
2001

163views Artificial Intelligence» more IJCAI 2001»

Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning

13 years 7 months ago

Download www.cs.colorado.edu

Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

click to vote

NECO
2010

103views more NECO 2010»

Posterior Weighted Reinforcement Learning with State Uncertainty

13 years 4 months ago

Download www.maths.bris.ac.uk

Reinforcement learning models generally assume that a stimulus is presented that allows a learner to unambiguously identify the state of nature, and the reward received is drawn f...

Tobias Larsen, David S. Leslie, Edmund J. Collins,...

claim paper

Read More »

click to vote

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

13 years 3 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

click to vote

ICMLA
2003

159views Machine Learning» more ICMLA 2003»

A Distributed Reinforcement Learning Approach to Pattern Inference in Go

13 years 7 months ago

Download mysite.verizon.net

— This paper shows that the distributed representation found in Learning Vector Quantization (LVQ) enables reinforcement learning methods to cope with a large decision search spa...

Myriam Abramson, Harry Wechsler

claim paper

Read More »

« Prev « First page 2 / 84 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers