Search Sciweavers | Sciweavers

135 search results - page 14 / 27

» Using Reinforcement Learning to Coordinate Better

NIPS
2007

164 views, Information Technology

Incremental Natural Actor-Critic Algorithms

14 years 11 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

ECAL
2005
Springer

119 views, Artificial Intelligence

The Quantitative Law of Effect is a Robust Emergent Property of an Evolutionary Algorithm for Reinforcement Learning

15 years 3 months ago

Download www.psychology.emory.edu

An evolutionary reinforcement-learning algorithm, the operation of which was not associated with an optimality condition, was instantiated in an artificial organism. The algorithm ...

J. J. McDowell, Zahra Ansari

AI
1998
Springer

177 views, Artificial Intelligence

Model-Based Average Reward Reinforcement Learning

14 years 9 months ago

Download web.engr.oregonstate.edu

Reinforcement Learning (RL) is the study of programs that improve their performance by receiving rewards and punishments from the environment. Most RL methods optimize the discoun...

Prasad Tadepalli, DoKyeong Ok

ATAL
2007
Springer

69 views, Intelligent Agents

Using priorities to simplify behavior coordination

15 years 3 months ago

Download www.cs.ou.edu

Real-world behavior-based robot control problems require the coordination of a large number of competing behaviors. However, coordination becomes increasingly difficult as the num...

Brent E. Eskridge, Dean F. Hougen

ICMLA
2010

203 views, Machine Learning

Multimodal Parameter-exploring Policy Gradients

14 years 7 months ago

Download www6.in.tum.de

Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...
