Search Sciweavers | Sciweavers

51 search results - page 2 / 11

» Characterizing reinforcement learning methods through parame...

click to vote

IJCAI
2003

169views Artificial Intelligence» more IJCAI 2003»

Covariant Policy Search

13 years 6 months ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider

claim paper

Read More »

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

13 years 10 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

click to vote

ICML
2007
IEEE

202views Machine Learning» more ICML 2007»

Cross-domain transfer for reinforcement learning

14 years 5 months ago

Download www.machinelearning.org

A typical goal for transfer learning algorithms is to utilize knowledge gained in a source task to learn a target task faster. Recently introduced transfer methods in reinforcemen...

Matthew E. Taylor, Peter Stone

claim paper

Read More »

click to vote

AAAI
2006

161views Intelligent Agents» more AAAI 2006»

Sample-Efficient Evolutionary Function Approximation for Reinforcement Learning

13 years 6 months ago

Download staff.science.uva.nl

Reinforcement learning problems are commonly tackled with temporal difference methods, which attempt to estimate the agent's optimal value function. In most real-world proble...

Shimon Whiteson, Peter Stone

claim paper

Read More »

click to vote

AAAI
2000

139views Intelligent Agents» more AAAI 2000»

Localizing Search in Reinforcement Learning

13 years 6 months ago

Download www.cs.colorado.edu

Reinforcement learning (RL) can be impractical for many high dimensional problems because of the computational cost of doing stochastic search in large state spaces. We propose a ...

Gregory Z. Grudic, Lyle H. Ungar

claim paper

Read More »

« Prev « First page 2 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers