Search Sciweavers | Sciweavers

86 search results - page 3 / 18

» Estimation and Approximation Bounds for Gradient-Based Reinf...

click to vote

ATAL
2005
Springer

181views Intelligent Agents» more ATAL 2005»

Improving reinforcement learning function approximators via neuroevolution

13 years 11 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

claim paper

Read More »

click to vote

ATAL
2007
Springer

162views Intelligent Agents» more ATAL 2007»

Model-based function approximation in reinforcement learning

13 years 12 months ago

Download userweb.cs.utexas.edu

Reinforcement learning promises a generic method for adapting agents to arbitrary tasks in arbitrary stochastic environments, but applying it to new real-world problems remains di...

Nicholas K. Jong, Peter Stone

claim paper

Read More »

click to vote

ICML
2003
IEEE

157views Machine Learning» more ICML 2003»

Action Elimination and Stopping Conditions for Reinforcement Learning

14 years 6 months ago

Download www.hpl.hp.com

We consider incorporating action elimination procedures in reinforcement learning algorithms. We suggest a framework that is based on learning an upper and a lower estimates of th...

Eyal Even-Dar, Shie Mannor, Yishay Mansour

claim paper

Read More »

click to vote

ICANN
2009
Springer

123views Neural Networks» more ICANN 2009»

Efficient Uncertainty Propagation for Reinforcement Learning with Limited Data

13 years 9 months ago

Download www.tu-ilmenau.de

In a typical reinforcement learning (RL) setting details of the environment are not given explicitly but have to be estimated from observations. Most RL approaches only optimize th...

Alexander Hans, Steffen Udluft

claim paper

Read More »

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

13 years 17 days ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

« Prev « First page 3 / 18 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers