Search Sciweavers | Sciweavers

13

ICML
2006
IEEE

131views Machine Learning» more ICML 2006»

14 years 5 months ago

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm--Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

claim paper

Read More »

18

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

13 years 5 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

11

click to vote

COLT
2008
Springer

132views Machine Learning» more COLT 2008»

Adaptive Aggregation for Reinforcement Learning with Efficient Exploration: Deterministic Domains

13 years 6 months ago

Download colt2008.cs.helsinki.fi

We propose a model-based learning algorithm, the Adaptive Aggregation Algorithm (AAA), that aims to solve the online, continuous state space reinforcement learning problem in a de...

Andrey Bernstein, Nahum Shimkin

claim paper

Read More »

12

click to vote

ESANN
2008

125views Neural Networks» more ESANN 2008»

Improvement in Game Agent Control Using State-Action Value Scaling

13 years 5 months ago

Download www.dice.ucl.ac.be

The aim of this paper is to enhance the performance of a reinforcement learning game agent controller, within a dynamic game environment, through the retention of learned informati...

Leo Galway, Darryl Charles, Michaela M. Black

claim paper

Read More »

18

click to vote

JMLR
2010

148views more JMLR 2010»

A Generalized Path Integral Control Approach to Reinforcement Learning

12 years 11 months ago

Download jmlr.csail.mit.edu

With the goal to generate more scalable algorithms with higher efficiency and fewer open parameters, reinforcement learning (RL) has recently moved towards combining classical tec...

Evangelos Theodorou, Jonas Buchli, Stefan Schaal

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers