Search Sciweavers | Sciweavers

51 search results - page 3 / 11

» Improving Approximate Value Iteration Using Memories and Pre...


UAI
2008

192 views, Artificial Intelligence

Sparse Stochastic Finite-State Controllers for POMDPs

13 years 6 months ago

Download www.aaai.org

Bounded policy iteration is an approach to solving infinite-horizon POMDPs that represents policies as stochastic finite-state controllers and iteratively improves a controller by a...

Eric A. Hansen


AAMAS
2007
Springer

157 views, Intelligent Agents

Continuous-State Reinforcement Learning with Fuzzy Approximation

13 years 11 months ago

Download www.montefiore.ulg.ac.be

Reinforcement learning (RL) is a widely used learning paradigm for adaptive agents. There exist several convergent and consistent RL algorithms which have been intensivel...

Lucian Busoniu, Damien Ernst, Bart De Schutter, Ro...


CORR
2010
Springer

204 views, Education

Predictive State Temporal Difference Learning

13 years 3 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications...

Byron Boots, Geoffrey J. Gordon


ATAL
2005
Springer

181 views, Intelligent Agents

Improving reinforcement learning function approximators via neuroevolution

13 years 10 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson


ECAI
2008
Springer

83 views, Artificial Intelligence

Reinforcement Learning with the Use of Costly Features

13 years 6 months ago

Download people.cs.kuleuven.be

In many practical reinforcement learning problems, the state space is too large to permit an exact representation of the value function, much less the time required to compute it. ...

Robby Goetschalckx, Scott Sanner, Kurt Driessens

