Search Sciweavers | Sciweavers

» Model-based function approximation in reinforcement learning

ATAL
2009
Springer

Online exploration in least-squares policy iteration

16 years 1 month ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

CORR
2010
Springer

Predictive State Temporal Difference Learning

15 years 5 months ago

Download www.cs.cmu.edu

We propose a new approach to value function approximation which combines linear temporal difference reinforcement learning with subspace identification. In practical applications...

Byron Boots, Geoffrey J. Gordon

ATAL
2008
Springer

A new perspective to the keepaway soccer: the takers

15 years 8 months ago

Download www.aamas-conference.org

Keepaway is a sub-problem of RoboCup Soccer Simulator in which 'the keepers' try to maintain the possession of the ball, while 'the takers' try to steal the ba...

Atil Iscen, Umut Erogul

IROS
2007
IEEE

Improving humanoid locomotive performance with learnt approximated dynamics via Gaussian processes for regression

16 years 22 days ago

Download www.cs.cmu.edu

We propose to improve the locomotive performance of humanoid robots by using approximated biped stepping and walking dynamics with reinforcement learning (RL). Although...

Jun Morimoto, Christopher G. Atkeson, Gen Endo, Go...

Publication

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

16 years 3 months ago

Download arxiv.org

Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis
