Search Sciweavers | Sciweavers

5 search results - page 1 / 1

» Least-Squares Policy Iteration: Bias-Variance Trade-off in C...

Voted

ICML
2010
IEEE

202views Machine Learning» more ICML 2010»

Least-Squares Policy Iteration: Bias-Variance Trade-off in Control Problems

14 years 10 months ago

Download www.icml2010.org

Christophe Thiery, Bruno Scherrer

claim paper

Read More »

click to vote

NIPS
2001

206views Information Technology» more NIPS 2001»

Model-Free Least-Squares Policy Iteration

14 years 11 months ago

Download www.cs.duke.edu

We propose a new approach to reinforcement learning which combines least squares function approximation with policy iteration. Our method is model-free and completely off policy. ...

Michail G. Lagoudakis, Ronald Parr

claim paper

Read More »

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

15 years 4 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

Voted

AAAI
2010

206views Intelligent Agents» more AAAI 2010»

Decision-Theoretic Control of Crowd-Sourced Workflows

14 years 11 months ago

Download www.cs.washington.edu

Crowd-sourcing is a recent framework in which human intelligence tasks are outsourced to a crowd of unknown people ("workers") as an open call (e.g., on Amazon's Me...

Peng Dai, Mausam, Daniel S. Weld

claim paper

Read More »

click to vote

ICML
2009
IEEE

194views Machine Learning» more ICML 2009»

Binary action search for learning continuous-action control policies

15 years 10 months ago

Download www.intelligence.tuc.gr

Reinforcement Learning methods for controlling stochastic processes typically assume a small and discrete action space. While continuous action spaces are quite common in real-wor...

Jason Pazis, Michail G. Lagoudakis

claim paper

Read More »

« Prev « First page 1 / 1 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers