Search Sciweavers | Sciweavers

17 search results - page 3 / 4

» Smarter Sampling in Model-Based Bayesian Reinforcement Learn...

click to vote

ICML
2000
IEEE

165views Machine Learning» more ICML 2000»

A Bayesian Framework for Reinforcement Learning

13 years 10 months ago

Download www.ece.uvic.ca

The reinforcement learning problem can be decomposed into two parallel types of inference: (i) estimating the parameters of a model for the underlying process; (ii) determining be...

Malcolm J. A. Strens

claim paper

Read More »

click to vote

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

14 years 6 days ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

click to vote

ICML
2010
IEEE

222views Machine Learning» more ICML 2010»

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

13 years 3 months ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner

claim paper

Read More »

click to vote

ICML
2009
IEEE

155views Machine Learning» more ICML 2009»

Near-Bayesian exploration in polynomial time

14 years 6 months ago

Download ai.stanford.edu

We consider the exploration/exploitation problem in reinforcement learning (RL). The Bayesian approach to model-based RL offers an elegant solution to this problem, by considering...

J. Zico Kolter, Andrew Y. Ng

claim paper

Read More »

click to vote

NIPS
2008

188views Information Technology» more NIPS 2008»

Bayesian Kernel Shaping for Learning Control

13 years 7 months ago

Download eprints.pascal-network.org

In kernel-based regression learning, optimizing each kernel individually is useful when the data density, curvature of regression surfaces (or decision boundaries) or magnitude of...

Jo-Anne Ting, Mrinal Kalakrishnan, Sethu Vijayakum...

claim paper

Read More »

« Prev « First page 3 / 4 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers