Search Sciweavers | Sciweavers

21 search results - page 1 / 5

» Bayesian reinforcement learning in continuous POMDPs with ga...

224

click to vote

IROS
2009
IEEE

206views Robotics» more IROS 2009»

Bayesian reinforcement learning in continuous POMDPs with gaussian processes

16 years 2 months ago

Download www.cs.cmu.edu

— Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle realworld sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

claim paper

Read More »

224

click to vote

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

16 years 2 months ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

244

click to vote

PKDD
2010
Springer

179views Data Mining» more PKDD 2010»

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

15 years 5 months ago

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

claim paper

Read More »

280

click to vote

NIPS
2007

207views Information Technology» more NIPS 2007»

Bayes-Adaptive POMDPs

15 years 9 months ago

Download books.nips.cc

Bayesian Reinforcement Learning has generated substantial interest recently, as it provides an elegant solution to the exploration-exploitation trade-off in reinforcement learning...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

205

click to vote

ICML
2005
IEEE

100views Machine Learning» more ICML 2005»

Reinforcement learning with Gaussian processes

16 years 8 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir

claim paper

Read More »

« Prev « First page 1 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers