Search Sciweavers | Sciweavers

» Linear Bayesian Reinforcement Learning

PKDD
2010
Springer

Gaussian Processes for Sample Efficient Reinforcement Learning with RMAX-Like Exploration

Download www.cs.utexas.edu

Abstract. We present an implementation of model-based online reinforcement learning (RL) for continuous domains with deterministic transitions that is specifically designed to achi...

Tobias Jung, Peter Stone

NIPS
2008

Nonparametric Bayesian Learning of Switching Linear Dynamical Systems

Download www.cs.berkeley.edu

Many nonlinear dynamical phenomena can be effectively modeled by a system that switches among a set of conditionally linear dynamical modes. We consider two such models: the switc...

Emily B. Fox, Erik B. Sudderth, Michael I. Jordan,...

ICML
2004
IEEE

Apprenticeship learning via inverse reinforcement learning

Download ai.stanford.edu

We consider learning in a Markov decision process where we are not explicitly given a reward function, but where instead we can observe an expert demonstrating the task that we wa...

Pieter Abbeel, Andrew Y. Ng

ICML
2005
IEEE

Exploration and apprenticeship learning in reinforcement learning

Download ai.stanford.edu

We consider reinforcement learning in systems with unknown dynamics. Algorithms such as E3 (Kearns and Singh, 2002) learn near-optimal policies by using "exploration policies...

Pieter Abbeel, Andrew Y. Ng

ICRA
2009
IEEE

Model-based and model-free reinforcement learning for visual servoing

Download webdocs.cs.ualberta.ca

To address the difficulty of designing a controller for complex visual-servoing tasks, two learning-based uncalibrated approaches are introduced. The first method starts by b...

Amir Massoud Farahmand, Azad Shademan, Martin J&au...
