Search Sciweavers | Sciweavers

226 search results - page 11 / 46

» Linear Bayesian Reinforcement Learning

115

click to vote

JMLR
2010

125views more JMLR 2010»

Variational methods for Reinforcement Learning

14 years 9 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

claim paper

Read More »

114

click to vote

GECCO
2009
Springer

135views Optimization» more GECCO 2009»

Neuroevolutionary reinforcement learning for generalized helicopter control

15 years 9 months ago

Download www.science.uva.nl

Helicopter hovering is an important challenge problem in the ﬁeld of reinforcement learning. This paper considers several neuroevolutionary approaches to discovering robust cont...

Rogier Koppejan, Shimon Whiteson

claim paper

Read More »

145

click to vote

NN
2010
Springer

187views Neural Networks» more NN 2010»

Efficient exploration through active learning for value function approximation in reinforcement learning

14 years 9 months ago

Download sugiyama-www.cs.titech.ac.jp

Appropriately designing sampling policies is highly important for obtaining better control policies in reinforcement learning. In this paper, we first show that the least-squares ...

Takayuki Akiyama, Hirotaka Hachiya, Masashi Sugiya...

claim paper

Read More »

135

click to vote

ICML
2010
IEEE

188views Machine Learning» more ICML 2010»

Constructing States for Reinforcement Learning

15 years 26 days ago

Download www.icml2010.org

POMDPs are the models of choice for reinforcement learning (RL) tasks where the environment cannot be observed directly. In many applications we need to learn the POMDP structure ...

M. M. Hassan Mahmud

claim paper

Read More »

155

click to vote

AAAI
2011

202views Intelligent Agents» more AAAI 2011»

Value Function Approximation in Reinforcement Learning Using the Fourier Basis

14 years 2 months ago

Download people.csail.mit.edu

We describe the Fourier Basis, a linear value function approximation scheme based on the Fourier Series. We empirically evaluate its properties, and demonstrate that it performs w...

George Konidaris, Sarah Osentoski, Philip Thomas

claim paper

Read More »

« Prev « First page 11 / 46 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers