Sciweavers

176 search results - page 3 / 36
» Optimal Sample Selection for Batch-mode Reinforcement Learni...
Sort
View
ICRA
2008
IEEE
173views Robotics» more  ICRA 2008»
13 years 11 months ago
Bayesian reinforcement learning in continuous POMDPs with application to robot navigation
— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...
Stéphane Ross, Brahim Chaib-draa, Joelle Pi...
ICMLA
2009
13 years 3 months ago
The Neuro Slot Car Racer: Reinforcement Learning in a Real World Setting
This paper describes a novel real-world reinforcement learning application: The Neuro Slot Car Racer. In addition to presenting the system and first results based on Neural Fitted...
Tim C. Kietzmann, Martin Riedmiller
ATAL
2005
Springer
13 years 11 months ago
Improving reinforcement learning function approximators via neuroevolution
Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...
Shimon Whiteson
PKDD
2009
Springer
152views Data Mining» more  PKDD 2009»
13 years 12 months ago
Feature Selection for Value Function Approximation Using Bayesian Model Selection
Abstract. Feature selection in reinforcement learning (RL), i.e. choosing basis functions such that useful approximations of the unkown value function can be obtained, is one of th...
Tobias Jung, Peter Stone
IWANN
1999
Springer
13 years 9 months ago
Using Temporal Neighborhoods to Adapt Function Approximators in Reinforcement Learning
To avoid the curse of dimensionality, function approximators are used in reinforcement learning to learn value functions for individual states. In order to make better use of comp...
R. Matthew Kretchmar, Charles W. Anderson