Sciweavers

81 search results - page 10 / 17
» The Optimal Reward Baseline for Gradient-Based Reinforcement...
BMCV
2000
Springer
15 years 1 month ago
Unsupervised Learning of Biologically Plausible Object Recognition Strategies
Recent psychological and neurological evidence suggests that biological object recognition is a process of matching sensed images to stored iconic memories. This paper presents a p...
Bruce A. Draper, Kyungim Baek
ATAL
2008
Springer
14 years 11 months ago
Transfer of task representation in reinforcement learning using policy-based proto-value functions
Reinforcement Learning research is traditionally devoted to solving single-task problems. Therefore, whenever a new task is faced, learning must be restarted from scratch. Recently, ...
Eliseo Ferrante, Alessandro Lazaric, Marcello Rest...
IJCNN
2006
IEEE
15 years 3 months ago
Learning a Rendezvous Task with Dynamic Joint Action Perception
Groups of reinforcement learning agents interacting in a common environment often fail to learn optimal behaviors. Poor performance is particularly common in environmen...
Nancy Fulda, Dan Ventura
ICRA
2008
IEEE
15 years 4 months ago
Bayesian reinforcement learning in continuous POMDPs with application to robot navigation
We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...
Stéphane Ross, Brahim Chaib-draa, Joelle Pi...
INLG
2010
Springer
14 years 7 months ago
Hierarchical Reinforcement Learning for Adaptive Text Generation
We present a novel approach to natural language generation (NLG) that applies hierarchical reinforcement learning to text generation in the wayfinding domain. Our approach aims to...
Nina Dethlefs, Heriberto Cuayáhuitl