Sciweavers

575 search results - page 33 / 115
» Reinforcement Learning State Estimator
Sort
View
IJRR
2011
159views more  IJRR 2011»
14 years 4 months ago
Learning visual representations for perception-action systems
We discuss vision as a sensory modality for systems that effect actions in response to perceptions. While the internal representations informed by vision may be arbitrarily compl...
Justus H. Piater, Sébastien Jodogne, Renaud...
NECO
2010
97views more  NECO 2010»
14 years 8 months ago
Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning
Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...
Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...
FLAIRS
2006
14 years 11 months ago
Using Active Relocation to Aid Reinforcement Learning
We propose a new framework for aiding a reinforcement learner by allowing it to relocate, or move, to a state it selects so as to decrease the number of steps it needs to take in ...
Lilyana Mihalkova, Raymond J. Mooney
ECAI
2006
Springer
15 years 1 months ago
Learning by Automatic Option Discovery from Conditionally Terminating Sequences
Abstract. This paper proposes a novel approach to discover options in the form of conditionally terminating sequences, and shows how they can be integrated into reinforcement learn...
Sertan Girgin, Faruk Polat, Reda Alhajj
ICML
2005
IEEE
15 years 10 months ago
Reinforcement learning with Gaussian processes
Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...
Yaakov Engel, Shie Mannor, Ron Meir