Search Sciweavers | Sciweavers

575 search results - page 33 / 115

» Reinforcement Learning State Estimator

IJRR
2011

159 views

Learning visual representations for perception-action systems

14 years 11 months ago

Download robot-learning.de

We discuss vision as a sensory modality for systems that effect actions in response to perceptions. While the internal representations informed by vision may be arbitrarily compl...

Justus H. Piater, Sébastien Jodogne, Renaud...

NECO
2010

97 views

Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

15 years 2 months ago

Download www.kyb.tuebingen.mpg.de

Most conventional Policy Gradient Reinforcement Learning (PGRL) algorithms neglect (or do not explicitly make use of) a term in the average reward gradient with respect to the pol...

Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto...

FLAIRS
2006

103 views, Artificial Intelligence

Using Active Relocation to Aid Reinforcement Learning

15 years 5 months ago

Download www.cs.utexas.edu

We propose a new framework for aiding a reinforcement learner by allowing it to relocate, or move, to a state it selects so as to decrease the number of steps it needs to take in ...

Lilyana Mihalkova, Raymond J. Mooney

ECAI
2006
Springer

89 views, Artificial Intelligence

Learning by Automatic Option Discovery from Conditionally Terminating Sequences

15 years 8 months ago

Download www.ceng.metu.edu.tr

This paper proposes a novel approach to discover options in the form of conditionally terminating sequences, and shows how they can be integrated into reinforcement learn...

Sertan Girgin, Faruk Polat, Reda Alhajj

ICML
2005
IEEE

100 views, Machine Learning

Reinforcement learning with Gaussian processes

16 years 5 months ago

Download www.machinelearning.org

Gaussian Process Temporal Difference (GPTD) learning offers a Bayesian solution to the policy evaluation problem of reinforcement learning. In this paper we extend the GPTD framew...

Yaakov Engel, Shie Mannor, Ron Meir
