Search Sciweavers | Sciweavers

575 search results - page 19 / 115

» Reinforcement Learning State Estimator

click to vote

ICRA
1995
IEEE

123views Robotics» more ICRA 1995»

Vision-Based Reinforcement Learning for Purposive Behavior Acquisition

15 years 1 months ago

Download www.er.ams.eng.osaka-u.ac.jp

This paper presents a method of vision-based reinforcement learning by which a robot learns to shoot a ball into a goal, and discusses several issues in applying the reinforcement...

Minoru Asada, Shoichi Noda, Sukoya Tawaratsumida, ...

claim paper

Read More »

click to vote

ICML
2002
IEEE

155views Machine Learning» more ICML 2002»

Discovering Hierarchy in Reinforcement Learning with HEXQ

15 years 10 months ago

Download www.cs.berkeley.edu

An open problem in reinforcement learning is discovering hierarchical structure. HEXQ, an algorithm which automatically attempts to decompose and solve a model-free factored MDP h...

Bernhard Hengst

claim paper

Read More »

click to vote

NIPS
2001

144views Information Technology» more NIPS 2001»

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

14 years 11 months ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

claim paper

Read More »

click to vote

ATAL
2004
Springer

101views Intelligent Agents» more ATAL 2004»

From Global Selective Perception to Local Selective Perception

15 years 3 months ago

Download www.damas.ift.ulaval.ca

This paper presents a reinforcement learning algorithm used to allocate tasks to agents in an uncertain real-time environment. In such environment, tasks have to be analyzed and a...

Sébastien Paquet, Nicolas Bernier, Brahim C...

claim paper

Read More »

click to vote

ECML
2003
Springer

149views Machine Learning» more ECML 2003»

Could Active Perception Aid Navigation of Partially Observable Grid Worlds?

15 years 3 months ago

Download homepages.inf.ed.ac.uk

Due to the unavoidable fact that a robot’s sensors will be limited in some manner, it is entirely possible that it can ﬁnd itself unable to distinguish between diﬀering state...

Paul A. Crook, Gillian Hayes

claim paper

Read More »

« Prev « First page 19 / 115 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers