Sciweavers

575 search results - page 67 / 115
» Reinforcement Learning State Estimator
Sort
View
AAAI
2008
15 years 9 days ago
Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation
Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...
Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...
IBERAMIA
2010
Springer
14 years 8 months ago
Dynamic Reward Shaping: Training a Robot by Voice
Reinforcement Learning is commonly used for learning tasks in robotics, however, traditional algorithms can take very long training times. Reward shaping has been recently used to ...
Ana C. Tenorio-Gonzalez, Eduardo F. Morales, Luis ...
TIP
2008
86views more  TIP 2008»
14 years 10 months ago
Learning the Dynamics and Time-Recursive Boundary Detection of Deformable Objects
We propose a principled framework for recursively segmenting deformable objects across a sequence of frames. We demonstrate the usefulness of this method on left ventricular segmen...
Walter Sun, Müjdat Çetin, Raymond C. C...

Publication
222views
15 years 7 months ago
Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration
Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...
Christos Dimitrakakis, Michail G. Lagoudakis
ICML
1996
IEEE
15 years 10 months ago
Learning Evaluation Functions for Large Acyclic Domains
Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...
Justin A. Boyan, Andrew W. Moore