Search Sciweavers | Sciweavers

575 search results - page 67 / 115

» Reinforcement Learning State Estimator

124

click to vote

AAAI
2008

207views Intelligent Agents» more AAAI 2008»

Adaptive Importance Sampling with Automatic Model Selection in Value Function Approximation

15 years 6 months ago

Download sugiyama-www.cs.titech.ac.jp

Off-policy reinforcement learning is aimed at efficiently reusing data samples gathered in the past, which is an essential problem for physically grounded AI as experiments are us...

Hirotaka Hachiya, Takayuki Akiyama, Masashi Sugiya...

claim paper

Read More »

142

click to vote

IBERAMIA
2010
Springer

245views Artificial Intelligence» more IBERAMIA 2010»

Dynamic Reward Shaping: Training a Robot by Voice

15 years 2 months ago

Download ccc.inaoep.mx

Reinforcement Learning is commonly used for learning tasks in robotics, however, traditional algorithms can take very long training times. Reward shaping has been recently used to ...

Ana C. Tenorio-Gonzalez, Eduardo F. Morales, Luis ...

claim paper

Read More »

153

click to vote

TIP
2008

86views more TIP 2008»

Learning the Dynamics and Time-Recursive Boundary Detection of Deformable Objects

15 years 4 months ago

Download research.sabanciuniv.edu

We propose a principled framework for recursively segmenting deformable objects across a sequence of frames. We demonstrate the usefulness of this method on left ventricular segmen...

Walter Sun, Müjdat Çetin, Raymond C. C...

claim paper

Read More »

160

click to vote

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

16 years 1 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

156

click to vote

ICML
1996
IEEE

162views Machine Learning» more ICML 1996»

Learning Evaluation Functions for Large Acyclic Domains

16 years 5 months ago

Download www.ri.cmu.edu

Some of the most successful recent applications of reinforcement learning have used neural networks and the TD algorithm to learn evaluation functions. In this paper, we examine t...

Justin A. Boyan, Andrew W. Moore

claim paper

Read More »

« Prev « First page 67 / 115 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers