Search Sciweavers | Sciweavers

21 search results - page 3 / 5

» Variance Reduction Techniques for Gradient Estimates in Rein...

click to vote

ECML
2004
Springer

77views Machine Learning» more ECML 2004»

Filtered Reinforcement Learning

13 years 11 months ago

Download eprints.pascal-network.org

Reinforcement learning (RL) algorithms attempt to assign the credit for rewards to the actions that contributed to the reward. Thus far, credit assignment has been done in one of t...

Douglas Aberdeen

claim paper

Read More »

click to vote

ICML
2003
IEEE

132views Machine Learning» more ICML 2003»

Low Bias Bagged Support Vector Machines

14 years 6 months ago

Download web.engr.oregonstate.edu

Theoretical and experimental analyses of bagging indicate that it is primarily a variance reduction technique. This suggests that bagging should be applied to learning algorithms ...

Giorgio Valentini, Thomas G. Dietterich

claim paper

Read More »

click to vote

CIS
2005
Springer

129views Applied Computing» more CIS 2005»

An RLS-Based Natural Actor-Critic Algorithm for Locomotion of a Two-Linked Robot Arm

13 years 11 months ago

Download www-clmc.usc.edu

Recently, actor-critic methods have drawn much interests in the area of reinforcement learning, and several algorithms have been studied along the line of the actor-critic strategy...

Jooyoung Park, Jongho Kim, Daesung Kang

claim paper

Read More »

click to vote

ACL
2009

123views Computational Linguistics» more ACL 2009»

Reinforcement Learning for Mapping Instructions to Actions

13 years 3 months ago

Download www.aclweb.org

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...

S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...

claim paper

Read More »

click to vote

ICML
2002
IEEE

127views Machine Learning» more ICML 2002»

Action Refinement in Reinforcement Learning by Probability Smoothing

14 years 6 months ago

Download www.cs.berkeley.edu

In many reinforcement learning applications, the set of possible actions can be partitioned by the programmer into subsets of similar actions. This paper presents a technique for ...

Carles Sierra, Dídac Busquets, Ramon L&oacu...

claim paper

Read More »

« Prev « First page 3 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers