Search Sciweavers | Sciweavers

56 search results - page 3 / 12

» Reinforcement Learning for Average Reward Zero-Sum Games


ACL
2009

123 views | Computational Linguistics

Reinforcement Learning for Mapping Instructions to Actions

14 years 11 months ago

Download www.aclweb.org

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function tha...

S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer,...


ICONIP
2007

147 views | Information Technology

Finding Exploratory Rewards by Embodied Evolution and Constrained Reinforcement Learning in the Cyber Rodents

15 years 3 months ago

Download www.nc.irp.oist.jp

The aim of the Cyber Rodent project [1] is to elucidate the origin of our reward and affective systems by building artificial agents that share the natural biological constraints...

Eiji Uchibe, Kenji Doya


JMLR
2010

189 views

Adaptive Step-size Policy Gradients with Average Reward Metric

14 years 8 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...


UAI
2001

129 views | Artificial Intelligence

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

15 years 3 months ago

Download cs.anu.edu.au

There exist a number of reinforcement learning algorithms which learn by climbing the gradient of expected reward. Their long-run convergence has been proved, even in partially ob...

Lex Weaver, Nigel Tao


JDCTA
2010

160 views

Learning and Decision Making in Human During a Game of Matching Pennies

14 years 8 months ago

Download www.aicit.org

To gain insights into the neural basis of such adaptive decision-making processes, we investigated the nature of learning process in humans playing a competitive game with binary ...

Jianfeng Hu, Xiaofeng Li, Jinghai Yin
