Reinforcement Learning for Mapping Instructions to Actions

In this paper, we present a reinforcement learning approach for mapping natural language instructions to sequences of executable actions. We assume access to a reward function that defines the quality of the executed actions. During training, the learner repeatedly constructs action sequences for a set of documents, executes those actions, and observes the resulting reward. We use a policy gradient algorithm to estimate the parameters of a log-linear model for action selection. We apply our method to interpret instructions in two domains: Windows troubleshooting guides and game tutorials. Our results demonstrate that this method can rival supervised learning techniques while requiring few or no annotated training examples.
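The training loop described in the abstract (build an action sequence for a document, execute it, observe a reward, update a log-linear policy by policy gradient) can be illustrated with a small sketch. This is not the authors' implementation. The toy vocabulary, the one-hot feature function `phi`, the hidden `TARGET` mapping, the all-or-nothing `reward`, and the learning rate are all assumptions made up for this example; the update rule is a plain REINFORCE-style step.

```python
import math
import random

N_WORDS, N_ACTIONS = 3, 3
TARGET = [2, 0, 1]                # hypothetical hidden word -> action mapping
DOCS = [[0, 1], [1, 2], [2, 0]]   # toy "documents": short sequences of word ids


def phi(word, action, n_words, n_actions):
    """Assumed feature function: one-hot indicator of the (word, action) pair."""
    vec = [0.0] * (n_words * n_actions)
    vec[word * n_actions + action] = 1.0
    return vec


def action_probs(theta, feats):
    """Log-linear policy: p(a | s) is proportional to exp(theta . phi(s, a))."""
    scores = [sum(t * f for t, f in zip(theta, fv)) for fv in feats]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def reward(doc, actions):
    """Assumed reward: 1 if the whole action sequence is correct, else 0."""
    return 1.0 if all(a == TARGET[w] for w, a in zip(doc, actions)) else 0.0


def train(docs, reward_fn, n_words, n_actions, episodes=2000, lr=0.5, seed=0):
    rng = random.Random(seed)
    theta = [0.0] * (n_words * n_actions)
    for _ in range(episodes):
        doc = rng.choice(docs)
        grad = [0.0] * len(theta)
        actions = []
        for word in doc:
            feats = [phi(word, a, n_words, n_actions) for a in range(n_actions)]
            probs = action_probs(theta, feats)
            chosen = rng.choices(range(n_actions), weights=probs)[0]
            actions.append(chosen)
            # grad log p(chosen | s) = phi(s, chosen) - sum_a p(a | s) * phi(s, a)
            for a, (p, fv) in enumerate(zip(probs, feats)):
                coeff = (1.0 if a == chosen else 0.0) - p
                for i, f in enumerate(fv):
                    grad[i] += coeff * f
        r = reward_fn(doc, actions)  # observed only after the full sequence runs
        theta = [t + lr * r * g for t, g in zip(theta, grad)]
    return theta


def greedy_actions(theta, doc, n_words, n_actions):
    """Pick the highest-probability action for each word in the document."""
    result = []
    for word in doc:
        feats = [phi(word, a, n_words, n_actions) for a in range(n_actions)]
        probs = action_probs(theta, feats)
        result.append(max(range(n_actions), key=lambda a: probs[a]))
    return result
```

Calling `train(DOCS, reward, N_WORDS, N_ACTIONS)` and then `greedy_actions` on a document shows the policy recovering the toy mapping from reward alone, with no annotated action sequences. The paper's actual features, environments, and reward functions are far richer than this sketch.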

S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer, Regina Barzilay

ACL 2009 | Computational Linguistics | Executable Actions | Natural Language Instructions | Reward Function |

» Reinforcement Learning in MirrorBot

» Interactive learning of mappings from visual percepts to actions

» Reinforcement Learning: An Introduction

» Reading between the Lines: Learning to Map High-Level Instructions to Commands

» Task-Driven Discretization of the Joint Space of Visual Percepts and Continuous Actions

» Transfer via intertask mappings in policy search reinforcement learning

» Batch Reinforcement Learning with State Importance

» Skill Acquisition Via Transfer Learning and Advice Taking

Post Info

Added	16 Feb 2011
Updated	16 Feb 2011
Type	Journal
Year	2009
Where	ACL
Authors	S. R. K. Branavan, Harr Chen, Luke S. Zettlemoyer, Regina Barzilay
