Search Sciweavers | Sciweavers

121 search results - page 17 / 25

» Learning Decision Theoretic Utilities through Reinforcement ...

click to vote

ICRA
2010
IEEE

128views Robotics» more ICRA 2010»

A game-theoretic procedure for learning hierarchically structured strategies

14 years 10 months ago

Download homepages.inf.ed.ac.uk

— This paper addresses the problem of acquiring a hierarchically structured robotic skill in a nonstationary environment. This is achieved through a combination of learning primi...

Benjamin Rosman, Subramanian Ramamoorthy

claim paper

Read More »

click to vote

CORR
2010
Springer

152views Education» more CORR 2010»

Neuroevolutionary optimization

14 years 11 months ago

Download jmlr.csail.mit.edu

Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...

Eva Volná

claim paper

Read More »

117

click to vote

IAT
2010
IEEE

167views Intelligent Agents» more IAT 2010»

Selecting Operator Queries Using Expected Myopic Gain

14 years 9 months ago

Download www.eecs.umich.edu

When its human operator cannot continuously supervise (much less teleoperate) an agent, the agent should be able to recognize its limitations and ask for help when it risks making...

Robert Cohn, Michael Maxim, Edmund H. Durfee, Sati...

claim paper

Read More »

click to vote

CVPR
2004
IEEE

182views Computer Vision» more CVPR 2004»

Value Directed Learning of Gestures and Facial Displays

16 years 1 months ago

Download people.cs.ubc.ca

This paper presents a method for learning decision theoretic models of facial expressions and gestures from video data. We consider that the meaning of a facial display or gesture...

Jesse Hoey, James J. Little

claim paper

Read More »

click to vote

ICANN
2007
Springer

95views Neural Networks» more ICANN 2007»

Solving Deep Memory POMDPs with Recurrent Policy Gradients

15 years 5 months ago

Download www.idsia.ch

Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...

Daan Wierstra, Alexander Förster, Jan Peters,...

claim paper

Read More »

« Prev « First page 17 / 25 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers