Sciweavers

121 search results - page 17 / 25
» Learning Decision Theoretic Utilities through Reinforcement ...
Sort
View
ICRA
2010
IEEE
128views Robotics» more  ICRA 2010»
14 years 8 months ago
A game-theoretic procedure for learning hierarchically structured strategies
— This paper addresses the problem of acquiring a hierarchically structured robotic skill in a nonstationary environment. This is achieved through a combination of learning primi...
Benjamin Rosman, Subramanian Ramamoorthy
CORR
2010
Springer
152views Education» more  CORR 2010»
14 years 9 months ago
Neuroevolutionary optimization
Temporal difference methods are theoretically grounded and empirically effective methods for addressing reinforcement learning problems. In most real-world reinforcement learning ...
Eva Volná
IAT
2010
IEEE
14 years 7 months ago
Selecting Operator Queries Using Expected Myopic Gain
When its human operator cannot continuously supervise (much less teleoperate) an agent, the agent should be able to recognize its limitations and ask for help when it risks making...
Robert Cohn, Michael Maxim, Edmund H. Durfee, Sati...
CVPR
2004
IEEE
15 years 11 months ago
Value Directed Learning of Gestures and Facial Displays
This paper presents a method for learning decision theoretic models of facial expressions and gestures from video data. We consider that the meaning of a facial display or gesture...
Jesse Hoey, James J. Little
ICANN
2007
Springer
15 years 3 months ago
Solving Deep Memory POMDPs with Recurrent Policy Gradients
Abstract. This paper presents Recurrent Policy Gradients, a modelfree reinforcement learning (RL) method creating limited-memory stochastic policies for partially observable Markov...
Daan Wierstra, Alexander Förster, Jan Peters,...