Sciweavers

42 search results - page 9 / 9
» Incremental learning of gestures by imitation in a humanoid ...
Sort
View
PAMI
2007
186views more  PAMI 2007»
13 years 4 months ago
Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes
—This paper presents a method for learning decision theoretic models of human behaviors from video data. Our system learns relationships between the movements of a person, the co...
Jesse Hoey, James J. Little
NN
2010
Springer
125views Neural Networks» more  NN 2010»
13 years 3 months ago
Parameter-exploring policy gradients
We present a model-free reinforcement learning method for partially observable Markov decision problems. Our method estimates a likelihood gradient by sampling directly in paramet...
Frank Sehnke, Christian Osendorfer, Thomas Rü...