Sciweavers

4446 search results - page 34 / 890
» Learning Observer Agents
Sort
View
IJCAI
2007
14 years 11 months ago
Inferring Complex Agent Motions from Partial Trajectory Observations
Finnegan Southey, Wesley Loh, Dana F. Wilkinson
ATAL
2009
Springer
15 years 4 months ago
SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...
Michael R. James, Satinder P. Singh
ECCV
2004
Springer
15 years 11 months ago
Decision Theoretic Modeling of Human Facial Displays
We present a vision based, adaptive, decision theoretic model of human facial displays in interactions. The model is a partially observable Markov decision process, or POMDP. A POM...
Jesse Hoey, James J. Little