Sciweavers

166 search results - page 6 / 34
» Online model learning in adversarial Markov decision process...
IROS
2009
IEEE
15 years 4 months ago
Bayesian reinforcement learning in continuous POMDPs with gaussian processes
Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle real-world sequential decision processes but require a known model to be solv...
Patrick Dallaire, Camille Besse, Stéphane R...
ICPR
2000
IEEE
15 years 2 months ago
Realtime Online Adaptive Gesture Recognition
We introduce an online adaptive algorithm for learning gesture models. By learning gesture models in an online fashion, the gesture recognition process is made more robust, and th...
Andrew D. Wilson, Aaron F. Bobick
ICMLC
2005
Springer
15 years 3 months ago
Adaptive Online Multi-stroke Sketch Recognition Based on Hidden Markov Model
This paper presents a novel approach for adaptive online multi-stroke sketch recognition based on Hidden Markov Model (HMM). The method views the drawing sketch as the result of a ...
Zhengxing Sun, Wei Jiang, Jianyong Sun
ATAL
2009
Springer
15 years 4 months ago
Online exploration in least-squares policy iteration
One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...
Lihong Li, Michael L. Littman, Christopher R. Mans...
NIPS
2000
14 years 11 months ago
Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task
The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...
Brian Sallans, Geoffrey E. Hinton