Search Sciweavers | Sciweavers

166 search results - page 6 / 34

» Online model learning in adversarial Markov decision process...

109

click to vote

IROS
2009
IEEE

206views Robotics» more IROS 2009»

Bayesian reinforcement learning in continuous POMDPs with gaussian processes

15 years 6 months ago

Download www.cs.cmu.edu

— Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle realworld sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

claim paper

Read More »

108

click to vote

ICPR
2000
IEEE

100views computer vision» more ICPR 2000»

Realtime Online Adaptive Gesture Recognition

15 years 3 months ago

Download vismod.media.mit.edu

We introduce an online adaptive algorithm for learning gesture models. By learning gesture models in an online fashion, the gesture recognition process is made more robust, and th...

Andrew D. Wilson, Aaron F. Bobick

claim paper

Read More »

101

click to vote

ICMLC
2005
Springer

176views Machine Learning» more ICMLC 2005»

Adaptive Online Multi-stroke Sketch Recognition Based on Hidden Markov Model

15 years 4 months ago

Download cs.nju.edu.cn

This paper presents a novel approach for adaptive online multi-stroke sketch recognition based on Hidden Markov Model (HMM). The method views the drawing sketch as the result of a ...

Zhengxing Sun, Wei Jiang, Jianyong Sun

claim paper

Read More »

click to vote

ATAL
2009
Springer

146views Intelligent Agents» more ATAL 2009»

Online exploration in least-squares policy iteration

15 years 5 months ago

Download www.aamas-conference.org

One of the key problems in reinforcement learning is balancing exploration and exploitation. Another is learning and acting in large or even continuous Markov decision processes (...

Lihong Li, Michael L. Littman, Christopher R. Mans...

claim paper

Read More »

click to vote

NIPS
2000

127views Information Technology» more NIPS 2000»

Using Free Energies to Represent Q-values in a Multiagent Reinforcement Learning Task

15 years 19 days ago

Download members.chello.at

The problem of reinforcement learning in large factored Markov decision processes is explored. The Q-value of a state-action pair is approximated by the free energy of a product o...

Brian Sallans, Geoffrey E. Hinton

claim paper

Read More »

« Prev « First page 6 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers