Search Sciweavers | Sciweavers

168 search results - page 2 / 34

» Reinforcement Learning Algorithm for Partially Observable Ma...

click to vote

ATAL
2009
Springer

198views Intelligent Agents» more ATAL 2009»

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

13 years 11 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

claim paper

Read More »

click to vote

PAMI
2007

186views more PAMI 2007»

Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes

13 years 4 months ago

Download people.ee.duke.edu

—This paper presents a method for learning decision theoretic models of human behaviors from video data. Our system learns relationships between the movements of a person, the co...

Jesse Hoey, James J. Little

claim paper

Read More »

click to vote

ICML
2008
IEEE

135views Machine Learning» more ICML 2008»

Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs

14 years 5 months ago

Download mapleleaf.csail.mit.edu

Partially Observable Markov Decision Processes (POMDPs) have succeeded in planning domains that require balancing actions that increase an agent's knowledge and actions that ...

Finale Doshi, Joelle Pineau, Nicholas Roy

claim paper

Read More »

click to vote

ICANN
2001
Springer

123views Neural Networks» more ICANN 2001»

Market-Based Reinforcement Learning in Partially Observable Worlds

13 years 9 months ago

Download www.hutter1.net

Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an ...

Ivo Kwee, Marcus Hutter, Jürgen Schmidhuber

claim paper

Read More »

click to vote

IROS
2009
IEEE

206views Robotics» more IROS 2009»

Bayesian reinforcement learning in continuous POMDPs with gaussian processes

13 years 11 months ago

Download www.cs.cmu.edu

— Partially Observable Markov Decision Processes (POMDPs) provide a rich mathematical model to handle realworld sequential decision processes but require a known model to be solv...

Patrick Dallaire, Camille Besse, Stéphane R...

claim paper

Read More »

« Prev « First page 2 / 34 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers