Search Sciweavers | Sciweavers

40 search results - page 4 / 8

» Learning Partially Observable Action Schemas

click to vote

ECML
2007
Springer

108views Machine Learning» more ECML 2007»

Safe Q-Learning on Complete History Spaces

13 years 12 months ago

Download www.ni.uos.de

In this article, we present an idea for solving deterministic partially observable markov decision processes (POMDPs) based on a history space containing sequences of past observat...

Stephan Timmer, Martin Riedmiller

claim paper

Read More »

click to vote

ICML
2004
IEEE

120views Machine Learning» more ICML 2004»

Utile distinction hidden Markov models

14 years 6 months ago

Download www.idsia.ch

This paper addresses the problem of constructing good action selection policies for agents acting in partially observable environments, a class of problems generally known as Part...

Daan Wierstra, Marco Wiering

claim paper

Read More »

click to vote

ICRA
2008
IEEE

173views Robotics» more ICRA 2008»

Bayesian reinforcement learning in continuous POMDPs with application to robot navigation

14 years 6 days ago

Download www.cs.cmu.edu

— We consider the problem of optimal control in continuous and partially observable environments when the parameters of the model are not known exactly. Partially Observable Mark...

Stéphane Ross, Brahim Chaib-draa, Joelle Pi...

claim paper

Read More »

click to vote

ECCV
2004
Springer

361views Computer Vision» more ECCV 2004»

Decision Theoretic Modeling of Human Facial Displays

14 years 7 months ago

Download people.cs.ubc.ca

We present a vision based, adaptive, decision theoretic model of human facial displays in interactions. The model is a partially observable Markov decision process, or POMDP. A POM...

Jesse Hoey, James J. Little

claim paper

Read More »

click to vote

CORR
2011
Springer

161views Education» more CORR 2011»

Doubly Robust Policy Evaluation and Learning

12 years 9 months ago

Download www.icml-2011.org

We study decision making in environments where the reward is only partially observed, but can be modeled as a function of an action and an observed context. This setting, known as...

Miroslav Dudík, John Langford, Lihong Li

claim paper

Read More »

« Prev « First page 4 / 8 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers