Search Sciweavers | Sciweavers

238 search results - page 2 / 48

» Value-Function Approximations for Partially Observable Marko...

click to vote

ECML
2005
Springer

120views Machine Learning» more ECML 2005»

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

13 years 10 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...

Masoumeh T. Izadi, Doina Precup

claim paper

Read More »

click to vote

CVIU
2010

163views more CVIU 2010»

Automated handwashing assistance for persons with dementia using video and a partially observable Markov decision process

13 years 4 months ago

Download www.cs.utoronto.ca

This paper presents a real-time vision-based system to assist a person with dementia wash their hands. The system uses only video inputs, and assistance is given as either verbal ...

Jesse Hoey, Pascal Poupart, Axel von Bertoldi, Tam...

claim paper

Read More »

click to vote

JMLR
2006

143views more JMLR 2006»

Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation

13 years 4 months ago

Download www.aaai.org

We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...

Rémi Munos

claim paper

Read More »

click to vote

AAAI
1996

197views Intelligent Agents» more AAAI 1996»

Computing Optimal Policies for Partially Observable Decision Processes Using Compact Representations

13 years 5 months ago

Download people.cs.ubc.ca

: Partially-observable Markov decision processes provide a very general model for decision-theoretic planning problems, allowing the trade-offs between various courses of actions t...

Craig Boutilier, David Poole

claim paper

Read More »

click to vote

AAAI
2006

157views Intelligent Agents» more AAAI 2006»

Compact, Convex Upper Bound Iteration for Approximate POMDP Planning

13 years 6 months ago

Download www.aaai.org

Partially observable Markov decision processes (POMDPs) are an intuitive and general way to model sequential decision making problems under uncertainty. Unfortunately, even approx...

Tao Wang, Pascal Poupart, Michael H. Bowling, Dale...

claim paper

Read More »

« Prev « First page 2 / 48 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers