Search Sciweavers | Sciweavers

238 search results - page 5 / 48

» Value-Function Approximations for Partially Observable Marko...

AIPS
2008

111 views · Artificial Intelligence

Multiagent Planning Under Uncertainty with Stochastic Communication Delays

15 years 1 month ago

Download www.aaai.org

We consider the problem of cooperative multiagent planning under uncertainty, formalized as a decentralized partially observable Markov decision process (Dec-POMDP). Unfortunately...

Matthijs T. J. Spaan, Frans A. Oliehoek, Nikos A. ...

AAAI
2008

144 views · Intelligent Agents

A Variance Analysis for POMDP Policy Evaluation

15 years 1 month ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes have been studied widely as a model for decision making under uncertainty, and a number of methods have been developed to find the s...

Mahdi Milani Fard, Joelle Pineau, Peng Sun

NIPS
2001

144 views · Information Technology

Variance Reduction Techniques for Gradient Estimates in Reinforcement Learning

15 years 29 days ago

Download jmlr.csail.mit.edu

Policy gradient methods for reinforcement learning avoid some of the undesirable properties of the value function approaches, such as policy degradation (Baxter and Bartlett, 2001...

Evan Greensmith, Peter L. Bartlett, Jonathan Baxte...

ARTMED
2000

105 views

Planning treatment of ischemic heart disease with partially observable Markov decision processes

14 years 11 months ago

Download groups.csail.mit.edu

Diagnosis of a disease and its treatment are not separate, one-shot activities. Instead, they are very often dependent and interleaved over time. This is mostly due to uncertainty...

Milos Hauskrecht, Hamish S. F. Fraser

ECML
2005
Springer

143 views · Machine Learning

Active Learning in Partially Observable Markov Decision Processes

15 years 5 months ago

Download www.cs.mcgill.ca

This paper examines the problem of finding an optimal policy for a Partially Observable Markov Decision Process (POMDP) when the model is not known or is only poorly specified. W...

Robin Jaulmes, Joelle Pineau, Doina Precup
