Sciweavers search results for "A Variance Analysis for POMDP Policy Evaluation" (7 results, page 1 of 2)

AAAI 2008
A Variance Analysis for POMDP Policy Evaluation
Partially Observable Markov Decision Processes have been studied widely as a model for decision making under uncertainty, and a number of methods have been developed to find the s...
Mahdi Milani Fard, Joelle Pineau, Peng Sun
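
A variance analysis of this kind concerns the spread of value estimates under a fixed policy. As a minimal illustrative sketch (not the paper's method), the Python below estimates a POMDP policy's value by Monte Carlo rollouts and reports the sample mean and sample variance of the discounted return; the env interface (reset, initial_belief, step, update_belief) and the belief-conditioned policy are assumed names for illustration, not a real API.

    def evaluate_policy(env, policy, episodes=1000, gamma=0.95):
        """Monte Carlo policy evaluation in a POMDP: sample discounted
        returns under `policy` and report their mean and variance."""
        returns = []
        for _ in range(episodes):
            env.reset()
            belief = env.initial_belief()   # belief over hidden states
            done, ret, discount = False, 0.0, 1.0
            while not done:
                action = policy(belief)                  # act on the belief
                obs, reward, done = env.step(action)
                belief = env.update_belief(belief, action, obs)
                ret += discount * reward
                discount *= gamma
            returns.append(ret)
        mean = sum(returns) / len(returns)
        var = sum((r - mean) ** 2 for r in returns) / (len(returns) - 1)
        return mean, var

The sample variance returned here is exactly the quantity such an analysis tries to characterize and bound.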

NIPS 2008
Particle Filter-based Policy Gradient in POMDPs
Our setting is a Partially Observable Markov Decision Process with continuous state, observation and action spaces. Decisions are based on a Particle Filter for estimating the bel...
Pierre-Arnaud Coquelin, Romain Deguest, Rém...
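
The belief-tracking component such a method builds on can be sketched as a bootstrap particle filter update. This is a generic sketch under assumed model interfaces (transition_sample and observation_likelihood are hypothetical stand-ins for the continuous-state POMDP model), not the paper's gradient estimator.

    import numpy as np

    def particle_filter_step(particles, weights, action, observation,
                             transition_sample, observation_likelihood):
        """One bootstrap particle filter update of the belief.
        transition_sample(s, a): draws a successor state sample.
        observation_likelihood(o, s): density p(o | s)."""
        # Propagate each particle through the stochastic transition model.
        particles = np.array([transition_sample(s, action) for s in particles])
        # Reweight particles by how well they explain the new observation.
        weights = weights * np.array([observation_likelihood(observation, s)
                                      for s in particles])
        weights = weights / weights.sum()
        # Multinomial resampling to counter weight degeneracy.
        idx = np.random.choice(len(particles), size=len(particles), p=weights)
        return particles[idx], np.full(len(particles), 1.0 / len(particles))

Decisions are then taken as a function of the particle set, which approximates the belief state.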

ATAL 2009, Springer
SarsaLandmark: an algorithm for learning in POMDPs with landmarks
Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...
Michael R. James, Satinder P. Singh
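
For reference, the base algorithm the abstract refers to is tabular Sarsa(λ) with eligibility traces; a standard sketch over observations (treated as estimated states) follows. The env interface (actions, reset, step) is an assumed name, and the landmark-specific trace handling that SarsaLandmark adds is not shown.

    import random
    from collections import defaultdict

    def sarsa_lambda(env, episodes=500, alpha=0.1, gamma=0.99,
                     lam=0.9, epsilon=0.1):
        """Tabular Sarsa(lambda) with replacing eligibility traces."""
        Q = defaultdict(float)

        def eps_greedy(obs):
            if random.random() < epsilon:
                return random.choice(env.actions)
            return max(env.actions, key=lambda a: Q[(obs, a)])

        for _ in range(episodes):
            traces = defaultdict(float)
            obs, done = env.reset(), False
            action = eps_greedy(obs)
            while not done:
                next_obs, reward, done = env.step(action)
                next_action = eps_greedy(next_obs)
                target = reward + (0.0 if done
                                   else gamma * Q[(next_obs, next_action)])
                delta = target - Q[(obs, action)]
                traces[(obs, action)] = 1.0          # replacing trace
                for key in list(traces):
                    Q[key] += alpha * delta * traces[key]
                    traces[key] *= gamma * lam       # decay all traces
                obs, action = next_obs, next_action
        return Q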

NIPS 2008
Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms
Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...
John W. Roberts, Russ Tedrake
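
One common way to quantify this effect is the ratio of the squared mean of the gradient estimator to the variance of the samples around that mean; the sketch below uses that definition, which may differ from the paper's in its details.

    import numpy as np

    def gradient_snr(gradient_samples):
        """Empirical SNR of a policy gradient estimator.
        gradient_samples: array of shape (n_samples, n_params),
        one gradient estimate per rollout (or batch)."""
        g = np.asarray(gradient_samples, dtype=float)
        mean = g.mean(axis=0)                          # "signal"
        noise = ((g - mean) ** 2).sum(axis=1).mean()   # total variance, "noise"
        return float(mean @ mean) / noise

A low SNR means many more rollouts are needed before the estimated update direction becomes reliable.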

PROMAS 2004, Springer
Coordinating Teams in Uncertain Environments: A Hybrid BDI-POMDP Approach
Distributed partially observable Markov decision problems (POMDPs) have emerged as a popular decision-theoretic approach for planning for multiagent teams, where it is imperative f...
Ranjit Nair, Milind Tambe