Search Sciweavers | Sciweavers

7 search results - page 1 / 2

» A Variance Analysis for POMDP Policy Evaluation

click to vote

AAAI
2008

144views Intelligent Agents» more AAAI 2008»

A Variance Analysis for POMDP Policy Evaluation

13 years 11 months ago

Download www.cs.mcgill.ca

Partially Observable Markov Decision Processes have been studied widely as a model for decision making under uncertainty, and a number of methods have been developed to find the s...

Mahdi Milani Fard, Joelle Pineau, Peng Sun

claim paper

Read More »

click to vote

NIPS
2008

116views Information Technology» more NIPS 2008»

Particle Filter-based Policy Gradient in POMDPs

13 years 10 months ago

Download eprints.pascal-network.org

Our setting is a Partially Observable Markov Decision Process with continuous state, observation and action spaces. Decisions are based on a Particle Filter for estimating the bel...

Pierre-Arnaud Coquelin, Romain Deguest, Rém...

claim paper

Read More »

click to vote

ATAL
2009
Springer

198views Intelligent Agents» more ATAL 2009»

SarsaLandmark: an algorithm for learning in POMDPs with landmarks

14 years 4 months ago

Download www.aamas-conference.org

Reinforcement learning algorithms that use eligibility traces, such as Sarsa(λ), have been empirically shown to be effective in learning good estimated-state-based policies in pa...

Michael R. James, Satinder P. Singh

claim paper

Read More »

click to vote

NIPS
2008

110views Information Technology» more NIPS 2008»

Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms

13 years 10 months ago

Download groups.csail.mit.edu

Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...

John W. Roberts, Russ Tedrake

claim paper

Read More »

click to vote

PROMAS
2004
Springer

189views Intelligent Agents» more PROMAS 2004»

Coordinating Teams in Uncertain Environments: A Hybrid BDI-POMDP Approach

14 years 2 months ago

Download teamcore.usc.edu

Distributed partially observable Markov decision problems (POMDPs) have emerged as a popular decision-theoretic approach for planning for multiagent teams, where it is imperative f...

Ranjit Nair, Milind Tambe

claim paper

Read More »

« Prev « First page 1 / 2 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers