Search Sciweavers | Sciweavers

87 search results - page 1 / 18

DAGSTUHL
2007

A policy iteration algorithm for Markov decision processes skip-free in one direction

13 years 6 months ago

Download drops.dagstuhl.de

Joke Lambert, Benny Van Houdt, Chris Blondia

AAAI
2006

An Iterative Algorithm for Solving Constrained Decentralized Markov Decision Processes

13 years 6 months ago

Download www.aaai.org

Despite the significant progress to extend Markov Decision Processes (MDP) to cooperative multi-agent systems, developing approaches that can deal with realistic problems remains ...

Aurélie Beynier, Abdel-Illah Mouaddib

ICML
2006
IEEE

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

14 years 5 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|^3) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

ICML
2001
IEEE

Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

14 years 5 months ago

Download www-2.cs.cmu.edu

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in singl...

Martin Zinkevich, Tucker R. Balch

NIPS
2004

VDCBPI: an Approximate Scalable Algorithm for Large POMDPs

13 years 6 months ago

Download books.nips.cc

Existing algorithms for discrete partially observable Markov decision processes can at best solve problems of a few thousand states due to two important sources of intractability:...

Pascal Poupart, Craig Boutilier
