Search Sciweavers | Sciweavers

162 search results - page 2 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...

click to vote

ICML
2001
IEEE

145views Machine Learning» more ICML 2001»

Symmetry in Markov Decision Processes and its Implications for Single Agent and Multiagent Learning

14 years 5 months ago

Download www-2.cs.cmu.edu

This paper examines the notion of symmetry in Markov decision processes (MDPs). We define symmetry for an MDP and show how it can be exploited for more effective learning in singl...

Martin Zinkevich, Tucker R. Balch

claim paper

Read More »

click to vote

JMLR
2006

116views more JMLR 2006»

Point-Based Value Iteration for Continuous POMDPs

13 years 4 months ago

Download jmlr.csail.mit.edu

We propose a novel approach to optimize Partially Observable Markov Decisions Processes (POMDPs) defined on continuous spaces. To date, most algorithms for model-based POMDPs are ...

Josep M. Porta, Nikos A. Vlassis, Matthijs T. J. S...

claim paper

Read More »

click to vote

ICML
2006
IEEE

156views Machine Learning» more ICML 2006»

Learning the structure of Factored Markov Decision Processes in reinforcement learning problems

14 years 5 months ago

Download animatlab.lip6.fr

Recent decision-theoric planning algorithms are able to find optimal solutions in large problems, using Factored Markov Decision Processes (fmdps). However, these algorithms need ...

Thomas Degris, Olivier Sigaud, Pierre-Henri Wuille...

claim paper

Read More »

click to vote

AAAI
2006

123views Intelligent Agents» more AAAI 2006»

An Iterative Algorithm for Solving Constrained Decentralized Markov Decision Processes

13 years 6 months ago

Download www.aaai.org

Despite the significant progress to extend Markov Decision Processes (MDP) to cooperative multi-agent systems, developing approaches that can deal with realistic problems remains ...

Aurélie Beynier, Abdel-Illah Mouaddib

claim paper

Read More »

click to vote

DAGSTUHL
2007

74views Software Engineering» more DAGSTUHL 2007»

A policy iteration algorithm for Markov decision processes skip-free in one direction

13 years 6 months ago

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers