Markov decision processes

10

AIPS
1998

127views Artificial Intelligence» more AIPS 1998»

Solving Stochastic Planning Problems with Large State and Action Spaces

13 years 5 months ago

Planning methods for deterministic planning problems traditionally exploit factored representations to encode the dynamics of problems in terms of a set of parameters, e.g., the l...

Thomas Dean, Robert Givan, Kee-Eung Kim

claim paper

Read More »

18

click to vote

IJCAI
2007

194views Artificial Intelligence» more IJCAI 2007»

Average-Reward Decentralized Markov Decision Processes

13 years 6 months ago

Download anytime.cs.umass.edu

Formal analysis of decentralized decision making has become a thriving research area in recent years, producing a number of multi-agent extensions of Markov decision processes. Wh...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

16

click to vote

IJCAI
2007

154views Artificial Intelligence» more IJCAI 2007»

A Hybridized Planner for Stochastic Domains

13 years 6 months ago

Download www.ijcai.org

Markov Decision Processes are a powerful framework for planning under uncertainty, but current algorithms have difﬁculties scaling to large problems. We present a novel probabil...

Mausam, Piergiorgio Bertoli, Daniel S. Weld

claim paper

Read More »

15

click to vote

ATAL
2008
Springer

138views Intelligent Agents» more ATAL 2008»

Reinforcement learning for DEC-MDPs with changing action sets and partially ordered dependencies

13 years 6 months ago

Download ml.informatik.uni-freiburg.de

Decentralized Markov decision processes are frequently used to model cooperative multi-agent systems. In this paper, we identify a subclass of general DEC-MDPs that features regul...

Thomas Gabel, Martin A. Riedmiller

claim paper

Read More »

13

click to vote

AIPS
2007

80views Artificial Intelligence» more AIPS 2007»

Prioritizing Bellman Backups without a Priority Queue

13 years 6 months ago

Download www.cs.washington.edu

Several researchers have shown that the efﬁciency of value iteration, a dynamic programming algorithm for Markov decision processes, can be improved by prioritizing the order of...

Peng Dai, Eric A. Hansen

claim paper

Read More »

10

click to vote

SARA
2005
Springer

102views Artificial Intelligence» more SARA 2005»

Feature-Discovering Approximate Value Iteration Methods

13 years 10 months ago

Download cobweb.ecn.purdue.edu

Sets of features in Markov decision processes can play a critical role ximately representing value and in abstracting the state space. Selection of features is crucial to the succe...

Jia-Hong Wu, Robert Givan

claim paper

Read More »

18

click to vote

QEST
2006
IEEE

162views Modeling and Simulation» more QEST 2006»

LiQuor: A tool for Qualitative and Quantitative Linear Time analysis of Reactive Systems

13 years 10 months ago

Download www.win.tue.nl

LiQuor is a tool for verifying probabilistic reactive systems modelled Probmela programs, which are terms of a probabilistic guarded command language with an operational semantics...

Frank Ciesinski, Christel Baier

claim paper

Read More »

13

click to vote

LICS
2009
IEEE

103views Automated Reasoning» more LICS 2009»

Statistic Analysis for Probabilistic Processes

13 years 11 months ago

Download www.lri.fr

—We associate a statistical vector to a trace and a geometrical embedding to a Markov Decision Process, based on a distance on words, and study basic Membership and Equivalence p...

Michel de Rougemont, Mathieu Tracol

claim paper

Read More »

17

click to vote

ALT
2006
Springer

111views Machine Learning» more ALT 2006»

Asymptotic Learnability of Reinforcement Problems with Arbitrary Dependence

14 years 1 months ago

Download www.idsia.ch

We address the problem of reinforcement learning in which observations may exhibit an arbitrary form of stochastic dependence on past observations and actions. The task for an age...

Daniil Ryabko, Marcus Hutter

claim paper

Read More »

15

click to vote

VMCAI
2010
Springer

204views Software Engineering» more VMCAI 2010»

Best Probabilistic Transformers

14 years 1 months ago

Download rw4.cs.uni-sb.de

This paper investigates relative precision and optimality of analyses for concurrent probabilistic systems. Aiming at the problem at the heart of probabilistic model checking ? com...

Björn Wachter, Lijun Zhang

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers