Search Sciweavers | Sciweavers

138 search results - page 5 / 28

» Dynamic Programming for Structured Continuous Markov Decisio...

221

click to vote

AIPS
2004

145views Artificial Intelligence» more AIPS 2004»

Optimal Resource Allocation and Policy Formulation in Loosely-Coupled Markov Decision Processes

15 years 8 months ago

Download www.aaai.org

The problem of optimal policy formulation for teams of resource-limited agents in stochastic environments is composed of two strongly-coupled subproblems: a resource allocation pr...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

187

click to vote

AMAI
2006
Springer

123views Artificial Intelligence» more AMAI 2006»

Symmetric approximate linear programming for factored MDPs with application to constrained problems

15 years 7 months ago

Download ai.stanford.edu

A weakness of classical Markov decision processes (MDPs) is that they scale very poorly due to the flat state-space representation. Factored MDPs address this representational pro...

Dmitri A. Dolgov, Edmund H. Durfee

claim paper

Read More »

180

Voted

ATAL
2008
Springer

116views Intelligent Agents» more ATAL 2008»

Controlling deliberation in a Markov decision process-based agent

15 years 9 months ago

Download coitweb.uncc.edu

Meta-level control manages the allocation of limited resources to deliberative actions. This paper discusses efforts in adding meta-level control capabilities to a Markov Decision...

George Alexander, Anita Raja, David J. Musliner

claim paper

Read More »

198

click to vote

IJCAI
2003

142views Artificial Intelligence» more IJCAI 2003»

Taming Decentralized POMDPs: Towards Efficient Policy Computation for Multiagent Settings

15 years 8 months ago

Download dli.iiit.ac.in

The problem of deriving joint policies for a group of agents that maximize some joint reward function can be modeled as a decentralized partially observable Markov decision proces...

Ranjit Nair, Milind Tambe, Makoto Yokoo, David V. ...

claim paper

Read More »

218

click to vote

ICML
2006
IEEE

256views Machine Learning» more ICML 2006»

Automatic basis function construction for approximate dynamic programming and reinforcement learning

16 years 26 days ago

Download www.ece.mcgill.ca

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov Decision Process (MDP). Our work builds on results ...

Philipp W. Keller, Shie Mannor, Doina Precup

claim paper

Read More »

« Prev « First page 5 / 28 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers