Search Sciweavers | Sciweavers

47 search results - page 2 / 10

» Average-Reward Decentralized Markov Decision Processes

click to vote

ICML
2001
IEEE

172views Machine Learning» more ICML 2001»

Continuous-Time Hierarchical Reinforcement Learning

14 years 5 months ago

Download www.cs.ualberta.ca

Hierarchical reinforcement learning (RL) is a general framework which studies how to exploit the structure of actions and tasks to accelerate policy learning in large domains. Pri...

Mohammad Ghavamzadeh, Sridhar Mahadevan

claim paper

Read More »

click to vote

WSC
2008

154views Modeling And Simulation» more WSC 2008»

On step sizes, stochastic shortest paths, and survival probabilities in Reinforcement Learning

13 years 7 months ago

Download www.informs-sim.org

Reinforcement Learning (RL) is a simulation-based technique useful in solving Markov decision processes if their transition probabilities are not easily obtainable or if the probl...

Abhijit Gosavi

claim paper

Read More »

click to vote

ATAL
2003
Springer

152views Intelligent Agents» more ATAL 2003»

Transition-independent decentralized markov decision processes

13 years 10 months ago

Download anytime.cs.umass.edu

There has been substantial progress with formal models for sequential decision making by individual agents using the Markov decision process (MDP). However, similar treatment of m...

Raphen Becker, Shlomo Zilberstein, Victor R. Lesse...

claim paper

Read More »

click to vote

UAI
2000

168views Artificial Intelligence» more UAI 2000»

The Complexity of Decentralized Control of Markov Decision Processes

13 years 6 months ago

Download www.cs.umass.edu

We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that find optimal solutions. Generalization...

Daniel S. Bernstein, Shlomo Zilberstein, Neil Imme...

claim paper

Read More »

click to vote

CDC
2010
IEEE

141views Control Systems» more CDC 2010»

A dynamic programming algorithm for decentralized Markov decision processes with a broadcast structure

12 years 12 months ago

Download junction.stanford.edu

We give an optimal dynamic programming algorithm to solve a class of finite-horizon decentralized Markov decision processes (MDPs). We consider problems with a broadcast informati...

Jeff Wu, Sanjay Lall

claim paper

Read More »

« Prev « First page 2 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers