Search Sciweavers | Sciweavers

829 search results - page 7 / 166

» A time aggregation approach to Markov decision processes

134

click to vote

NIPS
2004

103views Information Technology» more NIPS 2004»

Experts in a Markov Decision Process

15 years 3 months ago

Download books.nips.cc

We consider an MDP setting in which the reward function is allowed to change during each time step of play (possibly in an adversarial manner), yet the dynamics remain fixed. Simi...

Eyal Even-Dar, Sham M. Kakade, Yishay Mansour

claim paper

Read More »

127

click to vote

UAI
2000

168views Artificial Intelligence» more UAI 2000»

The Complexity of Decentralized Control of Markov Decision Processes

15 years 3 months ago

Download www.cs.umass.edu

We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that find optimal solutions. Generalization...

Daniel S. Bernstein, Shlomo Zilberstein, Neil Imme...

claim paper

Read More »

114

Voted

ML
2002
ACM

143views Machine Learning» more ML 2002»

A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes

15 years 1 months ago

Download www.cis.upenn.edu

An issue that is critical for the application of Markov decision processes MDPs to realistic problems is how the complexity of planning scales with the size of the MDP. In stochas...

Michael J. Kearns, Yishay Mansour, Andrew Y. Ng

claim paper

Read More »

Voted

CDC
2008
IEEE

140views Control Systems» more CDC 2008»

Information state for Markov decision processes with network delays

15 years 8 months ago

Download wsl.stanford.edu

We consider a networked control system, where each subsystem evolves as a Markov decision process (MDP). Each subsystem is coupled to its neighbors via communication links over wh...

Sachin Adlakha, Sanjay Lall, Andrea J. Goldsmith

claim paper

Read More »

109

Voted

AAAI
2004

108views Intelligent Agents» more AAAI 2004»

Solving Generalized Semi-Markov Decision Processes Using Continuous Phase-Type Distributions

15 years 3 months ago

Download www.aaai.org

We introduce the generalized semi-Markov decision process (GSMDP) as an extension of continuous-time MDPs and semi-Markov decision processes (SMDPs) for modeling stochastic decisi...

Håkan L. S. Younes, Reid G. Simmons

claim paper

Read More »

« Prev « First page 7 / 166 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers