Search Sciweavers | Sciweavers

110

Voted

ECML
2005
Springer

120views Machine Learning» more ECML 2005»

Using Rewards for Belief State Updates in Partially Observable Markov Decision Processes

15 years 5 months ago

Partially Observable Markov Decision Processes (POMDP) provide a standard framework for sequential decision making in stochastic environments. In this setting, an agent takes actio...

Masoumeh T. Izadi, Doina Precup

claim paper

Read More »

89

click to vote

IJCAI
2007

194views Artificial Intelligence» more IJCAI 2007»

Average-Reward Decentralized Markov Decision Processes

15 years 1 months ago

Download anytime.cs.umass.edu

Formal analysis of decentralized decision making has become a thriving research area in recent years, producing a number of multi-agent extensions of Markov decision processes. Wh...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

104

click to vote

AAAI
1996

119views Intelligent Agents» more AAAI 1996»

Rewarding Behaviors

15 years 1 months ago

Download www.cs.toronto.edu

Markov decision processes (MDPs) are a very popular tool for decision theoretic planning (DTP), partly because of the welldeveloped, expressive theory that includes effective solu...

Fahiem Bacchus, Craig Boutilier, Adam J. Grove

claim paper

Read More »

111

click to vote

WSC
2001

120views Modeling And Simulation» more WSC 2001»

On improving the performance of simulation-based algorithms for average reward processes with application to network pricing

15 years 1 months ago

Download home.gwu.edu

We address performance issues associated with simulationbased algorithms for optimizing Markov reward processes. Specifically, we are concerned with algorithms that exploit the re...

Enrique Campos-Náñez, Stephen D. Pat...

claim paper

Read More »

93

click to vote

UAI
2003

87views Artificial Intelligence» more UAI 2003»

Implementation and Comparison of Solution Methods for Decision Processes with Non-Markovian Rewards

15 years 1 months ago

Download users.cecs.anu.edu.au

This paper examines a number of solution methods for decision processes with non-Markovian rewards (NMRDPs). They all exploit a temporal logic speciﬁcation of the reward functio...

Charles Gretton, David Price, Sylvie Thiéba...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers