Search Sciweavers | Sciweavers

1277 search results - page 151 / 256

» Terminating Decision Algorithms Optimally

ICMLA
2008

Machine Learning

Prediction-Directed Compression of POMDPs

15 years 5 months ago

Download damas.ift.ulaval.ca

High dimensionality of belief space in Partially Observable Markov Decision Processes (POMDPs) is one of the major causes that severely restricts the applicability of this model. ...

Abdeslam Boularias, Masoumeh T. Izadi, Brahim Chai...

IJCAI
2001

Artificial Intelligence

Symbolic Dynamic Programming for First-Order MDPs

15 years 5 months ago

Download www.cs.toronto.edu

We present a dynamic programming approach for the solution of first-order Markov decision processes. This technique uses an MDP whose dynamics is represented in a variant of the ...

Craig Boutilier, Raymond Reiter, Bob Price

FLAIRS
2009

Artificial Intelligence

Dynamic Programming Approximations for Partially Observable Stochastic Games

15 years 1 month ago

Download rbr.cs.umass.edu

Partially observable stochastic games (POSGs) provide a rich mathematical framework for planning under uncertainty by a group of agents. However, this modeling advantage comes wit...

Akshat Kumar, Shlomo Zilberstein

JMLR
2010

Variational methods for Reinforcement Learning

14 years 10 months ago

Download jmlr.csail.mit.edu

We consider reinforcement learning as solving a Markov decision process with unknown transition distribution. Based on interaction with the environment, an estimate of the transit...

Thomas Furmston, David Barber

ATAL
2003
Springer

Intelligent Agents

Optimizing information exchange in cooperative multi-agent systems

15 years 8 months ago

Download rbr.cs.umass.edu

Decentralized control of a cooperative multi-agent system is the problem faced by multiple decision-makers that share a common set of objectives. The decision-makers may be robots...

Claudia V. Goldman, Shlomo Zilberstein
