Search Sciweavers | Sciweavers

132 search results - page 2 / 27

» Relational Partially Observable MDPs

UAI
2000

133 views, Artificial Intelligence

PEGASUS: A policy search method for large MDPs and POMDPs

13 years 5 months ago

Download ai.stanford.edu

We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a mo...

Andrew Y. Ng, Michael I. Jordan

AI
2011
Springer

211 views, Artificial Intelligence

Decentralized MDPs with sparse interactions

12 years 7 months ago

Download www.inesc-id.pt

In this work, we explore how local interactions can simplify the process of decision-making in multiagent systems, particularly in multirobot problems. We review a recent decision-...

Francisco S. Melo, Manuela M. Veloso

AAAI
2011

136 views, Intelligent Agents

Linear Dynamic Programs for Resource Management

12 years 4 months ago

Download www.cs.umass.edu

Sustainable resource management in many domains presents large continuous stochastic optimization problems, which can often be modeled as Markov decision processes (MDPs). To solv...

Marek Petrik, Shlomo Zilberstein

NIPS
2003

180 views, Information Technology

Bounded Finite State Controllers

13 years 5 months ago

Download books.nips.cc

We describe a new approximation algorithm for solving partially observable MDPs. Our bounded policy iteration approach searches through the space of bounded-size, stochastic fini...

Pascal Poupart, Craig Boutilier

ICMLA
2009

181 views, Machine Learning

Sensitivity Analysis of POMDP Value Functions

13 years 1 month ago

Download www.cs.cmu.edu

In sequential decision making under uncertainty, as in many other modeling endeavors, researchers observe a dynamical system and collect data measuring its behavior over time. The...

Stéphane Ross, Masoumeh T. Izadi, Mark Merc...
