Search Sciweavers | Sciweavers

334 search results - page 27 / 67

» How to Dynamically Merge Markov Decision Processes

click to vote

QEST
2010
IEEE

154views Modeling and Simulation» more QEST 2010»

Symblicit Calculation of Long-Run Averages for Concurrent Probabilistic Systems

14 years 7 months ago

Download www.informatik.uni-freiburg.de

Abstract--Model checkers for concurrent probabilistic systems have become very popular within the last decade. The study of long-run average behavior has however received only scan...

Ralf Wimmer, Bettina Braitling, Bernd Becker, Erns...

claim paper

Read More »

179

click to vote

Publication

151views

Robust Bayesian reinforcement learning through tight lower bounds

13 years 8 months ago

Download arxiv.org

In the Bayesian approach to sequential decision making, exact calculation of the (subjective) utility is intractable. This extends to most special cases of interest, such as reinfo...

Christos Dimitrakakis

posted by olethros

Read More »

click to vote

AAAI
2004

167views Intelligent Agents» more AAAI 2004»

Dynamic Programming for Partially Observable Stochastic Games

14 years 11 months ago

Download anytime.cs.umass.edu

We develop an exact dynamic programming algorithm for partially observable stochastic games (POSGs). The algorithm is a synthesis of dynamic programming for partially observable M...

Eric A. Hansen, Daniel S. Bernstein, Shlomo Zilber...

claim paper

Read More »

111

click to vote

AAAI
2011

136views Intelligent Agents» more AAAI 2011»

Linear Dynamic Programs for Resource Management

13 years 9 months ago

Download www.cs.umass.edu

Sustainable resource management in many domains presents large continuous stochastic optimization problems, which can often be modeled as Markov decision processes (MDPs). To solv...

Marek Petrik, Shlomo Zilberstein

claim paper

Read More »

click to vote

NIPS
2001

192views Information Technology» more NIPS 2001»

Predictive Representations of State

14 years 11 months ago

Download www.eecs.umich.edu

We show that states of a dynamical system can be usefully represented by multi-step, action-conditional predictions of future observations. State representations that are grounded...

Michael L. Littman, Richard S. Sutton, Satinder P....

claim paper

Read More »

« Prev « First page 27 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers