Search Sciweavers | Sciweavers

58 search results - page 3 / 12

» Approximate Solution Techniques for Factored First-Order MDP...

AIPS
2004

142 views, Artificial Intelligence

Heuristic Refinements of Approximate Linear Programming for Factored Continuous-State Markov Decision Processes

13 years 6 months ago

Download www.cs.pitt.edu

Approximate linear programming (ALP) offers a promising framework for solving large factored Markov decision processes (MDPs) with both discrete and continuous states. Successful ...

Branislav Kveton, Milos Hauskrecht

AAAI
2010

136 views, Intelligent Agents

Robust Policy Computation in Reward-Uncertain MDPs Using Nondominated Policies

13 years 6 months ago

Download www.cs.toronto.edu

The precise specification of reward functions for Markov decision processes (MDPs) is often extremely difficult, motivating research into both reward elicitation and the robust so...

Kevin Regan, Craig Boutilier

ATAL
2003
Springer

152 views, Intelligent Agents

Transition-Independent Decentralized Markov Decision Processes

13 years 10 months ago

Download anytime.cs.umass.edu

There has been substantial progress with formal models for sequential decision making by individual agents using the Markov decision process (MDP). However, similar treatment of m...

Raphen Becker, Shlomo Zilberstein, Victor R. Lesse...

AAAI
1998

129 views, Intelligent Agents

Solving Very Large Weakly Coupled Markov Decision Processes

13 years 6 months ago

Download www.cs.toronto.edu

We present a technique for computing approximately optimal solutions to stochastic resource allocation problems modeled as Markov decision processes (MDPs). We exploit two key pro...

Nicolas Meuleau, Milos Hauskrecht, Kee-Eung Kim, L...

AAAI
2007

102 views, Intelligent Agents

Thresholded Rewards: Acting Optimally in Timed, Zero-Sum Games

13 years 7 months ago

Download www.cs.cmu.edu

In timed, zero-sum games, the goal is to maximize the probability of winning, which is not necessarily the same as maximizing our expected reward. We consider cumulative intermedi...

Colin McMillen, Manuela M. Veloso
