Search Sciweavers | Sciweavers

201 search results - page 17 / 41

» Solving Concurrent Markov Decision Processes

click to vote

CORR
2008
Springer

122views Education» more CORR 2008»

Strategy Improvement for Concurrent Safety Games

14 years 12 months ago

Download www.soe.ucsc.edu

We consider concurrent games played on graphs. At every round of the game, each player simultaneously and independently selects a move; the moves jointly determine the transition ...

Krishnendu Chatterjee, Luca de Alfaro, Thomas A. H...

claim paper

Read More »

click to vote

IJCAI
2001

115views Artificial Intelligence» more IJCAI 2001»

Adaptive Control of Acyclic Progressive Processing Task Structures

15 years 1 months ago

Download anytime.cs.umass.edu

The progressive processing model allows a system to trade off resource consumption against the quality of the outcome by mapping each activity to a graph of potential solution met...

Stéphane Cardon, Abdel-Illah Mouaddib, Shlo...

claim paper

Read More »

100

click to vote

PUK
2000

130views Computer Science» more PUK 2000»

Dynamic Scheduling of Progressive Processing Plans

15 years 1 months ago

Download www-is.informatik.uni-oldenburg.de

Progressive processing plans allow systems to tradeoff computational resources against the quality of service by specifying alternative ways in which to accomplish each step. When ...

Shlomo Zilberstein, Abdel-Illah Mouaddib, Andrew A...

claim paper

Read More »

click to vote

ECML
2006
Springer

88views Machine Learning» more ECML 2006»

Reinforcement Learning for MDPs with Constraints

15 years 1 months ago

Download www.peter-geibel.de

In this article, I will consider Markov Decision Processes with two criteria, each defined as the expected value of an infinite horizon cumulative return. The second criterion is e...

Peter Geibel

claim paper

Read More »

click to vote

ICML
2006
IEEE

101views Machine Learning» more ICML 2006»

Qualitative reinforcement learning

16 years 19 days ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

claim paper

Read More »

« Prev « First page 17 / 41 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers