Search Sciweavers | Sciweavers

334 search results - page 19 / 67

» How to Dynamically Merge Markov Decision Processes

click to vote

IJCAI
2007

254views Artificial Intelligence» more IJCAI 2007»

Bayesian Inverse Reinforcement Learning

14 years 11 months ago

Download www.ijcai.org

Inverse Reinforcement Learning (IRL) is the problem of learning the reward function underlying a Markov Decision Process given the dynamics of the system and the behaviour of an e...

Deepak Ramachandran, Eyal Amir

claim paper

Read More »

click to vote

PKDD
2009
Springer

129views Data Mining» more PKDD 2009»

Considering Unseen States as Impossible in Factored Reinforcement Learning

15 years 4 months ago

Download www-desir.lip6.fr

Abstract. The Factored Markov Decision Process (FMDP) framework is a standard representation for sequential decision problems under uncertainty where the state is represented as a ...

Olga Kozlova, Olivier Sigaud, Pierre-Henri Wuillem...

claim paper

Read More »

click to vote

ISSS
1999
IEEE

121views Hardware» more ISSS 1999»

Event-Driven Power Management of Portable Systems

15 years 1 months ago

Download si2.epfl.ch

The policy optimization problem for dynamic power management has received considerable attention in the recent past. We formulate policy optimization as a constrained optimization...

Tajana Simunic, Giovanni De Micheli, Luca Benini

claim paper

Read More »

click to vote

FSTTCS
2006
Springer

149views Software Engineering» more FSTTCS 2006»

Testing Probabilistic Equivalence Through Reinforcement Learning

15 years 1 months ago

Download www2.ift.ulaval.ca

We propose a new approach to verification of probabilistic processes for which the model may not be available. We use a technique from Reinforcement Learning to approximate how far...

Josee Desharnais, François Laviolette, Sami...

claim paper

Read More »

click to vote

HYBRID
2010
Springer

160views Control Systems» more HYBRID 2010»

On the connections between PCTL and dynamic programming

15 years 4 months ago

Download www2.ee.kth.se

Probabilistic Computation Tree Logic (PCTL) is a wellknown modal logic which has become a standard for expressing temporal properties of ﬁnite-state Markov chains in the context...

Federico Ramponi, Debasish Chatterjee, Sean Summer...

claim paper

Read More »

« Prev « First page 19 / 67 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers