We consider sensor scheduling as the optimal observability problem for partially observable Markov decision processes (POMDPs). This model fits cases where a Markov process ...
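As a point of reference for the POMDP formulation named above, the following is a minimal sketch of the standard belief update (Bayes filter) that such an observability problem builds on; the transition matrix `T`, the sensor observation model `O_sensor`, and the two-state example are invented for illustration and are not taken from the paper.

```python
import numpy as np

def belief_update(belief, T, O_a, obs):
    """One Bayes-filter step for a POMDP: predict with the transition
    model, then condition on the observation returned by the chosen
    sensor (action) a."""
    predicted = T.T @ belief                 # prior over the next state
    unnorm = O_a[:, obs] * predicted         # weight by observation likelihood
    return unnorm / unnorm.sum()

# Hypothetical 2-state process observed through a noisy sensor.
T = np.array([[0.9, 0.1],
              [0.2, 0.8]])                   # T[s, s'] = P(s' | s)
O_sensor = np.array([[0.85, 0.15],
                     [0.10, 0.90]])          # O[s', o] = P(o | s', sensor)

b = np.array([0.5, 0.5])
b = belief_update(b, T, O_sensor, obs=0)
print(b)
```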
In sequential decision-making problems formulated as Markov decision processes, state-value function approximation using domain features is a critical technique for scaling up the...
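To make the technique concrete, here is a minimal sketch of linear state-value approximation over domain features trained with a TD(0) semi-gradient update; the feature map `phi` and the toy transition samples are hypothetical placeholders rather than anything from the paper.

```python
import numpy as np

def td0_linear(samples, phi, n_features, gamma=0.95, alpha=0.05):
    """TD(0) with a linear value function V(s) = w . phi(s).
    `samples` is an iterable of (s, r, s_next, done) transitions and
    `phi` maps a state to its feature vector."""
    w = np.zeros(n_features)
    for s, r, s_next, done in samples:
        v = w @ phi(s)
        v_next = 0.0 if done else w @ phi(s_next)
        td_error = r + gamma * v_next - v
        w += alpha * td_error * phi(s)       # semi-gradient step
    return w

# Hypothetical domain features for an integer-valued state.
phi = lambda s: np.array([1.0, s, s * s])
samples = [(0, 0.0, 1, False), (1, 1.0, 2, False), (2, 5.0, 2, True)]
w = td0_linear(samples, phi, n_features=3)
print(w)
```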
Model checkers for concurrent probabilistic systems have become very popular within the last decade. The study of long-run average behavior has, however, received only scan...
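For orientation only, the snippet below works out the long-run average reward of a small ergodic Markov chain from its stationary distribution; it is not a model-checking algorithm, and the transition matrix `P` and reward vector `r` are invented for the example.

```python
import numpy as np

def stationary_distribution(P):
    """Solve pi P = pi together with sum(pi) = 1 for an ergodic chain,
    in a least-squares sense over the stacked system."""
    n = P.shape[0]
    A = np.vstack([P.T - np.eye(n), np.ones(n)])
    b = np.append(np.zeros(n), 1.0)
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pi

# Hypothetical 3-state chain with per-state rewards.
P = np.array([[0.5, 0.4, 0.1],
              [0.2, 0.6, 0.2],
              [0.1, 0.3, 0.6]])
r = np.array([0.0, 1.0, 4.0])

pi = stationary_distribution(P)
print("long-run average reward:", pi @ r)
```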
This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, for the continuous case, i.e., continuous state space and continuous time. The eval...
This paper presents the CQ algorithm, which decomposes and solves a Markov Decision Process (MDP) by automatically generating a hierarchy of smaller MDPs using state variables. The ...
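The sketch below illustrates only the general pattern of splitting a factored MDP along one state variable: solve a sub-MDP per room, then an abstract MDP over the `room` variable whose macro-action value comes from the sub-task. It is a deliberately tiny, deterministic toy in the spirit of hierarchical MDP methods, not the CQ algorithm itself; all quantities (`N_ROOMS`, `N_POS`, rewards) are hypothetical.

```python
import numpy as np

# Tiny factored problem: state = (room, pos), with rooms chained by an
# exit at the last cell of each room.
N_ROOMS, N_POS, GAMMA = 3, 5, 0.95

def solve_subtask():
    """Value iteration for the sub-task 'reach the exit cell of a room'.
    Moving left/right costs -1 per step; the exit cell (pos N_POS-1) is terminal."""
    V = np.zeros(N_POS)
    for _ in range(200):                      # enough sweeps to converge here
        for p in range(N_POS - 1):
            moves = [max(p - 1, 0), min(p + 1, N_POS - 1)]
            V[p] = max(-1.0 + GAMMA * V[q] for q in moves)
    return V

def solve_abstract(V_sub):
    """Abstract MDP over the `room` variable alone: a single macro-action
    'leave the room', whose value is the sub-task value from the entry cell
    and which deterministically leads to the next room after N_POS-1 steps."""
    V_room = np.zeros(N_ROOMS)
    for r in range(N_ROOMS - 2, -1, -1):      # the last room is the goal (value 0)
        V_room[r] = V_sub[0] + GAMMA ** (N_POS - 1) * V_room[r + 1]
    return V_room

print("per-room values:", solve_abstract(solve_subtask()))
```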