Markov decision process

LICS
2009
IEEE

Automated Reasoning

Statistic Analysis for Probabilistic Processes

15 years 8 months ago

We associate a statistical vector to a trace and a geometrical embedding to a Markov Decision Process, based on a distance on words, and study basic Membership and Equivalence p...

Michel de Rougemont, Mathieu Tracol

IUI
2010
ACM

Software Engineering

A POMDP approach to P300-based brain-computer interfaces

15 years 10 months ago

Download ailab.kaist.ac.kr

Most of the previous work on non-invasive brain-computer interfaces (BCIs) has been focused on feature extraction and classification algorithms to achieve high performance for the...

Jaeyoung Park, Kee-Eung Kim, Sungho Jo

PERCOM
2007
IEEE

Computer Networks

Sensor Scheduling for Optimal Observability Using Estimation Entropy

16 years 27 days ago

Download people.eng.unimelb.edu.au

We consider sensor scheduling as the optimal observability problem for partially observable Markov decision processes (POMDP). This model fits the cases where a Markov process ...

Mohammad Rezaeian

ICML
2006
ACM

Machine Learning

An intrinsic reward mechanism for efficient exploration

16 years 2 months ago

Download www-anw.cs.umass.edu

How should a reinforcement learning agent act if its sole purpose is to efficiently learn an optimal policy for later use? In other words, how should it explore, to be able to exp...

Özgür Simsek, Andrew G. Barto

ICML
2006
ACM

Machine Learning

PAC model-free reinforcement learning

16 years 2 months ago

Download cseweb.ucsd.edu

For a Markov Decision Process with finite state (size S) and action spaces (size A per state), we propose a new algorithm, Delayed Q-Learning. We prove it is PAC, achieving near o...

Alexander L. Strehl, Lihong Li, Eric Wiewiora, Joh...

ICML
2006
ACM

Machine Learning

Qualitative reinforcement learning

16 years 2 months ago

Download www.cs.uiuc.edu

When the transition probabilities and rewards of a Markov Decision Process are specified exactly, the problem can be solved without any interaction with the environment. When no s...

Arkady Epshteyn, Gerald DeJong

DAC
2000
ACM

Computer Architecture

Dynamic power management of complex systems using generalized stochastic Petri nets

16 years 2 months ago

Download atrak.usc.edu

In this paper, we introduce a new technique for modeling and solving the dynamic power management (DPM) problem for systems with complex behavioral characteristics such as concurr...

Qinru Qiu, Qing Wu, Massoud Pedram

ICIP
2009
IEEE

Image Processing

A Robust Framework For Aligning Lecture Slides With Video

16 years 2 months ago

Download www.comp.nus.edu.sg

We propose a robust approach for aligning lecture slides with lecture videos using a combination of Hough transform, optical flow and Gabor analysis. A Markov Decision Process mod...
