Search Sciweavers | Sciweavers

682 search results - page 29 / 137

» One-Counter Markov Decision Processes

132

click to vote

AI
2006
Springer

103views Artificial Intelligence» more AI 2006»

Trace Equivalence Characterization Through Reinforcement Learning

15 years 9 months ago

Download www2.ift.ulaval.ca

In the context of probabilistic verification, we provide a new notion of trace-equivalence divergence between pairs of Labelled Markov processes. This divergence corresponds to the...

Josee Desharnais, François Laviolette, Kris...

claim paper

Read More »

190

click to vote

CORR
2012
Springer

235views Education» more CORR 2012»

An Incremental Sampling-based Algorithm for Stochastic Optimal Control

14 years 1 months ago

Download www.mit.edu

Abstract— In this paper, we consider a class of continuoustime, continuous-space stochastic optimal control problems. Building upon recent advances in Markov chain approximation ...

Vu Anh Huynh, Sertac Karaman, Emilio Frazzoli

claim paper

Read More »

281

click to vote

Publication

233views

Sparse reward processes

14 years 4 months ago

Download arxiv.org

We introduce a class of learning problems where the agent is presented with a series of tasks. Intuitively, if there is relation among those tasks, then the information gained duri...

Christos Dimitrakakis

posted by olethros

Read More »

207

click to vote

INFOCOM
2012
IEEE

211views Communications» more INFOCOM 2012»

Delay optimal multichannel opportunistic access

13 years 8 months ago

Download www.ece.ucdavis.edu

Abstract—The problem of minimizing queueing delay of opportunistic access of multiple continuous time Markov channels is considered. A new access policy based on myopic sensing a...

Shiyao Chen, Lang Tong, Qing Zhao

claim paper

Read More »

141

click to vote

UAI
2004

131views Artificial Intelligence» more UAI 2004»

Dynamic Programming for Structured Continuous Markov Decision Problems

15 years 7 months ago

Download www.cs.bham.ac.uk

We describe an approach for exploiting structure in Markov Decision Processes with continuous state variables. At each step of the dynamic programming, the state space is dynamica...

Zhengzhu Feng, Richard Dearden, Nicolas Meuleau, R...

claim paper

Read More »

« Prev « First page 29 / 137 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers