Search Sciweavers | Sciweavers

162 search results - page 4 / 33

» Topological Value Iteration Algorithm for Markov Decision Pr...

188

click to vote

AI
2008
Springer

123views Artificial Intelligence» more AI 2008»

Reachability analysis of uncertain systems using bounded-parameter Markov decision processes

15 years 6 months ago

Download www.vuse.vanderbilt.edu

Verification of reachability properties for probabilistic systems is usually based on variants of Markov processes. Current methods assume an exact model of the dynamic behavior a...

Di Wu, Xenofon D. Koutsoukos

claim paper

Read More »

167

click to vote

ICML
2010
IEEE

219views Machine Learning» more ICML 2010»

Convergence of Least Squares Temporal Difference Methods Under General Conditions

15 years 7 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu

claim paper

Read More »

180

click to vote

CORR
2010
Springer

105views Education» more CORR 2010»

Optimism in Reinforcement Learning Based on Kullback-Leibler Divergence

15 years 5 months ago

Download hal.archives-ouvertes.fr

We consider model-based reinforcement learning in ﬁnite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. Optimism is usually implemented by carryin...

Sarah Filippi, Olivier Cappé, Aurelien Gari...

claim paper

Read More »

196

click to vote

ICML
2006
IEEE

143views Machine Learning» more ICML 2006»

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

16 years 7 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|3 ) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

claim paper

Read More »

198

click to vote

IJCAI
2007

170views Artificial Intelligence» more IJCAI 2007»

First Order Decision Diagrams for Relational MDPs

15 years 8 months ago

Download www.cs.tufts.edu

Dynamic programming algorithms provide a basic tool identifying optimal solutions in Markov Decision Processes (MDP). The paper develops a representation for decision diagrams sui...

Chenggang Wang, Saket Joshi, Roni Khardon

claim paper

Read More »

« Prev « First page 4 / 33 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers