Search Sciweavers | Sciweavers

ALT
2007
Springer

Machine Learning

Pseudometrics for State Aggregation in Average Reward Markov Decision Processes

15 years 10 months ago

We consider how state similarity in average reward Markov decision processes (MDPs) may be described by pseudometrics. Introducing the notion of adequate pseudometrics which are we...

Ronald Ortner

AIPS
2004

Artificial Intelligence

Heuristic Refinements of Approximate Linear Programming for Factored Continuous-State Markov Decision Processes

15 years 2 months ago

Download www.cs.pitt.edu

Approximate linear programming (ALP) offers a promising framework for solving large factored Markov decision processes (MDPs) with both discrete and continuous states. Successful ...

Branislav Kveton, Milos Hauskrecht

CAV
2007
Springer

Hardware

Magnifying-Lens Abstraction for Markov Decision Processes

15 years 7 months ago

Download www.ee.ucla.edu

Magnifying-Lens Abstraction for Markov Decision Processes. In Proc. of CAV 2007: 19th International Conference on Computer-Aided Verification, Lecture Notes in Computer Science. © Spri...

Luca de Alfaro, Pritam Roy

SARA
2007
Springer

Artificial Intelligence

Active Learning of Dynamic Bayesian Networks in Markov Decision Processes

15 years 7 months ago

Download www-anw.cs.umass.edu

Several recent techniques for solving Markov decision processes use dynamic Bayesian networks to compactly represent tasks. The dynamic Bayesian network representation may not be g...

Anders Jonsson, Andrew G. Barto

ECML
2005
Springer

Machine Learning

Active Learning in Partially Observable Markov Decision Processes

15 years 7 months ago

Download www.cs.mcgill.ca

This paper examines the problem of finding an optimal policy for a Partially Observable Markov Decision Process (POMDP) when the model is not known or is only poorly specified. W...

Robin Jaulmes, Joelle Pineau, Doina Precup
