Search Sciweavers | Sciweavers

48 search results - page 3 / 10

» Metrics for Finite Markov Decision Processes

106

click to vote

ICML
2010
IEEE

219views Machine Learning» more ICML 2010»

Convergence of Least Squares Temporal Difference Methods Under General Conditions

15 years 2 months ago

Download www.cs.helsinki.fi

We consider approximate policy evaluation for finite state and action Markov decision processes (MDP) in the off-policy learning context and with the simulation-based least square...

Huizhen Yu

claim paper

Read More »

108

click to vote

CORR
2008
Springer

91views Education» more CORR 2008»

Significant Diagnostic Counterexamples in Probabilistic Model Checking

15 years 1 months ago

Download www.cs.ru.nl

Abstract. This paper presents a novel technique for counterexample generation in probabilistic model checking of Markov chains and Markov Decision Processes. (Finite) paths in coun...

Miguel E. Andrés, Pedro R. D'Argenio, Peter...

claim paper

Read More »

click to vote

AIPS
2003

149views Artificial Intelligence» more AIPS 2003»

Synthesis of Hierarchical Finite-State Controllers for POMDPs

15 years 2 months ago

Download www.aaai.org

We develop a hierarchical approach to planning for partially observable Markov decision processes (POMDPs) in which a policy is represented as a hierarchical ﬁnite-state control...

Eric A. Hansen, Rong Zhou

claim paper

Read More »

101

click to vote

CALCO
2007
Springer

100views Mathematics» more CALCO 2007»

Applications of Metric Coinduction

15 years 7 months ago

Download www.cs.cornell.edu

Metric coinduction is a form of coinduction that can be used to establish properties of objects constructed as a limit of ﬁnite approximations. One can prove a coinduction step s...

Dexter Kozen, Nicholas Ruozzi

claim paper

Read More »

100

click to vote

ICML
2005
IEEE

133views Machine Learning» more ICML 2005»

A theoretical analysis of Model-Based Interval Estimation

16 years 2 months ago

Download paul.rutgers.edu

Several algorithms for learning near-optimal policies in Markov Decision Processes have been analyzed and proven efficient. Empirical results have suggested that Model-based Inter...

Alexander L. Strehl, Michael L. Littman

claim paper

Read More »

« Prev « First page 3 / 10 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers