Search Sciweavers | Sciweavers

17 search results - page 1 / 4

» Analysis of a Classification-based Policy Iteration Algorith...

click to vote

ICML
2010
IEEE

195views Machine Learning» more ICML 2010»

Analysis of a Classification-based Policy Iteration Algorithm

15 years 3 months ago

Download www.femto-st.fr

Alessandro Lazaric, Mohammad Ghavamzadeh, Ré...

claim paper

Read More »

click to vote

ESOP
2007
Springer

152views Programming Languages» more ESOP 2007»

Static Analysis by Policy Iteration on Relational Domains

15 years 8 months ago

Download minimal.inria.fr

We give a new practical algorithm to compute, in ﬁnite time, a ﬁxpoint (and often the least ﬁxpoint) of a system of equations in the abstract numerical domains of zones and t...

Stephane Gaubert, Eric Goubault, Ankur Taly, Sarah...

claim paper

Read More »

137

click to vote

VALUETOOLS
2006
ACM

176views Hardware» more VALUETOOLS 2006»

How to solve large scale deterministic games with mean payoff by policy iteration

15 years 8 months ago

Download minimal.inria.fr

Min-max functions are dynamic programming operators of zero-sum deterministic games with ﬁnite state and action spaces. The problem of computing the linear growth rate of the or...

Vishesh Dhingra, Stephane Gaubert

claim paper

Read More »

116

Voted

ICML
2009
IEEE

172views Machine Learning» more ICML 2009»

Model-free reinforcement learning as mixture learning

16 years 3 months ago

Download user.cs.tu-berlin.de

We cast model-free reinforcement learning as the problem of maximizing the likelihood of a probabilistic mixture model via sampling, addressing both the infinite and finite horizo...

Nikos Vlassis, Marc Toussaint

claim paper

Read More »

138

click to vote

ICML
2006
IEEE

143views Machine Learning» more ICML 2006»

Fast direct policy evaluation using multiscale analysis of Markov diffusion processes

16 years 3 months ago

Download www.cs.umass.edu

Policy evaluation is a critical step in the approximate solution of large Markov decision processes (MDPs), typically requiring O(|S|3 ) to directly solve the Bellman system of |S...

Mauro Maggioni, Sridhar Mahadevan

claim paper

Read More »

« Prev « First page 1 / 4 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers