Search Sciweavers | Sciweavers

52 search results - page 1 / 11

» Error Bounds for Approximate Policy Iteration

183

Voted

ICML
2003
IEEE

174views Machine Learning» more ICML 2003»

Error Bounds for Approximate Policy Iteration

16 years 8 months ago

Download www.aaai.org

Rémi Munos

claim paper

Read More »

247

click to vote

ML
2008
ACM

152views Machine Learning» more ML 2008»

Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path

15 years 7 months ago

Download hal.inria.fr

Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...

András Antos, Csaba Szepesvári, R&ea...

claim paper

Read More »

261

click to vote

Publication

222views

Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration

16 years 4 months ago

Download arxiv.org

Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...

Christos Dimitrakakis, Michail G. Lagoudakis

posted by olethros

Read More »

213

click to vote

NIPS
2003

180views Information Technology» more NIPS 2003»

Bounded Finite State Controllers

15 years 9 months ago

Download books.nips.cc

We describe a new approximation algorithm for solving partially observable MDPs. Our bounded policy iteration approach searches through the space of bounded-size, stochastic ﬁni...

Pascal Poupart, Craig Boutilier

claim paper

Read More »

195

Voted

ICIP
1999
IEEE

163views Image Processing» more ICIP 1999»

Efficient Approximation of Gray-Scale Images Through Bounded Error Triangular Meshes

16 years 9 months ago

Download deim.urv.cat

This paper presents an iterative algorithm for approximating gray-scale images with adaptive triangular meshes ensuring a given tolerance. At each iteration, the algorithm applies...

Angel Domingo Sappa, Boris Xavier Vintimilla, Migu...

claim paper

Read More »

« Prev « First page 1 / 11 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers