Sciweavers

52 search results - page 1 / 11
» Error Bounds for Approximate Policy Iteration
Sort
View
67
Voted
ICML
2003
IEEE
15 years 10 months ago
Error Bounds for Approximate Policy Iteration
Rémi Munos
ML
2008
ACM
152views Machine Learning» more  ML 2008»
14 years 9 months ago
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
Abstract. We consider batch reinforcement learning problems in continuous space, expected total discounted-reward Markovian Decision Problems. As opposed to previous theoretical wo...
András Antos, Csaba Szepesvári, R&ea...

Publication
222views
15 years 6 months ago
Algorithms and Bounds for Rollout Sampling Approximate Policy Iteration
Abstract: Several approximate policy iteration schemes without value functions, which focus on policy representation using classifiers and address policy learning as a supervis...
Christos Dimitrakakis, Michail G. Lagoudakis
NIPS
2003
14 years 11 months ago
Bounded Finite State Controllers
We describe a new approximation algorithm for solving partially observable MDPs. Our bounded policy iteration approach searches through the space of bounded-size, stochastic fini...
Pascal Poupart, Craig Boutilier
ICIP
1999
IEEE
15 years 11 months ago
Efficient Approximation of Gray-Scale Images Through Bounded Error Triangular Meshes
This paper presents an iterative algorithm for approximating gray-scale images with adaptive triangular meshes ensuring a given tolerance. At each iteration, the algorithm applies...
Angel Domingo Sappa, Boris Xavier Vintimilla, Migu...