Sciweavers

567 search results - page 3 / 114
» Regularized Policy Iteration
Sort
View
AUTOMATICA
2004
88views more  AUTOMATICA 2004»
13 years 4 months ago
Unconstrained optimal control of regular languages
This paper formulates an unconstrained optimal policy for control of regular languages realized as deterministic
Jinbo Fu, Asok Ray, Constantino M. Lagoa
NIPS
2003
13 years 6 months ago
Bounded Finite State Controllers
We describe a new approximation algorithm for solving partially observable MDPs. Our bounded policy iteration approach searches through the space of bounded-size, stochastic fini...
Pascal Poupart, Craig Boutilier
JMLR
2006
143views more  JMLR 2006»
13 years 4 months ago
Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation
We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...
Rémi Munos
AUTOMATICA
2008
74views more  AUTOMATICA 2008»
13 years 5 months ago
Policy iteration based feedback control
It is well known that stochastic control systems can be viewed as Markov decision processes (MDPs) with continuous state spaces. In this paper, we propose to apply the policy iter...
Kan-Jian Zhang, Yan-Kai Xu, Xi Chen, Xi-Ren Cao

Publication
334views
14 years 1 months ago
Rollout Sampling Approximate Policy Iteration
Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...
Christos Dimitrakakis, Michail G. Lagoudakis