Sciweavers

567 search results - page 3 / 114
» Regularized Policy Iteration
AUTOMATICA
2004
15 years 4 days ago
Unconstrained optimal control of regular languages
This paper formulates an unconstrained optimal policy for control of regular languages realized as deterministic
Jinbo Fu, Asok Ray, Constantino M. Lagoa
NIPS
2003
15 years 1 months ago
Bounded Finite State Controllers
We describe a new approximation algorithm for solving partially observable MDPs. Our bounded policy iteration approach searches through the space of bounded-size, stochastic fini...
Pascal Poupart, Craig Boutilier
JMLR
2006
15 years 7 days ago
Geometric Variance Reduction in Markov Chains: Application to Value Function and Gradient Estimation
We study a sequential variance reduction technique for Monte Carlo estimation of functionals in Markov Chains. The method is based on designing sequential control variates using s...
Rémi Munos
AUTOMATICA
2008
15 years 12 days ago
Policy iteration based feedback control
It is well known that stochastic control systems can be viewed as Markov decision processes (MDPs) with continuous state spaces. In this paper, we propose to apply the policy iter...
Kan-Jian Zhang, Yan-Kai Xu, Xi Chen, Xi-Ren Cao

Publication
15 years 9 months ago
Rollout Sampling Approximate Policy Iteration
Several researchers have recently investigated the connection between reinforcement learning and classification. We are motivated by proposals of approximate policy iteration schem...
Christos Dimitrakakis, Michail G. Lagoudakis