Sciweavers

567 search results - page 1 / 114
» Regularized Policy Iteration
Sort
View
NIPS
2008
13 years 5 months ago
Regularized Policy Iteration
In this paper we consider approximate policy-iteration-based reinforcement learning algorithms. In order to implement a flexible function approximation scheme we propose the use o...
Amir Massoud Farahmand, Mohammad Ghavamzadeh, Csab...
AUTOMATICA
2005
108views more  AUTOMATICA 2005»
13 years 4 months ago
Robust optimal control of regular languages
This paper presents an algorithm for robust optimal control of regular languages under specified uncertainty bounds on the event cost parameters of the language measure that has b...
Constantino M. Lagoa, Jinbo Fu, Asok Ray
CORR
2007
Springer
94views Education» more  CORR 2007»
13 years 4 months ago
Paging and Registration in Cellular Networks: Jointly Optimal Policies and an Iterative Algorithm
— This paper explores optimization of paging and registration policies in cellular networks. Motion is modeled as a discrete-time Markov process, and minimization of the discount...
Bruce Hajek, Kevin Mitzel, Sichao Yang
NA
2007
105views more  NA 2007»
13 years 4 months ago
Orthogonal projection regularization operators
Abstract. Tikhonov regularization often is applied with a finite difference regularization operator that approximates a low-order derivative. This paper proposes the use of ortho...
Serena Morigi, Lothar Reichel, Fiorella Sgallari
AAAI
2007
13 years 6 months ago
Point-Based Policy Iteration
We describe a point-based policy iteration (PBPI) algorithm for infinite-horizon POMDPs. PBPI replaces the exact policy improvement step of Hansen’s policy iteration with point...
Shihao Ji, Ronald Parr, Hui Li, Xuejun Liao, Lawre...