Sciweavers

377 search results - page 11 / 76
» Convergence of Stochastic Iterative Dynamic Programming Algo...
Sort
View
CORR
2010
Springer
119views Education» more  CORR 2010»
14 years 9 months ago
Dynamic Policy Programming
In this paper, we consider the problem of planning and learning in the infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policysearc...
Mohammad Gheshlaghi Azar, Hilbert J. Kappen
CORR
2007
Springer
94views Education» more  CORR 2007»
14 years 9 months ago
Paging and Registration in Cellular Networks: Jointly Optimal Policies and an Iterative Algorithm
— This paper explores optimization of paging and registration policies in cellular networks. Motion is modeled as a discrete-time Markov process, and minimization of the discount...
Bruce Hajek, Kevin Mitzel, Sichao Yang
CDC
2009
IEEE
119views Control Systems» more  CDC 2009»
15 years 2 months ago
Linear Parameter Varying Iterative Learning Control
— In this paper an Iterative Learning Control (ILC) algorithm is proposed for a certain class of Linear Parameter Varying (LPV) systems whose dynamics change between iterations. ...
Mark Edward John Butcher, Alireza Karimi
CORR
2010
Springer
66views Education» more  CORR 2010»
14 years 9 months ago
Computing the speed of convergence of ergodic averages and pseudorandom points in computable dynamical systems
A pseudorandom point in an ergodic dynamical system over a computable metric space is a point which is computable but its dynamics has the same statistical behavior of a typical po...
Stefano Galatolo, Mathieu Hoyrup, Cristobal Rojas
JMLR
2012
13 years 3 days ago
Multi Kernel Learning with Online-Batch Optimization
In recent years there has been a lot of interest in designing principled classification algorithms over multiple cues, based on the intuitive notion that using more features shou...
Francesco Orabona, Jie Luo, Barbara Caputo