Sciweavers

» Convergence of the Wake-Sleep Algorithm
CDC
2010
IEEE
Aggregation-based model reduction of a Hidden Markov Model
This paper is concerned with developing an information-theoretic framework to aggregate the state space of a Hidden Markov Model (HMM) on discrete state and observation spaces. The...
Kun Deng, Prashant G. Mehta, Sean P. Meyn
CDC
2010
IEEE
Nonholonomic source seeking in switching random fields
We consider the problem of designing controllers for nonholonomic mobile robots converging to the source (minimum) of a field. In addition to the mobility constraints posed by the ...
Shun-ichi Azuma, Mahmut Selman Sakar, George J. Pa...
CDC
2010
IEEE
On-line optimal timing control of switched systems
This paper considers a real-time algorithm for performance optimization of switched-mode hybrid dynamical systems. The controlled parameter consists of the switching times between ...
Yorai Wardi, Philip Twu, Magnus Egerstedt
CORR
2011
Springer
Parallel Coordinate Descent for L1-Regularized Loss Minimization
We propose Shotgun, a parallel coordinate descent algorithm for minimizing L1-regularized losses. Though coordinate descent seems inherently sequential, we prove convergence bounds...
Joseph K. Bradley, Aapo Kyrola, Danny Bickson, Car...
AAAI
2011
Differential Eligibility Vectors for Advantage Updating and Gradient Methods
In this paper we propose differential eligibility vectors (DEV) for temporal-difference (TD) learning, a new class of eligibility vectors designed to bring out the contribution of...
Francisco S. Melo