Search Sciweavers | Sciweavers

2711 search results - page 239 / 543

» Convergence of the Wake-Sleep Algorithm

CDC
2010
IEEE

160 views, Control Systems

Aggregation-based model reduction of a Hidden Markov Model

14 years 12 months ago

Download mechse.illinois.edu

This paper is concerned with developing an information-theoretic framework to aggregate the state space of a Hidden Markov Model (HMM) on discrete state and observation spaces. The...

Kun Deng, Prashant G. Mehta, Sean P. Meyn

CDC
2010
IEEE

122 views, Control Systems

Nonholonomic source seeking in switching random fields

14 years 12 months ago

Download www.seas.upenn.edu

We consider the problem of designing controllers for nonholonomic mobile robots converging to the source (minimum) of a field. In addition to the mobility constraints posed by the ...

Shun-ichi Azuma, Mahmut Selman Sakar, George J. Pa...

CDC
2010
IEEE

109 views, Control Systems

On-line optimal timing control of switched systems

14 years 12 months ago

Download www.prism.gatech.edu

This paper considers a real-time algorithm for performance optimization of switched-mode hybrid dynamical systems. The controlled parameter consists of the switching times between ...

Yorai Wardi, Philip Twu, Magnus Egerstedt

CORR
2011
Springer

205 views, Education

Parallel Coordinate Descent for L1-Regularized Loss Minimization

14 years 8 months ago

Download www.icml-2011.org

We propose Shotgun, a parallel coordinate descent algorithm for minimizing L1-regularized losses. Though coordinate descent seems inherently sequential, we prove convergence bounds...

Joseph K. Bradley, Aapo Kyrola, Danny Bickson, Car...

AAAI
2011

144 views, Intelligent Agents

Differential Eligibility Vectors for Advantage Updating and Gradient Methods

14 years 5 months ago

Download gaips.inesc-id.pt

In this paper we propose differential eligibility vectors (DEV) for temporal-difference (TD) learning, a new class of eligibility vectors designed to bring out the contribution of...

Francisco S. Melo
