Search Sciweavers | Sciweavers

2711 search results - page 149 / 543

» Convergence of the Wake-Sleep Algorithm

141

click to vote

ICMLA
2010

203views Machine Learning» more ICMLA 2010»

Multimodal Parameter-exploring Policy Gradients

15 years 2 months ago

Download www6.in.tum.de

Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...

Frank Sehnke, Alex Graves, Christian Osendorfer, J...

claim paper

Read More »

151

click to vote

CDC
2010
IEEE

89views Control Systems» more CDC 2010»

Stochastic approximation for consensus with general time-varying weight matrices

14 years 11 months ago

Download mathstat.carleton.ca

This paper considers consensus problems with delayed noisy measurements, and stochastic approximation is used to achieve mean square consensus. For stochastic approximation based c...

Minyi Huang

claim paper

Read More »

142

click to vote

CDC
2010
IEEE

124views Control Systems» more CDC 2010»

Hybrid control for navigation of shape-accelerated underactuated balancing systems

14 years 11 months ago

Download www.cs.cmu.edu

This paper presents a hybrid control strategy for navigation of shape-accelerated underactuated balancing systems with dynamic constraints. It extends the concept of sequential com...

Umashankar Nagarajan, George Kantor, Ralph L. Holl...

claim paper

Read More »

139

click to vote

SIAMNUM
2010

103views more SIAMNUM 2010»

Hybridization and Postprocessing Techniques for Mixed Eigenfunctions

14 years 11 months ago

Download www.rpi.edu

Abstract. We introduce hybridization and postprocessing techniques for the RaviartThomas approximation of second-order elliptic eigenvalue problems. Hybridization reduces the Ravia...

Bernardo Cockburn, Jayadeep Gopalakrishnan, F. Li,...

claim paper

Read More »

162

click to vote

AAAI
2011

168views Intelligent Agents» more AAAI 2011»

Dual Decomposition for Marginal Inference

14 years 4 months ago

Download phd.gccis.rit.edu

We present a dual decomposition approach to the treereweighted belief propagation objective. Each tree in the tree-reweighted bound yields one subproblem, which can be solved with...

Justin Domke

claim paper

Read More »

« Prev « First page 149 / 543 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers