Search Sciweavers | Sciweavers

180 search results - page 9 / 36

» On the Convergence Rate of Good-Turing Estimators

122

click to vote

CORR
2011
Springer

167views Education» more CORR 2011»

Fast global convergence of gradient methods for high-dimensional statistical recovery

14 years 7 months ago

Download www.cs.berkeley.edu

Many statistical M-estimators are based on convex optimization problems formed by the weighted sum of a loss function with a norm-based regularizer. We analyze the convergence rat...

Alekh Agarwal, Sahand Negahban, Martin J. Wainwrig...

claim paper

Read More »

124

Voted

ISBI
2004
IEEE

150views Medical Imaging» more ISBI 2004»

A Fast Fully 4D Incremental Gradient Reconstruction Algorithm for List Mode PET Data

16 years 1 months ago

Download neuroimage.usc.edu

We present a fully four-dimensional, globally convergent, incremental gradient algorithm to estimate the continuous-time tracer density from list mode positron emission tomography...

Quanzheng Li, Evren Asma, Richard M. Leahy

claim paper

Read More »

113

Voted

TNN
2010

176views Management» more TNN 2010»

On the weight convergence of Elman networks

14 years 7 months ago

Download www3.ntu.edu.sg

Abstract--An Elman network (EN) can be viewed as a feedforward (FF) neural network with an additional set of inputs from the context layer (feedback from the hidden layer). Therefo...

Qing Song

claim paper

Read More »

Voted

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 4 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

111

Voted

ICML
2010
IEEE

222views Machine Learning» more ICML 2010»

Temporal Difference Bayesian Model Averaging: A Bayesian Perspective on Adapting Lambda

14 years 10 months ago

Download www.icml2010.org

Temporal difference (TD) algorithms are attractive for reinforcement learning due to their ease-of-implementation and use of "bootstrapped" return estimates to make effi...

Carlton Downey, Scott Sanner

claim paper

Read More »

« Prev « First page 9 / 36 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers