Search Sciweavers | Sciweavers

12 search results - page 1 / 3

» Topmoumoute Online Natural Gradient Algorithm

154

click to vote

NIPS
2007

142views Information Technology» more NIPS 2007»

Topmoumoute Online Natural Gradient Algorithm

15 years 7 months ago

Download books.nips.cc

Guided by the goal of obtaining an optimization algorithm that is both fast and yields good generalization, we study the descent direction maximizing the decrease in generalizatio...

Nicolas Le Roux, Pierre-Antoine Manzagol, Yoshua B...

claim paper

Read More »

165

Voted

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 7 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

207

click to vote

JMLR
2010

189views more JMLR 2010»

Adaptive Step-size Policy Gradients with Average Reward Metric

15 years 1 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a novel adaptive step-size approach for policy gradient reinforcement learning. A new metric is defined for policy gradients that measures the effect of ...

Takamitsu Matsubara, Tetsuro Morimura, Jun Morimot...

claim paper

Read More »

178

click to vote

ACL
2009

165views Computational Linguistics» more ACL 2009»

Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty

15 years 4 months ago

Download www.aclweb.org

Stochastic gradient descent (SGD) uses approximate gradients estimated from subsets of the training data and updates the parameters in an online fashion. This learning framework i...

Yoshimasa Tsuruoka, Jun-ichi Tsujii, Sophia Anania...

claim paper

Read More »

212

click to vote

JMLR
2008

230views more JMLR 2008»

Exponentiated Gradient Algorithms for Conditional Random Fields and Max-Margin Markov Networks

15 years 6 months ago

Download www.stat.berkeley.edu

Log-linear and maximum-margin models are two commonly-used methods in supervised machine learning, and are frequently used in structured prediction problems. Efficient learning of...

Michael Collins, Amir Globerson, Terry Koo, Xavier...

claim paper

Read More »

« Prev « First page 1 / 3 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers