Search Sciweavers | Sciweavers

21 search results - page 1 / 5

» A stochastic approximation method with max-norm projections ...

click to vote

TOMACS
2010

79views more TOMACS 2010»

A stochastic approximation method with max-norm projections and its applications to the Q-learning algorithm

12 years 11 months ago

Download legacy.orie.cornell.edu

In this paper, we develop a stochastic approximation method to solve a monotone estimation problem and use this method to enhance the empirical performance of the Q-learning algor...

Sumit Kunnumkal, Huseyin Topaloglu

claim paper

Read More »

click to vote

NIPS
2008

125views Information Technology» more NIPS 2008»

An interior-point stochastic approximation method and an L1-regularized delta rule

13 years 6 months ago

Download www.cs.ubc.ca

The stochastic approximation method is behind the solution to many important, actively-studied problems in machine learning. Despite its farreaching application, there is almost n...

Peter Carbonetto, Mark Schmidt, Nando de Freitas

claim paper

Read More »

click to vote

CPAIOR
2008
Springer

198views Operations Research» more CPAIOR 2008»

Amsaa: A Multistep Anticipatory Algorithm for Online Stochastic Combinatorial Optimization

13 years 6 months ago

Download cs.brown.edu

The one-step anticipatory algorithm (1s-AA) is an online algorithm making decisions under uncertainty by ignoring future non-anticipativity constraints. It makes near-optimal decis...

Luc Mercier, Pascal Van Hentenryck

claim paper

Read More »

click to vote

ACL
2009

165views Computational Linguistics» more ACL 2009»

Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty

13 years 2 months ago

Download www.aclweb.org

Stochastic gradient descent (SGD) uses approximate gradients estimated from subsets of the training data and updates the parameters in an online fashion. This learning framework i...

Yoshimasa Tsuruoka, Jun-ichi Tsujii, Sophia Anania...

claim paper

Read More »

click to vote

CDC
2010
IEEE

104views Control Systems» more CDC 2010»

Single timescale regularized stochastic approximation schemes for monotone Nash games under uncertainty

12 years 11 months ago

Download netfiles.uiuc.edu

Abstract-- In this paper, we consider the distributed computation of equilibria arising in monotone stochastic Nash games over continuous strategy sets. Such games arise in setting...

Jayash Koshal, Angelia Nedic, Uday V. Shanbhag

claim paper

Read More »

« Prev « First page 1 / 5 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers