Search Sciweavers | Sciweavers

65 search results - page 2 / 13

» Gradient Descent for General Reinforcement Learning

click to vote

GECCO
2006
Springer

159views Optimization» more GECCO 2006»

Standard and averaging reinforcement learning in XCS

13 years 9 months ago

Download www.cs.bham.ac.uk

This paper investigates reinforcement learning (RL) in XCS. First, it formally shows that XCS implements a method of generalized RL based on linear approximators, in which the usu...

Pier Luca Lanzi, Daniele Loiacono

claim paper

Read More »

click to vote

ICML
2009
IEEE

158views Machine Learning» more ICML 2009»

Gradient descent with sparsification: an iterative algorithm for sparse recovery with restricted isometry property

14 years 6 months ago

Download www.cs.mcgill.ca

We present an algorithm for finding an ssparse vector x that minimizes the squareerror y - x 2 where satisfies the restricted isometry property (RIP), with isometric constant 2s ...

Rahul Garg, Rohit Khandekar

claim paper

Read More »

click to vote

JMLR
2012

200views Programming Languages» more JMLR 2012»

Krylov Subspace Descent for Deep Learning

11 years 7 months ago

Download jmlr.csail.mit.edu

In this paper, we propose a second order optimization method to learn models where both the dimensionality of the parameter space and the number of training samples is high. In ou...

Oriol Vinyals, Daniel Povey

claim paper

Read More »

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

13 years 6 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

click to vote

ICDM
2010
IEEE

167views Data Mining» more ICDM 2010»

Averaged Stochastic Gradient Descent with Feedback: An Accurate, Robust, and Fast Training Method

13 years 3 months ago

Download www.ibis.t.u-tokyo.ac.jp

On large datasets, the popular training approach has been stochastic gradient descent (SGD). This paper proposes a modification of SGD, called averaged SGD with feedback (ASF), tha...

Xu Sun, Hisashi Kashima, Takuya Matsuzaki, Naonori...

claim paper

Read More »

« Prev « First page 2 / 13 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers