Search Sciweavers | Sciweavers

74 search results - page 2 / 15

» Stochastic search using the natural gradient

click to vote

ISVC
2007
Springer

126views Applied Computing» more ISVC 2007»

Gradient-Based Hand Tracking Using Silhouette Data

14 years 8 days ago

Download www-sigproc.eng.cam.ac.uk

Optical motion capture can be classiﬁed as an inference problem: given the data produced by a set of cameras, the aim is to extract the hidden state, which in this case encodes t...

Paris Kaimakis, Joan Lasenby

claim paper

Read More »

click to vote

CEC
2008
IEEE

92views Artificial Intelligence» more CEC 2008»

Memetic Gradient Search

14 years 18 days ago

Download www3.ntu.edu.sg

—This paper reviews the different gradient-based schemes and the sources of gradient, their availability, precision and computational complexity, and explores the benefits of usi...

Boyang Li, Yew-Soon Ong, Minh Nghia Le, Chi Keong ...

claim paper

Read More »

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

13 years 7 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

click to vote

IJCAI
2003

169views Artificial Intelligence» more IJCAI 2003»

Covariant Policy Search

13 years 7 months ago

Download www.ri.cmu.edu

We investigate the problem of non-covariant behavior of policy gradient reinforcement learning algorithms. The policy gradient approach is amenable to analysis by information geom...

J. Andrew Bagnell, Jeff G. Schneider

claim paper

Read More »

click to vote

ACL
2009

165views Computational Linguistics» more ACL 2009»

Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty

13 years 4 months ago

Download www.aclweb.org

Stochastic gradient descent (SGD) uses approximate gradients estimated from subsets of the training data and updates the parameters in an online fashion. This learning framework i...

Yoshimasa Tsuruoka, Jun-ichi Tsujii, Sophia Anania...

claim paper

Read More »

« Prev « First page 2 / 15 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers