Search Sciweavers | Sciweavers

107 search results - page 7 / 22

» Learning to rank using gradient descent

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 17 days ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

103

click to vote

ICML
1995
IEEE

184views Machine Learning» more ICML 1995»

Residual Algorithms: Reinforcement Learning with Function Approximation

15 years 12 months ago

Download www.leemon.com

A number of reinforcement learning algorithms have been developed that are guaranteed to converge to the optimal solution when used with lookup tables. It is shown, however, that ...

Leemon C. Baird III

claim paper

Read More »

click to vote

EMNLP
2009

123views Natural Language Processing» more EMNLP 2009»

First- and Second-Order Expectation Semirings with Applications to Minimum-Risk Training on Translation Forests

14 years 9 months ago

Download cs.jhu.edu

Many statistical translation models can be regarded as weighted logical deduction. Under this paradigm, we use weights from the expectation semiring (Eisner, 2002), to compute fir...

Zhifei Li, Jason Eisner

claim paper

Read More »

click to vote

JMLR
2012

176views Programming Languages» more JMLR 2012»

SpeedBoost: Anytime Prediction with Uniform Near-Optimality

13 years 1 months ago

Download www.ri.cmu.edu

We present SpeedBoost, a natural extension of functional gradient descent, for learning anytime predictors, which automatically trade computation time for predictive accuracy by s...

Alexander Grubb, Drew Bagnell

claim paper

Read More »

click to vote

ICANN
2009
Springer

113views Neural Networks» more ICANN 2009»

Evolving Memory Cell Structures for Sequence Learning

15 years 5 months ago

Download julian.togelius.com

The best recent supervised sequence learning methods use gradient descent to train networks of miniature nets called memory cells. The most popular cell structure seems somewhat ar...

Justin Bayer, Daan Wierstra, Julian Togelius, J&uu...

claim paper

Read More »

« Prev « First page 7 / 22 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers