Search Sciweavers | Sciweavers

3837 search results - page 65 / 768

» Learning Approximate Consistencies

125

click to vote

ICML
2008
IEEE

113views Machine Learning» more ICML 2008»

Training restricted Boltzmann machines using approximations to the likelihood gradient

16 years 2 months ago

Download www.cs.toronto.edu

A new algorithm for training Restricted Boltzmann Machines is introduced. The algorithm, named Persistent Contrastive Divergence, is different from the standard Contrastive Diverg...

Tijmen Tieleman

claim paper

Read More »

111

click to vote

ICRA
2009
IEEE

143views Robotics» more ICRA 2009»

Least absolute policy iteration for robust value function approximation

15 years 8 months ago

Download sugiyama-www.cs.titech.ac.jp

Abstract— Least-squares policy iteration is a useful reinforcement learning method in robotics due to its computational efﬁciency. However, it tends to be sensitive to outliers...

Masashi Sugiyama, Hirotaka Hachiya, Hisashi Kashim...

claim paper

Read More »

128

click to vote

ICML
1996
IEEE

196views Machine Learning» more ICML 1996»

A Convergent Reinforcement Learning Algorithm in the Continuous Case: The Finite-Element Reinforcement Learning

15 years 6 months ago

Download www.ri.cmu.edu

This paper presents a direct reinforcement learning algorithm, called Finite-Element Reinforcement Learning, in the continuous case, i.e. continuous state-space and time. The eval...

Rémi Munos

claim paper

Read More »

117

click to vote

IJCAI
2007

227views Artificial Intelligence» more IJCAI 2007»

A Scalable Kernel-Based Algorithm for Semi-Supervised Metric Learning

15 years 3 months ago

Download www.cs.ust.hk

In recent years, metric learning in the semisupervised setting has aroused a lot of research interests. One type of semi-supervised metric learning utilizes supervisory informatio...

Dit-Yan Yeung, Hong Chang, Guang Dai

claim paper

Read More »

120

click to vote

CORR
2006
Springer

104views Education» more CORR 2006»

Loop corrections for approximate inference

15 years 2 months ago

Download www.snn.ru.nl

We propose a method to improve approximate inference methods by correcting for the influence of loops in the graphical model. The method is a generalization and alternative implem...

Joris M. Mooij, Bert Kappen

claim paper

Read More »

« Prev « First page 65 / 768 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers