Search Sciweavers | Sciweavers

3837 search results - page 37 / 768

» Learning Approximate Consistencies

Voted

COLT
2000
Springer

87views Machine Learning» more COLT 2000»

Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning

15 years 6 months ago

Download www.cs.iastate.edu

We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process ( ¢¡¤£¦¥§ ), and focus on gradient ascent approache...

Peter L. Bartlett, Jonathan Baxter

claim paper

Read More »

144

Voted

ML
2008
ACM

174views Machine Learning» more ML 2008»

ALLPAD: approximate learning of logic programs with annotated disjunctions

15 years 1 months ago

Download www.ing.unife.it

In this paper we present the system ALLPAD for learning Logic Programs with Annotated Disjunctions (LPADs). ALLPAD modifies the previous system LLPAD in order to tackle real world ...

Fabrizio Riguzzi

claim paper

Read More »

129

click to vote

NIPS
2000

122views Information Technology» more NIPS 2000»

Rate-coded Restricted Boltzmann Machines for Face Recognition

15 years 3 months ago

Download www.cs.toronto.edu

We describe a neurally-inspired, unsupervised learning algorithm that builds a non-linear generative model for pairs of face images from the same individual. Individuals are then ...

Yee Whye Teh, Geoffrey E. Hinton

claim paper

Read More »

102

Voted

ICRA
2008
IEEE

113views Robotics» more ICRA 2008»

Reinforcement learning with function approximation for cooperative navigation tasks

15 years 8 months ago

Download gaips.inesc-id.pt

— In this paper, we propose a reinforcement learning approach to address multi-robot cooperative navigation tasks in inﬁnite settings. We propose an algorithm to simultaneously...

Francisco S. Melo, M. Isabel Ribeiro

claim paper

Read More »

112

Voted

NSDI
2004

127views Computer Networks» more NSDI 2004»

Consistent and Automatic Replica Regeneration

15 years 3 months ago

Download dslab.csie.ncu.edu.tw

Reducing management costs and improving the availability of large-scale distributed systems require automatic replica regeneration, i.e., creating new replicas in response to repl...

Haifeng Yu, Amin Vahdat

claim paper

Read More »

« Prev « First page 37 / 768 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers