Sciweavers

3837 search results - page 37 / 768
» Learning Approximate Consistencies
Sort
View
96
Voted
COLT
2000
Springer
15 years 6 months ago
Estimation and Approximation Bounds for Gradient-Based Reinforcement Learning
We model reinforcement learning as the problem of learning to control a Partially Observable Markov Decision Process (  ¢¡¤£¦¥§  ), and focus on gradient ascent approache...
Peter L. Bartlett, Jonathan Baxter
144
Voted
ML
2008
ACM
174views Machine Learning» more  ML 2008»
15 years 1 months ago
ALLPAD: approximate learning of logic programs with annotated disjunctions
In this paper we present the system ALLPAD for learning Logic Programs with Annotated Disjunctions (LPADs). ALLPAD modifies the previous system LLPAD in order to tackle real world ...
Fabrizio Riguzzi
NIPS
2000
15 years 3 months ago
Rate-coded Restricted Boltzmann Machines for Face Recognition
We describe a neurally-inspired, unsupervised learning algorithm that builds a non-linear generative model for pairs of face images from the same individual. Individuals are then ...
Yee Whye Teh, Geoffrey E. Hinton
102
Voted
ICRA
2008
IEEE
113views Robotics» more  ICRA 2008»
15 years 8 months ago
Reinforcement learning with function approximation for cooperative navigation tasks
— In this paper, we propose a reinforcement learning approach to address multi-robot cooperative navigation tasks in infinite settings. We propose an algorithm to simultaneously...
Francisco S. Melo, M. Isabel Ribeiro
112
Voted
NSDI
2004
15 years 3 months ago
Consistent and Automatic Replica Regeneration
Reducing management costs and improving the availability of large-scale distributed systems require automatic replica regeneration, i.e., creating new replicas in response to repl...
Haifeng Yu, Amin Vahdat