Sciweavers

7928 search results - page 272 / 1586
» Human-Like Learning Methods for a
Sort
View
NIPS
2007
15 years 7 months ago
Incremental Natural Actor-Critic Algorithms
We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...
Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...
ATAL
2005
Springer
15 years 11 months ago
Improving reinforcement learning function approximators via neuroevolution
Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...
Shimon Whiteson
CVPR
2007
IEEE
16 years 8 months ago
Hybrid learning of large jigsaws
A jigsaw is a recently proposed generative model that describes an image as a composition of non-overlapping patches of varying shape, extracted from a latent image. By learning t...
Julia A. Lasserre, Anitha Kannan, John M. Winn
ICPR
2008
IEEE
16 years 7 months ago
Learning weighted distances for relevance feedback in image retrieval
We present a new method for relevance feedback in image retrieval and a scheme to learn weighted distances which can be used in combination with different relevance feedback metho...
Enrique Vidal, Hermann Ney, Roberto Paredes, Thoma...
JMLR
2010
103views more  JMLR 2010»
15 years 28 days ago
Learning Nonlinear Dynamic Models from Non-sequenced Data
Virtually all methods of learning dynamic systems from data start from the same basic assumption: the learning algorithm will be given a sequence of data generated from the dynami...
Tzu-Kuo Huang, Le Song, Jeff Schneider