Search Sciweavers | Sciweavers

7928 search results - page 272 / 1586

» Human-Like Learning Methods for a

NIPS
2007

164 views, Information Technology

Incremental Natural Actor-Critic Algorithms

15 years 7 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

ATAL
2005
Springer

181 views, Intelligent Agents

Improving reinforcement learning function approximators via neuroevolution

15 years 11 months ago

Download www.aaai.org

Reinforcement learning problems are commonly tackled with temporal difference methods, which use dynamic programming and statistical sampling to estimate the long-term value of ta...

Shimon Whiteson

CVPR
2007
IEEE

164 views, Computer Vision

Hybrid learning of large jigsaws

16 years 8 months ago

Download johnwinn.org

A jigsaw is a recently proposed generative model that describes an image as a composition of non-overlapping patches of varying shape, extracted from a latent image. By learning t...

Julia A. Lasserre, Anitha Kannan, John M. Winn

ICPR
2008
IEEE

151 views, Computer Vision

Learning weighted distances for relevance feedback in image retrieval

16 years 7 months ago

Download www-i6.informatik.rwth-aachen.de

We present a new method for relevance feedback in image retrieval and a scheme to learn weighted distances which can be used in combination with different relevance feedback metho...

Enrique Vidal, Hermann Ney, Roberto Paredes, Thoma...

JMLR
2010

103 views

Learning Nonlinear Dynamic Models from Non-sequenced Data

15 years 28 days ago

Download www.cs.cmu.edu

Virtually all methods of learning dynamic systems from data start from the same basic assumption: the learning algorithm will be given a sequence of data generated from the dynami...

Tzu-Kuo Huang, Le Song, Jeff Schneider
