Search Sciweavers | Sciweavers

1997 search results - page 120 / 400

» On the convergence of Hill's method

112

click to vote

NIPS
2007

164views Information Technology» more NIPS 2007»

Incremental Natural Actor-Critic Algorithms

15 years 2 months ago

Download books.nips.cc

We present four new reinforcement learning algorithms based on actor-critic and natural-gradient ideas, and provide their convergence proofs. Actor-critic reinforcement learning m...

Shalabh Bhatnagar, Richard S. Sutton, Mohammad Gha...

claim paper

Read More »

108

click to vote

CORR
2010
Springer

119views Education» more CORR 2010»

Dynamic Policy Programming

15 years 1 months ago

Download www.snn.ru.nl

In this paper, we consider the problem of planning and learning in the infinite-horizon discounted-reward Markov decision problems. We propose a novel iterative direct policysearc...

Mohammad Gheshlaghi Azar, Hilbert J. Kappen

claim paper

Read More »

132

click to vote

CVPR
2009
IEEE

316views Computer Vision» more CVPR 2009»

On compositional Image Alignment, with an application to Active Appearance Models

15 years 8 months ago

Download www.cs.unibas.ch

Efﬁcient and accurate ﬁtting of Active Appearance Models (AAM) is a key requirement for many applications. The most efﬁcient ﬁtting algorithm today is Inverse Compositiona...

Brian Amberg, Andrew Blake, Thomas Vetter

claim paper

Read More »

111

Voted

WSC
2000

112views Modeling And Simulation» more WSC 2000»

Generating "dependent" quasi-random numbers

15 years 2 months ago

Download www.informs-sim.org

Under certain conditions on the integrand, quasi-Monte Carlo methods for estimating integrals (expectations) converge faster asymptotically than Monte Carlo methods. Motivated by ...

Shane G. Henderson, Belinda A. Chiera, Roger M. Co...

claim paper

Read More »

141

click to vote

ML
2012
ACM

388views Machine Learning» more ML 2012»

Statistical analysis of kernel-based least-squares density-ratio estimation

13 years 8 months ago

Download sugiyama-www.cs.titech.ac.jp

The ratio of two probability densities can be used for solving various machine learning tasks such as covariate shift adaptation (importance sampling), outlier detection (likeliho...

Takafumi Kanamori, Taiji Suzuki, Masashi Sugiyama

claim paper

Read More »

« Prev « First page 120 / 400 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers