Sciweavers

841 search results - page 105 / 169
» An Estimation of Distribution Particle Swarm Optimization Al...
Sort
View
NIPS
2008
15 years 4 months ago
Signal-to-Noise Ratio Analysis of Policy Gradient Algorithms
Policy gradient (PG) reinforcement learning algorithms have strong (local) convergence guarantees, but their learning performance is typically limited by a large variance in the e...
John W. Roberts, Russ Tedrake
RT
2001
Springer
15 years 7 months ago
Path Differentials and Applications
Abstract. Photo-realistic rendering algorithms such as Monte Carlo ray tracing sample individual paths to compute images. Noise and aliasing artefacts are usually reduced by supers...
Frank Suykens, Yves D. Willems
CORR
2008
Springer
133views Education» more  CORR 2008»
15 years 3 months ago
Estimating divergence functionals and the likelihood ratio by convex risk minimization
We develop and analyze M-estimation methods for divergence functionals and the likelihood ratios of two probability distributions. Our method is based on a non-asymptotic variatio...
XuanLong Nguyen, Martin J. Wainwright, Michael I. ...
ICCV
2011
IEEE
14 years 6 months ago
Outdoor Human Motion Capture using Inverse Kinematics and von Mises-Fisher Sampling
Human motion capturing (HMC) from multiview image sequences constitutes an extremely difficult problem due to depth and orientation ambiguities and the high dimensionality of the s...
Gerard Pons-Moll, Andreas Baak, Juergen Gall, Laur...
GECCO
2004
Springer
15 years 8 months ago
How Are We Doing? Predicting Evolutionary Algorithm Performance
Abstract. Given an evolutionary algorithm for a problem and an instance of the problem, the results of several trials of the EA on the instance constitute a sample from the distrib...
Mark A. Renslow, Brenda Hinkemeyer, Bryant A. Juls...