Sciweavers

1997 search results - page 122 / 400
» On the convergence of Hill's method
Sort
View
IJCAI
2001
15 years 2 months ago
Exploiting Multiple Secondary Reinforcers in Policy Gradient Reinforcement Learning
Most formulations of Reinforcement Learning depend on a single reinforcement reward value to guide the search for the optimal policy solution. If observation of this reward is rar...
Gregory Z. Grudic, Lyle H. Ungar
NIPS
2004
15 years 2 months ago
Limits of Spectral Clustering
An important aspect of clustering algorithms is whether the partitions constructed on finite samples converge to a useful clustering of the whole data space as the sample size inc...
Ulrike von Luxburg, Olivier Bousquet, Mikhail Belk...
MP
2002
195views more  MP 2002»
15 years 27 days ago
Nonlinear rescaling vs. smoothing technique in convex optimization
We introduce an alternative to the smoothing technique approach for constrained optimization. As it turns out for any given smoothing function there exists a modification with part...
Roman A. Polyak
AMC
2010
175views more  AMC 2010»
15 years 1 months ago
An inexact parallel splitting augmented Lagrangian method for large system of linear equations
: Parallel iterative methods are powerful tool for solving large system of linear equations (LEs). The existing parallel computing research results are focussed mainly on sparse sy...
Zheng Peng, DongHua Wu
ISBI
2006
IEEE
15 years 7 months ago
Consistent spherical parameterisation for statistical shape modelling
We have described previously a method of automatically constructing statistical models of shape. The method treats model-building as an optimisation problem by re-parameterising ea...
Rhodri H. Davies, Carole J. Twining, Christopher J...