Sciweavers

1997 search results - page 132 / 400
» On the convergence of Hill's method
Sort
View
CORR
2008
Springer
133views Education» more  CORR 2008»
15 years 1 months ago
Estimating divergence functionals and the likelihood ratio by convex risk minimization
We develop and analyze M-estimation methods for divergence functionals and the likelihood ratios of two probability distributions. Our method is based on a non-asymptotic variatio...
XuanLong Nguyen, Martin J. Wainwright, Michael I. ...
JMIV
2008
94views more  JMIV 2008»
15 years 1 months ago
Measuring Elongation from Shape Boundary
Abstract Shape elongation is one of the basic shape descriptors that has a very clear intuitive meaning. That is the reason for its applicability in many shape classification tasks...
Milos Stojmenovic, Jovisa D. Zunic
ORL
2008
124views more  ORL 2008»
15 years 1 months ago
Sample average approximation of expected value constrained stochastic programs
We propose a sample average approximation (SAA) method for stochastic programming problems involving an expected value constraint. Such problems arise, for example, in portfolio s...
Wei Wang, Shabbir Ahmed
SIAMCO
2008
112views more  SIAMCO 2008»
15 years 1 months ago
Stable Synchronization of Mechanical System Networks
In this paper we address stabilization of a network of underactuated mechanical systems with unstable dynamics. The coordinating control law stabilizes the unstable dynamics with ...
Sujit Nair, Naomi Ehrich Leonard
ICMLA
2010
14 years 11 months ago
Multimodal Parameter-exploring Policy Gradients
Abstract-- Policy Gradients with Parameter-based Exploration (PGPE) is a novel model-free reinforcement learning method that alleviates the problem of high-variance gradient estima...
Frank Sehnke, Alex Graves, Christian Osendorfer, J...