We establish the Stein phenomenon in the context of two-step, monotone incomplete data drawn from N_{p+q}(µ, Σ), a multivariate normal population with mean µ and covariance matrix...
In this paper we show how common speech recognition training criteria such as the Minimum Phone Error criterion or the Maximum Mutual Information criterion can be extended to inco...
Tsochantaridis et al. (2005) proposed two formulations for maximum margin training of structured spaces: margin scaling and slack scaling. While margin scaling has been extensivel...
The mixmod (mixture modeling) program fits mixture models to a given data set for the purposes of density estimation, clustering or discriminant analysis. A large variety of algor...
We address the problem of learning the parameters in graphical models when inference is intractable. A common strategy in this case is to replace the partition function with its B...