Speaker recognition using support vector machines (SVMs) with features derived from generative models has been shown to perform well. Typically, a universal background model (UBM)...
Physiological properties of the glottis and the vocal tract change with age and gender. Since these changes are reflected in the speech signal, acoustic measures related to those...
Zero-norm, defined as the number of non-zero elements in a vector, is an ideal quantity for feature selection. However, minimization of zero-norm is generally regarded as a combi...
Traditional tonality mode (major or minor) classification or audio key finding algorithms often rely on tonic annotations (key names) of the training songs. However, unlike clas...
An original approach to represent 2D and 3D faces using Radial Geodesic Distances (RGDs) is proposed in this work. In 3D, the RGD of a generic point of the face surface is compute...
Stefano Berretti, Alberto Del Bimbo, Pietro Pala, ...