This paper describes a new representation for the audio and visual information in a video signal. We reduce the dimensionality of the signals with singular-value decompositions (S...
DIn this paper 1 , we propose a shape-based variational framework to curve evolution for the segmentation of tongue contours from MRI mid-sagittal images. In particular, we first...
This study explores manifold representations of emotionally modulated speech. The manifolds are derived in the articulatory space and two acoustic spaces (MFB and MFCC) using isom...
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of the...
Daniel Povey, Lukas Burget, Mohit Agarwal, Pinar A...
We propose a new framework for speaker recognition, referred as Fishervoice. It includes the design of a feature representation known as the structured score vector (SSV), which r...