Sciweavers

249 search results - page 24 / 50
» Subspace Gaussian Mixture Models for speech recognition
TVCG
2012
13 years 3 days ago
Live Speech Driven Head-and-Eye Motion Generators
This paper describes a fully automated framework to generate realistic head motion, eye gaze, and eyelid motion simultaneously based on live (or recorded) speech input. Its cent...
Binh Huy Le, Xiaohan Ma, Zhigang Deng
MLMI
2005
Springer
15 years 3 months ago
The TNO Speaker Diarization System for NIST RT05s Meeting Data
The TNO speaker diarization system is based on a standard BIC segmentation and clustering algorithm. Since for the NIST Rich Transcription speaker diarization evaluation m...
David van Leeuwen
TASLP
2010
14 years 4 months ago
Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model
This article addresses the modeling of reverberant recording environments in the context of under-determined convolutive blind source separation. We model the contribution of each ...
Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gr...
ISMIR
2004
Springer
15 years 3 months ago
Instrument identification in solo and ensemble music using Independent Subspace Analysis
We investigate the use of Independent Subspace Analysis (ISA) for instrument identification in musical recordings. We represent short-term log-power spectra of possibly polyphoni...
Emmanuel Vincent, Xavier Rodet
BMVC
2010
14 years 7 months ago
Local Gaussian Processes for Pose Recognition from Noisy Inputs
Gaussian processes have been widely used as a method for inferring the pose of articulated bodies directly from image data. While able to model complex non-linear functions, they ...
Martin Fergie, Aphrodite Galata