Sciweavers

147 search results - page 13 / 30
» Speech Enhancement Using Gaussian Scale Mixture Models
Sort
View
TVCG
2012
191views Hardware» more  TVCG 2012»
13 years 2 months ago
Live Speech Driven Head-and-Eye Motion Generators
—This paper describes a fully automated framework to generate realistic head motion, eye gaze, and eyelid motion simultaneously based on live (or recorded) speech input. Its cent...
Binh Huy Le, Xiaohan Ma, Zhigang Deng
ICASSP
2010
IEEE
14 years 12 months ago
Acoustic front-end optimization for bird species recognition
The goal of this work was to explore the optimization of the feature extraction module (front-end) parameters to improve bird species recognition. We explored optimizing the spect...
Martin Graciarena, Michelle Delplanche, Elizabeth ...
MLMI
2005
Springer
15 years 5 months ago
The TNO Speaker Diarization System for NIST RT05s Meeting Data
The TNO speaker speaker diarization system is based on a standard BIC segmentation and clustering algorithm. Since for the NIST Rich Transcription speaker dizarization evaluation m...
David van Leeuwen
101
Voted
SPEECH
1998
69views more  SPEECH 1998»
14 years 11 months ago
Noisy speech enhancement using discrete cosine transform
This paper illustrates the advantages of using the Discrete Cosine Transform (DCT) as compared to the standard Discrete Fourier Transform (DFT) for the purpose of removing noise e...
Ing Yann Soon, Soo Ngee Koh, Chai Kiat Yeo
CSL
2010
Springer
14 years 12 months ago
Monaural speech separation based on MAXVQ and CASA for robust speech recognition
Robustness is one of the most important topics for automatic speech recognition (ASR) in practical applications. Monaural speech separation based on computational auditory scene a...
Peng Li, Yong Guan, Shijin Wang, Bo Xu, Wenju Liu