Sciweavers

249 search results - page 45 / 50
» Subspace Gaussian Mixture Models for speech recognition
Sort
View
ICASSP
2008
IEEE
15 years 4 months ago
Optimizing the acoustic modeling from an unbalanced bi-lingual corpus
Phoneme set clustering of accurate modeling is important in the task of multilingual speech recognition, especially when each of the available language training corpora is mismatc...
Dau-cheng Lyu, Ren-yuan Lyu
116
Voted
CVPR
2011
IEEE
14 years 6 months ago
Saliency Estimation Using a Non-Parametric Low-Level Vision Model
Many successful models for predicting attention in a scene involve three main steps: convolution with a set of filters, a center-surround mechanism and spatial pooling to constru...
Naila Murray, Maria Vanrell, Xavier Otazu, C. Alej...
TCSV
2008
175views more  TCSV 2008»
14 years 9 months ago
Expandable Data-Driven Graphical Modeling of Human Actions Based on Salient Postures
This paper presents a graphical model for learning and recognizing human actions. Specifically, we propose to encode actions in a weighted directed graph, referred to as action gra...
Wanqing Li, Zhengyou Zhang, Zicheng Liu
ICASSP
2009
IEEE
15 years 4 months ago
Detecting bandlimited audio in broadcast television shows
For TV and radio shows containing narrowband speech, Speech-to-text (STT) accuracy on the narrowband audio can be improved by using an acoustic model trained on acoustically match...
Mark C. Fuhs, Qin Jin, Tanja Schultz
CSL
2006
Springer
14 years 9 months ago
Support vector machines for speaker and language recognition
Support vector machines (SVMs) have proven to be a powerful technique for pattern classification. SVMs map inputs into a high dimensional space and then separate classes with a hy...
William M. Campbell, Joseph P. Campbell, Douglas A...