Sciweavers

376 search results - page 60 / 76
» Analysis-by-synthesis features for speech recognition
Sort
View
AAAI
2008
15 years 3 days ago
Unstructured Audio Classification for Environment Recognition
My thesis aims to contribute towards building autonomous agents that are able to understand their surrounding environment through the use of both audio and visual information. To ...
Selina Chu
CGI
2004
IEEE
15 years 1 months ago
Participant Activity Detection by Hands and Face Movement Tracking in the Meeting Room
For the purpose of Multimodal Meeting Manager Project (M4), an approach based on face and a hand tracking is proposed. The technique essentially includes skin color detection, seg...
Igor Potucek, Stanislav Sumec
AIIA
2005
Springer
14 years 11 months ago
Building a Wide Coverage Dynamic Grammar
Incremental processing is relevant for language modeling, speech recognition and language generation. In this paper we devise a dynamic version of Tree Adjoining Grammar (DVTAG) th...
Alessandro Mazzei, Vincenzo Lombardo
ICASSP
2009
IEEE
14 years 7 months ago
Volterra series for analyzing MLP based phoneme posterior estimator
We present a framework to apply Volterra series to analyze multilayered perceptrons trained to estimate the posterior probabilities of phonemes in automatic speech recognition. Th...
Joel Pinto, Garimella S. V. S. Sivaram, Hynek Herm...
INTERSPEECH
2010
14 years 4 months ago
Can conversational word usage be used to predict speaker demographics?
This work surveys the potential for predicting demographic traits of individual speakers (gender, age, education level, ethnicity, and geographic region) using only word usage fea...
Dan Gillick