Sciweavers

480 search results - page 51 / 96
» Audio segmentation for speech recognition using segment feat...
Sort
View
ICMCS
2007
IEEE
150views Multimedia» more  ICMCS 2007»
15 years 7 months ago
Multicamera Audio-Visual Analysis of Dance Figures
We present a multi-camera system for audio-visual analysis of dance figures. The multi-view video of a dancing actor is acquired using 8 synchronized cameras. The motion capture t...
Ferda Ofli, Yasemin Demir, Engin Erzin, Yücel...
COGSCI
2002
99views more  COGSCI 2002»
15 years 3 months ago
Learning words from sights and sounds: a computational model
This paper presents an implemented computational model of word acquisition which learns directly from raw multimodal sensory input. Set in an information theoretic framework, the ...
Deb Roy, Alex Pentland
ACL
1990
15 years 5 months ago
Prosody, Syntax and Parsing
We describe the modification of a grammar to take advantage of prosodic information provided by a speech recognition system. This initial study is limited to the use of relative d...
John Bear, Patti Price
TASLP
2010
126views more  TASLP 2010»
15 years 2 months ago
Modeling Music as a Dynamic Texture
—We consider representing a short temporal fragment of musical audio as a dynamic texture, a model of both the timbral and rhythmical qualities of sound, two of the important asp...
Luke Barrington, Antoni B. Chan, Gert R. G. Lanckr...
123
Voted
ICASSP
2011
IEEE
14 years 7 months ago
Progress in example based automatic speech recognition
In this paper we present a number of improvements that were recently made to the template based speech recognition system developed at ESAT. Combining these improvements resulted ...
Kris Demuynck, Dino Seppi, Hugo Van hamme, Dirk Va...