Sciweavers

32 search results - page 6 / 7
» Translingual Visual Speech Synthesis
FGR
2004
IEEE
13 years 9 months ago
Trainable Videorealistic Speech Animation
We describe how to create with machine learning techniques a generative, videorealistic, speech animation module. A human subject is first recorded using a videocamera as he/she u...
Tony Ezzat, Gadi Geiger, Tomaso Poggio
MM
2009
ACM
14 years 3 days ago
Visual speaker localization aided by acoustic models
The following paper presents a novel audio-visual approach for unsupervised speaker locationing. Using recordings from a single, low-resolution room overview camera and a single f...
Gerald Friedland, Chuohao Yeo, Hayley Hung
CSL
2002
Springer
13 years 5 months ago
Learning visually grounded words and syntax for a scene description task
A spoken language generation system has been developed that learns to describe objects in computer-generated visual scenes. The system is trained by a 'show-and-tell' procedu...
Deb K. Roy
CORR
2010
Springer
13 years 5 months ago
Modelling of Human Glottis in VLSI for Low Power Architectures
The Glottal Source is an important component of voice as it can be considered as the excitation signal to the voice apparatus. Nowadays, new techniques of speech processing such a...
Nikhil Raj, R. K. Sharma
CHI
2006
ACM
14 years 6 months ago
Error correction of voicemail transcripts in SCANMail
Despite its widespread use, voicemail presents numerous usability challenges: People must listen to messages in their entirety, they cannot search by keywords, and audio files do ...
Moira Burke, Brian Amento, Philip L. Isenhour