Sciweavers

344 search results - page 54 / 69
» speech 2011
ICASSP
2011
IEEE
14 years 1 month ago
Dynamics of tongue gestures extracted automatically from ultrasound
We describe a system for automatically extracting dynamics of tongue gestures from ultrasound images of the tongue using translational deep belief networks (tDBNs). In tDBNs, a jo...
Jeff Berry, Ian Fasel
ICASSP
2011
IEEE
14 years 1 month ago
Deep belief nets for natural language call-routing
This paper considers application of Deep Belief Nets (DBNs) to natural language call routing. DBNs have been successfully applied to a number of tasks, including image, audio and ...
Ruhi Sarikaya, Geoffrey E. Hinton, Bhuvana Ramabha...
ICASSP
2011
IEEE
14 years 1 month ago
Gain-robust multi-pitch tracking using sparse nonnegative matrix factorization
While nonnegative matrix factorization (NMF) has successfully been applied for gain-robust multi-pitch detection, a method to track pitch values over time was not provided. We emb...
Robert Peharz, Michael Wohlmayr, Franz Pernkopf
ICASSP
2011
IEEE
14 years 1 month ago
Generating avatar's facial expressions from emotional states in daily conversation
A framework for generating facial expressions from emotional states in daily conversation is described. The framework allows avatars to express the speaker’s state not just prot...
Hiroki Mori, Ko Oshima, Makoto Nakamura
ICASSP
2011
IEEE
14 years 1 month ago
Real time speaker localization and detection system for camera steering in multiparticipant videoconferencing environments
A real time speaker localization and detection system for videoconferencing environments is presented. In this system, a recently proposed modified Steered Response Power - Phase...
Amparo Marti, Maximo Cobos, José J. Ló...