Sciweavers

551 search results - page 86 / 111
» Multimodal Speech Synthesis
Sort
View
CSO
2009
IEEE
15 years 4 months ago
A Special Prosodic Phrasing in Broadcasting News Programs
In order to improve and convey the semantic information perfectly and vividly through synthesized speech, a special prosodic phrasing in broadcasting news programs, namely complex...
Yu Zou, Wei He, Yuqiang Zhang, Min Hou, Weibin Zhu
CEE
2010
70views more  CEE 2010»
14 years 10 months ago
An iterative linearised solution to the sinusoidal parameter estimation problem
Signal processing applications use sinusoidal modelling for speech synthesis, speech coding, and audio coding. Estimation of the model parameters involves non-linear optimisation ...
Jean-Marc Valin, Daniel V. Smith, Christopher Mont...
ICASSP
2011
IEEE
14 years 1 months ago
Decision tree-based context clustering based on cross validation and hierarchical priors
The standard, ad-hoc stopping criteria used in decision tree-based context clustering are known to be sub-optimal and require parameters to be tuned. This paper proposes a new app...
Heiga Zen, Mark J. F. Gales
FGR
2004
IEEE
126views Biometrics» more  FGR 2004»
15 years 1 months ago
Trainable Videorealistic Speech Animation
We describe how to create with machine learning techniques a generative, videorealistic, speech animation module. A human subject is first recorded using a videocamera as he/she u...
Tony Ezzat, Gadi Geiger, Tomaso Poggio
CHI
2008
ACM
15 years 10 months ago
On the benefits of confidence visualization in speech recognition
In a typical speech dictation interface, the recognizer's bestguess is displayed as normal, unannotated text. This ignores potentially useful information about the recognizer...
Keith Vertanen, Per Ola Kristensson