Sciweavers

INTERSPEECH
2010
14 years 5 months ago
Setup for acoustic-visual speech synthesis by concatenating bimodal units
This paper presents preliminary work on building a system able to synthesize concurrently the speech signal and a 3D animation of the speaker's face. This is done by concaten...
Asterios Toutios, Utpala Musti, Slim Ouni, Vincent...
INTERSPEECH
2010
14 years 5 months ago
Learning speaker normalization using semisupervised manifold alignment
As a child acquires language, he or she: perceives acoustic information in his or her surrounding environment; identifies portions of the ambient acoustic information as languager...
Andrew R. Plummer, Mary E. Beckman, Mikhail Belkin...
INTERSPEECH
2010
14 years 5 months ago
Competition in the perception of spoken Japanese words
Japanese listeners detected Japanese words embedded at the end of nonsense sequences (e.g., kaba 'hippopotamus' in gyachikaba). When the final portion of the preceding c...
Takashi Otake, James M. McQueen, Anne Cutler
INTERSPEECH
2010
14 years 5 months ago
Augmented set of features for confidence estimation in spoken term detection
Discriminative confidence estimation along with confidence normalisation have been shown to construct robust decision maker modules in spoken term detection (STD) systems. Discrim...
Javier Tejedor, Doroteo Torre Toledano, Miguel Bau...
INTERSPEECH
2010
14 years 5 months ago
Phone mismatch penalty matrices for two-stage keyword spotting via multi-pass phone recognizer
In this paper, we propose a novel approach to estimate three types of phone mismatch penalty matrices for two-stage keyword spotting. When the output of a phone recognizer is give...
Chang Woo Han, Shin Jae Kang, Chul Min Lee, Nam So...