Sciweavers

118 search results - page 3 / 24
» Combining many alignments for speech to speech translation
Sort
View
LREC
2008
105views Education» more  LREC 2008»
13 years 6 months ago
Linguistic Resources for Reconstructing Spontaneous Speech Text
The output of a speech recognition system is not always ideal for subsequent downstream processing, in part because speakers themselves often make mistakes. A system would accompl...
Erin Fitzgerald, Frederick Jelinek
AAAI
2012
11 years 7 months ago
Online Sequence Alignment for Real-Time Audio Transcription by Non-Experts
Real-time transcription provides deaf and hard of hearing people visual access to spoken content, such as classroom instruction, and other live events. Currently, the only reliabl...
Walter S. Lasecki, Christopher D. Miller, Donato B...
ICA
2004
Springer
13 years 10 months ago
Frequency Domain Blind Source Separation for Many Speech Signals
This paper presents a method for solving the permutation problem of frequency domain blind source separation (BSS) when the number of source signals is large, and the potential sou...
Ryo Mukai, Hiroshi Sawada, Shoko Araki, Shoji Maki...
ICASSP
2009
IEEE
13 years 11 months ago
Unsupervised pronunciation validation
This paper addresses selecting between candidate pronunciations for out-of-vocabulary words in speech processing tasks. We introduce a simple, unsupervised method that outperforms...
Christopher M. White, Abhinav Sethy, Bhuvana Ramab...
ICMCS
2000
IEEE
99views Multimedia» more  ICMCS 2000»
13 years 9 months ago
Automatic Selection of Visemes for Image-Based Visual Speech Synthesis
An image-based approach provides an efficient way for visual speech synthesis. In an image-based visual speech synthesis system, a few lip images, namely visemes, are used for ge...
Jie Yang, Jing Xiao, Max Ritter