Sciweavers

ICASSP
2009
IEEE
13 years 10 months ago
On the phonetic information in ultrasonic microphone signals
We study the phonetic information in the signal from an ultrasonic “microphone”, a device that emits an ultrasonic wave toward a speaker and receives the reflected, Doppler-s...
Karen Livescu, Bo Zhu, James R. Glass
ICASSP
2009
IEEE
13 years 10 months ago
A study on multilingual acoustic modeling for large vocabulary ASR
We study key issues related to multilingual acoustic modeling for automatic speech recognition (ASR) through a series of large-scale ASR experiments. Our study explores shared str...
Hui Lin, Li Deng, Dong Yu, Yifan Gong, Alex Acero,...
CHI
2004
ACM
14 years 4 months ago
Semantic speech editing
Editing speech data is currently time-consuming and errorprone. Speech editors rely on acoustic waveform representations, which force users to repeatedly sample the underlying spe...
Steve Whittaker, Brian Amento
CHI
2006
ACM
14 years 4 months ago
Error correction of voicemail transcripts in SCANMail
Despite its widespread use, voicemail presents numerous usability challenges: People must listen to messages in their entirety, they cannot search by keywords, and audio files do ...
Moira Burke, Brian Amento, Philip L. Isenhour
WWW
2005
ACM
14 years 4 months ago
Web-assisted annotation, semantic indexing and search of television and radio news
The Rich News system, that can automatically annotate radio and television news with the aid of resources retrieved from the World Wide Web, is described. Automatic speech recogni...
Mike Dowman, Valentin Tablan, Hamish Cunningham, B...
ICPR
2004
IEEE
14 years 4 months ago
Structural Representation of Speech for Phonetic Classification
This paper explores the issues involved in using symbolic metric algorithms for automatic speech recognition (ASR), via a structural representation of speech. This representation ...
Alexander Gutkin, Simon King
ICPR
2008
IEEE
14 years 4 months ago
A phone-viseme dynamic Bayesian network for audio-visual automatic speech recognition
This work extends and improves a recently introduced (Dec. 2007) dynamic Bayesian network (DBN) based audio-visual automatic speech recognition (AVASR) system. That system models ...
Louis H. Terry, Aggelos K. Katsaggelos