Sciweavers

ACSC
2004
IEEE
13 years 8 months ago
Sensor Fusion Weighting Measures in Audio-Visual Speech Recognition
Audio-Visual Speech Recognition (AVSR) uses vision to enhance speech recognition but also introduces the problem of how to join (or fuse) these two signals together. Mainstream re...
Trent W. Lewis, David M. W. Powers
ACMSE
2006
ACM
13 years 8 months ago
A speech recognition and synthesis tool
Many of the new technologies designed to help worldwide communication
Hala ElAarag, Laura Schindler
UIST
1992
ACM
13 years 8 months ago
Tools for Building Asynchronous Servers to Support Speech and Audio Applications
Distributed clientisewer models are becoming increasingly prevalent in multimedia systems and advanced user interface design. A multimedia application, for example, may play and r...
Barry Arons
ECCV
1998
Springer
13 years 9 months ago
Continuous Audio-Visual Speech Recognition
The Multi-Stream automatic speech recognition approach was investigated in this work as a framework for Audio-Visual data fusion and speech recognition. This method presents many ...
Juergen Luettin, Stéphane Dupont
ADL
1998
Springer
164views Digital Library» more  ADL 1998»
13 years 9 months ago
Story Segmentation and Detection of Commercials in Broadcast News Video
The Informedia Digital Library Project [Wactlar96] allows full content indexing and retrieval of text, audio and video material. Segmentation is an integral process in the Informe...
Alexander G. Hauptmann, Michael J. Witbrock
SPIRE
1999
Springer
13 years 9 months ago
Cross-Domain Approximate String Matching
Approximate string matching is an important paradigm in domains ranging from speech recognition to information retrieval and molecular biology. In this paper, we introduce a new f...
Daniel P. Lopresti, Gordon T. Wilfong
PG
1999
IEEE
13 years 9 months ago
A Speech Driven Talking Head System Based on a Single Face Image
In this paper, a lifelike talking head system is proposed. The talking head, which is driven by speaker independent speech recognition, requires only one single face image to synt...
I-Chen Lin, Cheng-Sheng Hung, Tzong-Jer Yang, Ming...
ICMCS
2000
IEEE
90views Multimedia» more  ICMCS 2000»
13 years 9 months ago
Towards a Multimodal Meeting Record
Face-to-face meetings usually encompass several modalities including speech, gesture, handwriting, and person identification. Recognition and integration of each of these modalit...
Ralph Gross, Michael Bett, Hua Yu, Xiaojin Zhu, Yu...
CIKM
2001
Springer
13 years 9 months ago
Towards Speech as a Knowledge Resource
Speech is a tantalizing mode of human communication. On one hand, humans understand speech with ease and use speech to express complex ideas, information, and knowledge. On the ot...
Eric W. Brown, Savitha Srinivasan, Anni Coden, Dul...
ICMI
2003
Springer
166views Biometrics» more  ICMI 2003»
13 years 10 months ago
Georgia tech gesture toolkit: supporting experiments in gesture recognition
Gesture recognition is becoming a more common interaction tool in the fields of ubiquitous and wearable computing. Designing a system to perform gesture recognition, however, can...
Tracy L. Westeyn, Helene Brashear, Amin Atrash, Th...