Sciweavers

551 search results - page 104 / 111
» Multimodal Speech Synthesis
Sort
View
AAAI
2004
14 years 11 months ago
SCoT: A Spoken Conversational Tutor
We describe SCoT, a Spoken Conversational Tutor, which has been implemented in order to investigate the advantages of natural language in tutoring, especially spoken language. SCo...
Karl Schultz, Brady Clark, Heather Pon-Barry, Eliz...
RIAO
2000
14 years 11 months ago
Speaker change detection using joint audio-visual statistics
In this paper, we present an approach for speaker change detection in broadcast video using joint audio-visual scene change statistics. Our experiments indicate that using joint a...
Giridharan Iyengar, Chalapathy Neti, Sankar Basu
ICIP
2004
IEEE
15 years 11 months ago
Statistical transformations of frontal models for non-frontal face verification
In the framework of a face verification system using local features and a Gaussian Mixture Model based classifier, we address the problem of non-frontal face verification (when on...
Conrad Sanderson, Samy Bengio
ICASSP
2008
IEEE
15 years 4 months ago
Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons
Phoneme segmentation is a fundamental problem in many speech recognition and synthesis studies. Unsupervised phoneme segmentation assumes no knowledge on linguistic contents and a...
Yu Qiao, Naoya Shimomura, Nobuaki Minematsu
ICMCS
2006
IEEE
166views Multimedia» more  ICMCS 2006»
15 years 3 months ago
Towards Robust Intuitive Vision-Based User Interfaces
In future videocommunication services, the user’s communication device, such as PC, laptop, PDA or mobile phone is equipped with new interaction modalities. These can be cameras...
Oliver Schreer, Peter Eisert, Peter Kauff, Ralf Ta...