Sciweavers

115 search results - page 1 / 23
» Information Access using Speech, Speaker and Face Recognitio...
Sort
View
ICMCS
2000
IEEE
134views Multimedia» more  ICMCS 2000»
13 years 9 months ago
Information Access using Speech, Speaker and Face Recognition
We describe a scheme to combine the results of audio and face identification for multimedia indexing and retrieval. Audio analysis consists of speech and speaker recognition deri...
Mahesh Viswanathan, Homayoon S. M. Beigi, Alain Tr...
ICASSP
2010
IEEE
13 years 4 months ago
Visual emotion recognition using compact facial representations and viseme information
Emotion expression is an essential part of human interaction. Rich emotional information is conveyed through the human face. In this study, we analyze detailed motion-captured fac...
Angeliki Metallinou, Carlos Busso, Sungbok Lee, Sh...
SEMCO
2009
IEEE
13 years 11 months ago
Enhanced Multimedia Content Access and Exploitation Using Semantic Speech Retrieval
—Techniques for automatic annotation of spoken content making use of speech recognition technology have long been characterized as holding unrealized promise to provide access to...
Roeland Ordelman, Franciska de Jong, Martha Larson
ICASSP
2011
IEEE
12 years 8 months ago
Voxel-based Viterbi Active Speaker Tracking (V-VAST) with best view selection for video lecture post-production
An automated system is presented for reducing a multi-view lecture recording into a single view video containing a best view summary of active speakers. The system uses skin color...
Damien Kelly, Anil Kokaram, Frank Boland
ICASSP
2010
IEEE
13 years 4 months ago
Speaker identification by combining MFCC and phase information in noisy environments
In conventional speaker recognition methods based on MFCC, the phase information has been ignored. Recently, we proposed a method that integrated MFCC with the phase information o...
Longbiao Wang, Kazue Minami, Kazumasa Yamamoto, Se...