Sciweavers

» Fusion of audio and visual cues for laughter detection
MM
2006
ACM
13 years 11 months ago
Multimodal fusion using learned text concepts for image categorization
Conventional image categorization techniques primarily rely on low-level visual cues. In this paper, we describe a multimodal fusion scheme which improves the image classification...
Qiang Zhu, Mei-Chen Yeh, Kwang-Ting Cheng
ICASSP
2011
IEEE
12 years 9 months ago
Improving acoustic event detection using generalizable visual features and multi-modality modeling
Acoustic event detection (AED) aims to identify both timestamps and types of multiple events and has been found to be very challenging. The cues for these events often exist...
Po-Sen Huang, Xiaodan Zhuang, Mark Hasegawa-Johnso...
AAAI
2008
13 years 8 months ago
Unstructured Audio Classification for Environment Recognition
My thesis aims to contribute towards building autonomous agents that are able to understand their surrounding environment through the use of both audio and visual information. To ...
Selina Chu
ICMCS
2008
IEEE
14 years 6 days ago
Automatic character identification in feature-length films
This paper presents a novel approach to automatically identify characters in films using audio-visual cues and text analysis. The approach consists of three stages: (i) frontal f...
Yifan Zhang, Changsheng Xu, Hanqing Lu
PAA
2006
13 years 5 months ago
Audio-visual sports highlights extraction using Coupled Hidden Markov Models
We present our studies on the application of Coupled Hidden Markov Models (CHMMs) to sports highlights extraction from broadcast video using both audio and video information. First,...
Ziyou Xiong