Sciweavers

34 search results - page 4 / 7
» Fusion of audio and visual cues for laughter detection
ICPR
2002
IEEE
14 years 6 months ago
Boosting and Structure Learning in Dynamic Bayesian Networks for Audio-Visual Speaker Detection
Bayesian networks are an attractive modeling tool for human sensing, as they combine an intuitive graphical representation with efficient algorithms for inference and learning. Ear...
Tanzeem Choudhury, James M. Rehg, Vladimir Pavlovi...
CVPR
2005
IEEE
13 years 11 months ago
Audio-Visual Affect Recognition through Multi-Stream Fused HMM for HCI
Advances in computer processing power and emerging algorithms are allowing new ways of envisioning Human Computer Interaction. This paper focuses on the development of a computing...
Zhihong Zeng, Jilin Tu, Brian Pianfetti, Ming Liu,...
ICASSP
2009
IEEE
14 years 16 days ago
Audio-assisted trajectory estimation in non-overlapping multi-camera networks
We present an algorithm to improve trajectory estimation in networks of non-overlapping cameras using audio measurements. The algorithm fuses audiovisual cues in each camera's ...
Murtaza Taj, Andrea Cavallaro
ICASSP
2010
IEEE
13 years 4 months ago
Speech/Non-Speech Detection in Meetings from Automatically Extracted Low Resolution Visual Features
In this paper we address the problem of estimating who is speaking from automatically extracted low resolution visual cues in group meetings. Traditionally, the task of speech/non...
Hayley Hung, Sileye O. Ba
ICASSP
2009
IEEE
14 years 16 days ago
Multi-modal activity and dominance detection in smart meeting rooms
In this paper a new approach for activity and dominance modeling in meetings is presented. For this purpose low level acoustic and visual features are extracted from audio and vid...
Benedikt Hörnler, Gerhard Rigoll