Sciweavers

21 search results - page 4 / 5
» Audio-Visual Speech Fusion Using Coupled Hidden Markov Model...
Sort
View
ICMCS
2005
IEEE
173views Multimedia» more  ICMCS 2005»
13 years 11 months ago
A Multi-Modal Mixed-State Dynamic Bayesian Network for Robust Meeting Event Recognition from Disturbed Data
In this work we present a novel multi-modal mixed-state dynamic Bayesian network (DBN) for robust meeting event classification. The model uses information from lapel microphones,...
Marc Al-Hames, Gerhard Rigoll
PAMI
2010
218views more  PAMI 2010»
13 years 11 days ago
A Coupled Duration-Focused Architecture for Real-Time Music-to-Score Alignment
Abstract--The capacity for realtime synchronization and coordination is a common ability among trained musicians performing a music score that presents an interesting challenge for...
Arshia Cont
ISM
2008
IEEE
136views Multimedia» more  ISM 2008»
14 years 23 min ago
Multimodal Speaker Segmentation in Presence of Overlapped Speech Segments
We propose a multimodal speaker segmentation algorithm with two main contributions: First, we suggest a hidden Markov model architecture that performs fusion of the three modaliti...
Viktor Rozgic, Kyu Jeong Han, Panayiotis G. Georgi...
ICMI
2010
Springer
172views Biometrics» more  ICMI 2010»
13 years 3 months ago
Modelling and analyzing multimodal dyadic interactions using social networks
Social network analysis became a common technique used to model and quantify the properties of social interactions. In this paper, we propose an integrated framework to explore th...
Sergio Escalera, Petia Radeva, Jordi Vitrià...
CORR
2004
Springer
195views Education» more  CORR 2004»
13 years 5 months ago
Detecting User Engagement in Everyday Conversations
This paper presents a novel application of speech emotion recognition: estimation of the level of conversational engagement between users of a voice communication system. We begin...
Chen Yu, Paul M. Aoki, Allison Woodruff