Sciweavers

12 search results - page 1 / 3
ISM
2008
IEEE
136 views, Multimedia
13 years 10 months ago
Multimodal Speaker Segmentation in Presence of Overlapped Speech Segments
We propose a multimodal speaker segmentation algorithm with two main contributions: First, we suggest a hidden Markov model architecture that performs fusion of the three modaliti...
Viktor Rozgic, Kyu Jeong Han, Panayiotis G. Georgi...
ICMCS
2005
IEEE
103 views, Multimedia
13 years 9 months ago
Using spatial cues for meeting speech segmentation
This work investigates the validity and accuracy of using spatial cues with Time-Delay Estimation (TDE) as a method of segmenting multichannel recorded speech by speaker location....
Eva Cheng, Jason Lukasiak, Ian S. Burnett, David S...
LREC
2008
95 views, Education
13 years 5 months ago
Annotation and analysis of overlapping speech in political interviews
To better understand spontaneous speech-related phenomena and to improve automatic speech recognition (ASR), we present here a study on the relationship between t...
Martine Adda-Decker, Claude Barras, Gilles Adda, P...
ICDAR
2005
IEEE
13 years 9 months ago
From Searching to Browsing through Multimodal Documents Linking
Relationships that link static documents discussed during meetings to the corresponding speech transcripts can be of various kinds. The most important ones, thematic links, quotat...
Dalila Mekhaldi, Denis Lalanne, Rolf Ingold
ICMCS
2007
IEEE
144 views, Multimedia
13 years 10 months ago
Analysis, User Interface, and their Evaluation for Student Presentation Videos
In the domain of candidly-captured student presentation videos, we examine and evaluate approaches for multimodal analysis and indexing of audio and video. We apply visual segment...
Alexander Haubold, John R. Kender