Sciweavers

57 search results - page 10 / 12
» A multimodal approach to music transcription
Sort
View
MLMI
2004
Springer
13 years 11 months ago
Shallow Dialogue Processing Using Machine Learning Algorithms (or Not)
This paper presents a shallow dialogue analysis model, aimed at human-human dialogues in the context of staff or business meetings. Four components of the model are defined, and ...
Andrei Popescu-Belis, Alexander Clark, Maria Georg...
ICASSP
2010
IEEE
13 years 6 months ago
Transcription-based video genre classification
In this paper, we present a new method for video genre identification based on the linguistic content analysis. This approach relies on the analysis of the most frequent words in...
Stanislas Oger, Mickael Rouvier, Georges Linares
MM
2009
ACM
169views Multimedia» more  MM 2009»
14 years 15 days ago
Visual speaker localization aided by acoustic models
The following paper presents a novel audio-visual approach for unsupervised speaker locationing. Using recordings from a single, low-resolution room overview camera and a single f...
Gerald Friedland, Chuohao Yeo, Hayley Hung
TASLP
2008
115views more  TASLP 2008»
13 years 5 months ago
Recognition of Dialogue Acts in Multiparty Meetings Using a Switching DBN
Abstract--This paper is concerned with the automatic recognition of dialogue acts (DAs) in multiparty conversational speech. We present a joint generative model for DA recognition ...
Alfred Dielmann, Steve Renals
COMSIS
2010
13 years 3 months ago
Multi-video summarization using complex graph clustering and mining
Multi-video summarization is a great theoretical and technical challenge due to the wider diversity of topics in multi-video than singlevideo as well as the multi-modality nature o...
Jian Shao, Dongming Jiang, Mengru Wang, Hong Chen,...