Sciweavers

35 search results - page 4 / 7
» Audio fingerprinting to identify multiple videos of an event
Sort
View
SEMCO
2008
IEEE
15 years 5 months ago
Disambiguating Sounds through Context
A central problem in automatic sound recognition is the mapping between low-level audio features and the meaningful content of an auditory scene. We propose a dynamic network mode...
Maria E. Niessen, Leendert van Maanen, Tjeerd C. A...
ICMI
2003
Springer
93views Biometrics» more  ICMI 2003»
15 years 3 months ago
A multi-modal approach for determining speaker location and focus
This paper presents a multi-modal approach to locate a speaker in a scene and determine to whom he or she is speaking. We present a simple probabilistic framework that combines mu...
Michael Siracusa, Louis-Philippe Morency, Kevin Wi...
ICPR
2006
IEEE
15 years 11 months ago
Activity Discovery from Surveillance Videos
Multi-agent interactions often result in mutual occlusion sequences which constitute a visual signature for the event. We define six qualitative occlusion primitives based on the ...
Amitabha Mukerjee, K. S. Venkatesh, Pabitra Mitra,...
CVPR
2007
IEEE
16 years 18 days ago
Harmony in Motion
Cross-modal analysis offers information beyond that extracted from individual modalities. Consider a camcorder having a single microphone in a cocktail-party: it captures several ...
Zohar Barzelay, Yoav Y. Schechner
MM
1993
ACM
126views Multimedia» more  MM 1993»
15 years 2 months ago
CMIFed: A Presentation Environment for Portable Hypermedia Documents
as a tree which specifies the presentation in an abstract, machineindependent way. This specification is created and edited using an authoring system; it is mapped to a particula...
Guido van Rossum, Jack Jansen, K. Sjoerd Mullender...