Sciweavers

35 search results - page 4 / 7
» Audio fingerprinting to identify multiple videos of an event
SEMCO
2008
IEEE
15 years 6 months ago
Disambiguating Sounds through Context
A central problem in automatic sound recognition is the mapping between low-level audio features and the meaningful content of an auditory scene. We propose a dynamic network mode...
Maria E. Niessen, Leendert van Maanen, Tjeerd C. A...
ICMI
2003
Springer
15 years 5 months ago
A multi-modal approach for determining speaker location and focus
This paper presents a multi-modal approach to locate a speaker in a scene and determine to whom he or she is speaking. We present a simple probabilistic framework that combines mu...
Michael Siracusa, Louis-Philippe Morency, Kevin Wi...
ICPR
2006
IEEE
16 years 1 month ago
Activity Discovery from Surveillance Videos
Multi-agent interactions often result in mutual occlusion sequences which constitute a visual signature for the event. We define six qualitative occlusion primitives based on the ...
Amitabha Mukerjee, K. S. Venkatesh, Pabitra Mitra,...
CVPR
2007
IEEE
16 years 2 months ago
Harmony in Motion
Cross-modal analysis offers information beyond that extracted from individual modalities. Consider a camcorder having a single microphone in a cocktail-party: it captures several ...
Zohar Barzelay, Yoav Y. Schechner
MM
1993
ACM
15 years 4 months ago
CMIFed: A Presentation Environment for Portable Hypermedia Documents
as a tree which specifies the presentation in an abstract, machine-independent way. This specification is created and edited using an authoring system; it is mapped to a particula...
Guido van Rossum, Jack Jansen, K. Sjoerd Mullender...