Sciweavers

MM
2006
ACM
124views Multimedia» more  MM 2006»
13 years 10 months ago
3WNews: who, where, and when in news video
We describe 3WNews as a novel system for browsing news video by the people (who) and locations (where) appearing in the footage as well as the time (when) of news events. The peop...
Jun Yang 0003, Alexander G. Hauptmann
MM
2006
ACM
168views Multimedia» more  MM 2006»
13 years 10 months ago
Scalability of local image descriptors: a comparative study
Computer vision researchers have recently proposed several local descriptor schemes. Due to lack of database support, however, these descriptors have only been evaluated using sma...
Herwig Lejsek, Friðrik Heiðar Ásmun...
MM
2006
ACM
96views Multimedia» more  MM 2006»
13 years 10 months ago
Takashi's seasons
Takashi’s Seasons is a sequential live shadow puppet/video performance in which a number of interpretations of the four seasons are performed by an artist. Controlled with fishi...
Takashi Kawashima, Togo Kida, Yoshimasa Niwa
MM
2006
ACM
330views Multimedia» more  MM 2006»
13 years 10 months ago
Visual attention detection in video sequences using spatiotemporal cues
Human vision system actively seeks interesting regions in images to reduce the search effort in tasks, such as object detection and recognition. Similarly, prominent actions in v...
Yun Zhai, Mubarak Shah
MM
2006
ACM
145views Multimedia» more  MM 2006»
13 years 10 months ago
Interactive mosaic generation for video navigation
Navigation through large multimedia collections that include videos and images still remains a hard problem. In this paper, we introduce a novel method to visualize and navigate t...
Kihwan Kim, Irfan A. Essa, Gregory D. Abowd
MM
2006
ACM
157views Multimedia» more  MM 2006»
13 years 10 months ago
Taking sides: dynamic text and hip-hop performance
In this paper we describe Taking Sides, a performance using a real-time speech visualization software system called TextEngine. Taking Sides is a collaboration between our researc...
Jason Lewis, Yannick Assogba
MM
2006
ACM
175views Multimedia» more  MM 2006»
13 years 10 months ago
Real-time automatic 3D scene generation from natural language voice and text descriptions
Automatic scene generation using voice and text offers a unique multimedia approach to classic storytelling and human computer interaction with 3D graphics. In this paper, we pre...
Lee M. Seversky, Lijun Yin
MM
2006
ACM
114views Multimedia» more  MM 2006»
13 years 10 months ago
Eye/gaze tracking in web, image and video documents
Chabane Djeraba, Stanislas Lew, Dan A. Simovici, S...
MM
2006
ACM
151views Multimedia» more  MM 2006»
13 years 10 months ago
Choreographic buttons: promoting social interaction through human movement and clear affordances
We used human movement as the basis for designing a collaborative aesthetic design environment. Our intention was to promote social interaction and creative expression. We employe...
Andrew Webb, Andruid Kerne, Eunyee Koh, Pranesh Jo...
MM
2006
ACM
175views Multimedia» more  MM 2006»
13 years 10 months ago
The challenge problem for automated detection of 101 semantic concepts in multimedia
We introduce the challenge problem for generic video indexing to gain insight in intermediate steps that affect performance of multimedia analysis methods, while at the same time...
Cees Snoek, Marcel Worring, Jan van Gemert, Jan-Ma...