Combining multiple classifiers is of particular interest in multimedia applications. Each modality in multimedia data can be analyzed individually, and combining multiple pieces of...
The paper presents our initial attempts at building an audio-video emotion recognition system. Both the audio and video sub-systems are discussed, and a description of the database of s...
Rok Gajsek, Vitomir Struc, Simon Dobrisek, Janez Z...
In this paper, we introduce a first-order probabilistic model that combines multiple cues to classify human activities from video data accurately and robustly. Our system works in...
In human-robot communication, it is often important to relate robot sensor readings to concepts used by humans. We believe that access to semantic maps will make it poss...
Martin Persson, Tom Duckett, Christoffer Valgren, ...
In this paper, two multimodal systems for the tracking of multiple users in smart environments are presented. The first is a multiview particle filter tracker using foreground, c...