The most expressive way humans display emotions is through facial expressions. In this work we report on several advances we have made in building a system for classification of f...
Ira Cohen, Nicu Sebe, Yafei Sun, Michael S. Lew, T...
This paper presents a multi-modal approach to locate a speaker in a scene and determine to whom he or she is speaking. We present a simple probabilistic framework that combines mu...
Michael Siracusa, Louis-Philippe Morency, Kevin Wi...
In several video surveillance applications, such as the detection of abandoned/stolen objects or parked vehicles, the detection of stationary foreground objects is a critical task...
Many multimedia presentation applications involve retrieval of objects from more than one collaborating server. Presentations of objects from different collaborating serv...
Motion estimation is the most time-consuming subsystem in a video codec. Thus, more efficient methods of motion estimation should be investigated. Real video sequences usually exh...