We propose the use of attentional cascades based on the DCT and motion information contained in an MPEG coded stream. An attentional cascade is a sequence of very efficient class...
Digital video applications exploit the intrinsic structure of video sequences. In order to obtain and represent this structure for video annotation and indexing tasks, the main ini...
Over the last decade, the availability of public image repositories and recognition benchmarks has enabled rapid progress in visual object category and instance detection. Today we...
Kevin Lai, Liefeng Bo, Xiaofeng Ren and Dieter Fox
An automatic algorithm for indexing dialogue scenes in multimedia content is proposed. The content is segmented into dialogue scenes using the state transitions of a hidden Markov...
This paper addresses the problem of adapting a generic 3D face model to a human face of which the frontal and profile views are given. Assuming that a set of feature points have b...