This paper proposes a probabilistic search algorithm to boost the computational efficiency of face detection in video sequences. The algorithm sequentially predicts the probabili...
Atsushi Matsui, Simon Clippingdale, Takashi Matsum...
Abstract. Cooking is a human activity with sophisticated process. Underlying the multitude of culinary recipes, there exist a set of fundamental and general cooking techniques, suc...
Widespread current cameras are part of multisensory systems with an integrated computer (smartphones). Computer vision thus starts evolving to cross-modal sensing, where vision an...
In this paper, we present a Deformable Action Template
(DAT) model that is learnable from cluttered real-world
videos with weak supervisions. In our generative model,
an action ...
Existing research on news video analysis mainly concentrates on structure analysis, semantic concept detection, annotation and search. However, little work has been contributed to...