In traditional peer-to-peer search networks, operations focus on properly labeled files such as music or video, and the actual search is often limited to text tags. The explosive ...
This paper proposes a probabilistic search algorithm to boost the computational efficiency of face detection in video sequences. The algorithm sequentially predicts the probabili...
Atsushi Matsui, Simon Clippingdale, Takashi Matsum...
Abstract. Cooking is a human activity with a sophisticated process. Underlying the multitude of culinary recipes, there exists a set of fundamental and general cooking techniques, suc...
Today's widespread cameras are part of multisensory systems with an integrated computer (smartphones). Computer vision is thus evolving toward cross-modal sensing, where vision an...
In this paper, we present a Deformable Action Template (DAT) model that is learnable from cluttered real-world videos with weak supervision. In our generative model, an action ...