We present a novel framework for recognizing repetitive
sequential events that are performed by human actors and that exhibit
strong temporal dependencies and potential parallel overlap. Our
solutio...
Labeling image collections is a tedious task, especially
when multiple labels have to be chosen for each image. In
this paper, we introduce a new framework that extends state
of ...
Nicolas Loeff, Ali Farhadi, Ian Endres and David A...
We present an approach to visual tracking based on dividing a
target into multiple regions, or fragments. The target is represented
by a Gaussian mixture model in a joint feature...
In this paper, we introduce a new approach for modeling
visual context. For this purpose, we consider the leaves of a
hierarchical segmentation tree as elementary units. Each
le...
Joseph J. Lim, Pablo Arbelaez, Chunhui Gu, and Jit...
The development of user interfaces based on vision and speech requires the solution of a challenging statistical inference problem: The intentions and actions of multiple individu...