In this paper, we present a systematic framework for re-cognizing realistic actions from videos “in the wild.” Such unconstrained videos are abundant in personal collections as...
Jingen Liu (University of Central Florida), Jiebo ...
Scalable image retrieval systems usually involve hierarchical quantization of local image descriptors, which produces a visual vocabulary for inverted indexing of images. Although ...
Rongrong Ji (Harbin Institute of Technology), Xing...
We address the classic problems of detection, segmenta-
tion and pose estimation of people in images with a novel
definition of a part, a poselet. We postulate two criteria
(1) ...
Scene ow is the 3D motion eld of points in the world. Given N (N > 1) image sequences gather ed with a N-eye stereo camera or N calibrated cameras, we present a novel system wh...
This paper presents a multi-scale generative model for representing animate shapes and extracting meaningful parts of objects. The model assumes that animate shapes (2D simple clo...