We analyze the computational problem of multi-object tracking in video sequences. We formulate the problem using a cost function that requires estimating the number of tracks, as ...
We present a distributed representation of pose and appearance of people called the “poselet activation vector”. First we show that this representation can be used to estimate...
This paper addresses the problem of learning similaritypreserving binary codes for efficient retrieval in large-scale image collections. We propose a simple and efficient altern...
Traditional computer vision and machine learning algorithms have been largely studied in a centralized setting, where all the processing is performed at a single central location....
We describe a generative model of the relationship between two images. The model is defined as a factored threeway Boltzmann machine, in which hidden variables collaborate to de...
Joshua Susskind, Roland Memisevic, Geoffrey Hinton...