This paper presents a method for visual object categorization based on encoding the joint textural information in objects and the surrounding background, and requiring no segmenta...
Alireza Tavakoli Targhi, Andrzej Pronobis, Heydar ...
In this paper, we address the problem of 3D articulated multi-person tracking in busy street scenes from a moving, human-level observer. In order to handle the complexity of multi-...
Stephan Gammeter, Andreas Ess, Tobias Jaeggli, Kon...
Media forensics tries to determine the originating device of a signal. We apply this paradigm to microphone forensics, determining the microphone model used to record a given audio...
Kernel descriptors provide a unified way to generate rich visual feature sets by turning pixel attributes into patch-level features, and yield impressive results on many object rec...
Liefeng Bo, Kevin Lai, Xiaofeng Ren and Dieter Fox
We present algorithms for automatic feature selection for unsupervised structure discovery from video sequences. Feature selection in this scenario is hard because of the absence ...