Sciweavers

ICCV
2011
IEEE

Ask the locals: multi-way local pooling for image recognition

12 years 4 months ago
Ask the locals: multi-way local pooling for image recognition
Invariant representations in object recognition systems are generally obtained by pooling feature vectors over spatially local neighborhoods. But pooling is not local in the feature vector space, so that widely dissimilar features may be pooled together if they are in nearby locations. Recent approaches rely on sophisticated encoding methods and more specialized codebooks (or dictionaries), e.g., learned on subsets of descriptors which are close in feature space, to circumvent this problem. In this work, we argue that a common trait found in much recent work in image recognition or retrieval is that it leverages locality in feature space on top of purely spatial locality. We propose to apply this idea in its simplest form to an object recognition system based on the spatial pyramid framework, to increase the performance of small dictionaries with very little added engineering. Stateof-the-art results on several object recognition benchmarks show the promise of this approach.
Y-Lan Boureau, Nicolas Le Roux, francis bach, Jean
Added 11 Dec 2011
Updated 11 Dec 2011
Type Journal
Year 2011
Where ICCV
Authors Y-Lan Boureau, Nicolas Le Roux, francis bach, Jean Ponce, Yann LeCun
Comments (0)