Empowering Visual Categorization With the GPU

14 years 11 months ago

Download staff.science.uva.nl

—Visual categorization is important to manage large collections of digital images and video, where textual metadata is often incomplete or simply unavailable. The bag-of-words model has become the most powerful method for visual categorization of images and video. Despite its high accuracy, a severe drawback of this model is its high computational cost. As the trend to increase computational power in newer CPU and GPU architectures is to increase their level of parallelism, exploiting this parallelism becomes an important direction to handle the computational cost of the bag-of-words approach. When optimizing a system based on the bag-of-words approach, the goal is to minimize the time it takes to process batches of images. In this paper, we analyze the bag-of-words model for visual categorization in terms of computational cost and identify two major bottlenecks: the quantization step and the classiﬁcation step. We address these two bottlenecks by proposing two efﬁcient algorithm...

Koen E. A. van de Sande, Theo Gevers, Cees G. M. S

Real-time Traffic

Computational Cost | TMM 2011 | Video | Visual Categorization |

claim paper

Post Info
More Details (n/a)

Added	15 May 2011
Updated	15 May 2011
Type	Journal
Year	2011
Where	TMM
Authors	Koen E. A. van de Sande, Theo Gevers, Cees G. M. Snoek

Comments (0)

Sciweavers

Empowering Visual Categorization With the GPU

Computational Cost | TMM 2011 | Video | Visual Categorization |

Explore & Download

Productivity Tools

Sciweavers