Sciweavers

CIVR
2009
Springer

Movie segmentation into scenes and chapters using locally weighted bag of visual words

13 years 11 months ago
Movie segmentation into scenes and chapters using locally weighted bag of visual words
Movies segmentation into semantically correlated units is a quite tedious task due to ”semantic gap”. Low-level features do not provide useful information about the semantical correlation between shots and usually fail to detect scenes with constantly dynamic content. In the method we propose herein, local invariant descriptors are used to represent the key-frames of video shots and a visual vocabulary is created from these descriptors resulting to a visual words histogram representation (bag of visual words) for each shot. A key aspect of our method is that, based on an idea from text segmentation, the histograms of visual words corresponding to each shot are further smoothed temporally by taking into account the histograms of neighboring shots. In this way, valuable contextual information is preserved. The final scene and chapter boundaries are determined at the local maxima of the difference of successive smoothed histograms for low and high values of the smoothing parameter ...
Vasileios Chasanis, Argyris Kalogeratos, Aristidis
Added 26 May 2010
Updated 26 May 2010
Type Conference
Year 2009
Where CIVR
Authors Vasileios Chasanis, Argyris Kalogeratos, Aristidis Likas
Comments (0)