Movie segmentation into scenes and chapters using locally weighted bag of visual words

15 years 10 months ago

Download www.cs.uoi.gr

Movies segmentation into semantically correlated units is a quite tedious task due to ”semantic gap”. Low-level features do not provide useful information about the semantical correlation between shots and usually fail to detect scenes with constantly dynamic content. In the method we propose herein, local invariant descriptors are used to represent the key-frames of video shots and a visual vocabulary is created from these descriptors resulting to a visual words histogram representation (bag of visual words) for each shot. A key aspect of our method is that, based on an idea from text segmentation, the histograms of visual words corresponding to each shot are further smoothed temporally by taking into account the histograms of neighboring shots. In this way, valuable contextual information is preserved. The ﬁnal scene and chapter boundaries are determined at the local maxima of the diﬀerence of successive smoothed histograms for low and high values of the smoothing parameter ...

Vasileios Chasanis, Argyris Kalogeratos, Aristidis

Real-time Traffic

CIVR 2009 | Histograms | Local Invariant Descriptors | Multimedia | Visual Words |

claim paper

Post Info
More Details (n/a)

Added	26 May 2010
Updated	26 May 2010
Type	Conference
Year	2009
Where	CIVR
Authors	Vasileios Chasanis, Argyris Kalogeratos, Aristidis Likas

Comments (0)

Sciweavers

Movie segmentation into scenes and chapters using locally weighted bag of visual words

CIVR 2009 | Histograms | Local Invariant Descriptors | Multimedia | Visual Words |

Explore & Download

Productivity Tools

Sciweavers