Sciweavers

TMM
2008

Multimodal News Story Clustering With Pairwise Visual Near-Duplicate Constraint

13 years 3 months ago
Multimodal News Story Clustering With Pairwise Visual Near-Duplicate Constraint
Story clustering is a critical step for news retrieval, topic mining, and summarization. Nonetheless, the task remains highly challenging owing to the fact that news topics exhibit clusters of varying densities, shapes, and sizes. Traditional algorithms are found to be ineffective in mining these types of clusters. This paper offers a new perspective by exploring the pairwise visual cues deriving from near-duplicate keyframes (NDK) for constraint-based clustering. We propose a constraint-driven co-clustering algorithm (CCC), which utilizes the near-duplicate constraints built on top of text, to mine topic-related stories and the outliers. With CCC, the duality between stories and their underlying multimodal features is exploited to transform features in low-dimensional space with normalized cut. The visual constraints are added directly to this new space, while the traditional DBSCAN is revisited to capitalize on the availability of constraints and the reduced dimensional space. We mod...
Xiao Wu, Chong-Wah Ngo, Alexander G. Hauptmann
Added 15 Dec 2010
Updated 15 Dec 2010
Type Journal
Year 2008
Where TMM
Authors Xiao Wu, Chong-Wah Ngo, Alexander G. Hauptmann
Comments (0)