CIVR
2006
Springer

Annotating News Video with Locations

Abstract. The location of video scenes is an important semantic descriptor, especially for broadcast news video. In this paper, we propose a learning-based approach to annotate shots of news video with locations extracted from the video transcript, based on features from multiple video modalities, including the syntactic structure of transcript sentences, speaker identity, and temporal video structure. Machine learning algorithms are adopted to combine the multi-modal features to solve two sub-problems: (1) whether the location of a video shot is mentioned in the transcript, and if so, (2) which of the many locations in the transcript are the correct one(s) for this shot. Experiments on the TRECVID dataset demonstrate that our approach achieves approximately 85% accuracy in correctly labeling the location of any shot in news video.
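
The two sub-problems in the abstract can be pictured as a two-stage decision. The Python sketch below is only an illustration under stated assumptions, not the authors' implementation: the feature names, the weights, and the 0.5 threshold are all hypothetical, and a fixed logistic scoring function stands in for whatever learned classifiers the paper actually uses.

```python
import math
from dataclasses import dataclass
from typing import Dict, List


def sigmoid(z: float) -> float:
    """Map a real-valued score to the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def score(features: Dict[str, float], weights: Dict[str, float], bias: float) -> float:
    """Logistic score of a feature dictionary under a fixed linear model."""
    z = bias + sum(weights.get(name, 0.0) * value for name, value in features.items())
    return sigmoid(z)


# Hypothetical, hand-set weights standing in for learned model parameters.
# Stage 1: is the shot's location mentioned in the transcript at all?
STAGE1_WEIGHTS = {"anchor_speaking": -2.0, "location_in_nearby_sentence": 3.0}
STAGE1_BIAS = -0.5
# Stage 2: is this particular candidate location correct for the shot?
STAGE2_WEIGHTS = {"in_prepositional_phrase": 1.5, "same_story": 2.0, "sentence_distance": -1.0}
STAGE2_BIAS = -1.0


@dataclass
class Candidate:
    """A location name from the transcript with its (hypothetical) features."""
    name: str
    features: Dict[str, float]


def annotate_shot(
    shot_features: Dict[str, float],
    candidates: List[Candidate],
    threshold: float = 0.5,
) -> List[str]:
    """Return the candidate locations accepted for one shot (possibly none)."""
    # Sub-problem (1): decide whether any transcript location applies.
    if score(shot_features, STAGE1_WEIGHTS, STAGE1_BIAS) < threshold:
        return []
    # Sub-problem (2): keep every candidate that scores above the threshold.
    return [
        c.name
        for c in candidates
        if score(c.features, STAGE2_WEIGHTS, STAGE2_BIAS) >= threshold
    ]


candidates = [
    Candidate("Baghdad", {"in_prepositional_phrase": 1.0, "same_story": 1.0, "sentence_distance": 0.0}),
    Candidate("Washington", {"in_prepositional_phrase": 0.0, "same_story": 0.0, "sentence_distance": 2.0}),
]
field_shot = {"anchor_speaking": 0.0, "location_in_nearby_sentence": 1.0}
studio_shot = {"anchor_speaking": 1.0, "location_in_nearby_sentence": 0.0}

print(annotate_shot(field_shot, candidates))
print(annotate_shot(studio_shot, candidates))
```

Splitting the decision this way means a shot whose location is never mentioned (for example, a studio shot of the anchor) is rejected before any candidate is ranked, so the second stage only has to discriminate among locations for shots that plausibly have one.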
Jun Yang 0003, Alexander G. Hauptmann
Added 20 Aug 2010
Updated 20 Aug 2010
Type Conference
Year 2006
Where CIVR