Spatial language video retrieval is an important real-world problem that is also a natural test bed for evaluating semantic structures for natural language descriptions of motion ...
This paper presents a max margin framework on image annotation and multimodal image retrieval as a structured prediction model. Following the max margin approach the image retriev...
Zhen Guo, Zhongfei Zhang, Eric P. Xing, Christos F...
In this paper, we present a novel multi-modal framework for semantic event extraction from basketball games based on webcasting text and broadcast video. We propose novel approach...
Space requirement for storing indexes and performance for query processing are two critical issues in music information retrieval (MIR) system. To overcome difficulties in variabl...
Many documentary videos use background music to help structure the content and communicate the semantic. In this paper, we investigate semantic segmentation of documentary video u...