A video containing multiple objects in rotational and translational motion is analyzed through a combination of spatial and frequency domain representations. It is argued that the...
Abstract. Different strategies to learn user semantic queries from dissimilarity representations of video audio-visual content are presented. When dealing with large corpora of vi...
Many of the successful multimedia retrieval systems focus on developing efficient and effective video retrieval solutions with the help of appropriate index structures. In these ...
Video question answering aims to pinpoint answers in response to user's specified questions. However, most question answering technologies involve in integrating rich specifi...
Silhouette recognition can reconstruct the three-dimensional pose of a human subject in monocular video so long as the camera's view remains unoccluded by other objects. This ...