Abstract This paper presents an approach to designing and implementing extensible computational models for perceiving systems based on a knowledge-driven joint inference approach. ...
In this paper, we present a novel multi-modal framework for semantic event extraction from basketball games based on webcasting text and broadcast video. We propose novel approach...
With the advent of prosody annotation standards such as tones and break indices (ToBI), speech technologists and linguists alike have been interested in automatically detecting pro...
Sankaranarayanan Ananthakrishnan, Shrikanth S. Nar...
In this paper, we tackle the problem of understanding the temporal structure of complex events in highly varying videos obtained from the Internet. Towards this goal, we utilize a...
With recent advances in motion detection and tracking in video, more efforts are being directed at higher-level video analysis such as recognizing actions, events and activities. ...