This paper proposes a generic method for action recognition in uncontrolled videos. The idea is to use images collected from the Web to learn representations of actions and use ...
Nazli Ikizler-Cinbis, R. Gokberk Cinbis, Stan Scla...
We present a method to automatically localize captions in JPEG compressed images and the I-frames of MPEG compressed videos. Caption text regions are segmented from background ima...
ion when we annotate content. This therefore requires us to investigate and model video semantics. Because of the type and volume of data, general-purpose approaches are likely to ...
Easy-to-use audio/video authoring tools play a crucial role in moving multimedia software from research curiosity to mainstream applications. However, research in multimedia author...
Augmented reality (AR) provides an intuitive user interface to present information in the context of the real world. A common application is to overlay screen-aligned annotations f...