In this paper we investigate how a small number of high-level concepts derived for video shots, such as Sports, Face, Indoor, etc., can be used effectively for ad hoc search in vi...
The human face is one of the most important objects in videos since it provides rich information for spotting certain people of interest, such as government leaders in news video,...
Thao Ngoc Nguyen, Thanh Duc Ngo, Duy-Dinh Le, Shin...
This paper proposes a generic method for action recognition in uncontrolled videos. The idea is to use images collected from the Web to learn representations of actions and use ...
Nazli Ikizler-Cinbis, R. Gokberk Cinbis, Stan Scla...
Bag-of-features (BoF) derived from local keypoints has recently shown promise for object and scene classification. Whether BoF can naturally survive the challenges such as ...
We present a first system to semi-automatically create a visual representation for a given, short text. We first parse the input text, decompose it into suitable units,...
Katharina Schwarz, Pavel Rojtberg, Joachim Caspar,...